From ronvach at abacom.com Thu Apr 16 20:01:41 2015 Content-Type: multipart/mixed; boundary="===============3427236417631410152==" MIME-Version: 1.0 From: Ron V To: users at ovirt.org Subject: [ovirt-users] oVirt 3.5.2 Hypervisor crash, guest VMs not migrating Date: Thu, 16 Apr 2015 20:01:38 -0400 Message-ID: <55304D62.7040905@abacom.com> --===============3427236417631410152== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: quoted-printable Hello, I am testing guest VM migration in the event of a host crash, and I am = surprised to see that guests that are selected to be highly available do = not migrate when a host is forcibly turned off. I have a 2 host cluster using iSCSI for storage, and when one of the = hosts, either the SPM or normal, is forcibly turned off, although the = engine sees the host as non-responsive, the VMs that were running on it = remain on that crashed host, and a question mark (?) appears next to = them. Other than checking "highly available", is there another step = that needs to be made for a VM to be restarted on a working host should = the host it is running on fail? Thanks, R. --===============3427236417631410152==-- From Ernest.Beinrohr at axonpro.sk Fri Apr 17 01:30:03 2015 Content-Type: multipart/mixed; boundary="===============7005985096185594898==" MIME-Version: 1.0 From: Ernest Beinrohr To: users at ovirt.org Subject: Re: [ovirt-users] oVirt 3.5.2 Hypervisor crash, guest VMs not migrating Date: Fri, 17 Apr 2015 07:29:58 +0200 Message-ID: <55309A56.6080205@axonpro.sk> In-Reply-To: 55304D62.7040905@abacom.com --===============7005985096185594898== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: quoted-printable This is a multi-part message in MIME format. --------------090508000604030501070409 Content-Type: text/plain; charset=3DUTF-8; format=3Dflowed Content-Transfer-Encoding: 8bit D=C5=88a 17.04.2015 o 02:01 Ron V nap=C3=ADsal(a): > Hello, > > I am testing guest VM migration in the event of a host crash, and I am = > surprised to see that guests that are selected to be highly available = > do not migrate when a host is forcibly turned off. > > I have a 2 host cluster using iSCSI for storage, and when one of the = > hosts, either the SPM or normal, is forcibly turned off, although the = > engine sees the host as non-responsive, the VMs that were running on = > it remain on that crashed host, and a question mark (?) appears next = > to them. Other than checking "highly available", is there another = > step that needs to be made for a VM to be restarted on a working host = > should the host it is running on fail? > Migration does not work when the host crashes. If you have fencing = enabled, the engine can restart the dead host and start the vms on = another host. This works for us. But you have to have power management = for the hosts. -- = Ernest Beinrohr, AXON PRO Ing , RHCE = , RHCVA = , LPIC = , VCA , +421-2-62410360 +421-903-482603 --------------090508000604030501070409 Content-Type: text/html; charset=3DUTF-8 Content-Transfer-Encoding: 8bit
D=C5=88a 17.04.2015 o 02:01 Ron V nap=C3=ADsal(a):
He= llo,

I am testing guest VM migration in the event of a host crash, and I am surprised to see that guests that are selected to be highly available do not migrate when a host is forcibly turned off.

I have a 2 host cluster using iSCSI for storage, and when one of the hosts, either the SPM or normal, is forcibly turned off, although the engine sees the host as non-responsive, the VMs that were running on it remain on that crashed host, and a question mark (?) appears next to them.=C2=A0 Other than checking "highly available", is there another step that needs to be made for a VM to be restarted on a working host should the host it is running on fail?

Migration does not work when the host crashes. If you have fencing enabled, the engine can restart the dead host and start the vms on another host. This works for us. But you have to have power management for the hosts.
--
Ernest Beinrohr, AXON PRO
Ing, RHCE, RHCVA, LPIC, VCA,
+421-2-62410360 +421-903-482603
--------------090508000604030501070409-- --===============7005985096185594898== Content-Type: multipart/alternative MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: attachment; filename="attachment.bin" VGhpcyBpcyBhIG11bHRpLXBhcnQgbWVzc2FnZSBpbiBNSU1FIGZvcm1hdC4KLS0tLS0tLS0tLS0t LS0wOTA1MDgwMDA2MDQwMzA1MDEwNzA0MDkKQ29udGVudC1UeXBlOiB0ZXh0L3BsYWluOyBjaGFy c2V0PVVURi04OyBmb3JtYXQ9Zmxvd2VkCkNvbnRlbnQtVHJhbnNmZXItRW5jb2Rpbmc6IDhiaXQK CkTFiGEgMTcuMDQuMjAxNSBvIDAyOjAxIFJvbiBWIG5hcMOtc2FsKGEpOgo+IEhlbGxvLAo+Cj4g SSBhbSB0ZXN0aW5nIGd1ZXN0IFZNIG1pZ3JhdGlvbiBpbiB0aGUgZXZlbnQgb2YgYSBob3N0IGNy YXNoLCBhbmQgSSBhbSAKPiBzdXJwcmlzZWQgdG8gc2VlIHRoYXQgZ3Vlc3RzIHRoYXQgYXJlIHNl bGVjdGVkIHRvIGJlIGhpZ2hseSBhdmFpbGFibGUgCj4gZG8gbm90IG1pZ3JhdGUgd2hlbiBhIGhv c3QgaXMgZm9yY2libHkgdHVybmVkIG9mZi4KPgo+IEkgaGF2ZSBhIDIgaG9zdCBjbHVzdGVyIHVz aW5nIGlTQ1NJIGZvciBzdG9yYWdlLCBhbmQgd2hlbiBvbmUgb2YgdGhlIAo+IGhvc3RzLCBlaXRo ZXIgdGhlIFNQTSBvciBub3JtYWwsIGlzIGZvcmNpYmx5IHR1cm5lZCBvZmYsIGFsdGhvdWdoIHRo ZSAKPiBlbmdpbmUgc2VlcyB0aGUgaG9zdCBhcyBub24tcmVzcG9uc2l2ZSwgdGhlIFZNcyB0aGF0 IHdlcmUgcnVubmluZyBvbiAKPiBpdCByZW1haW4gb24gdGhhdCBjcmFzaGVkIGhvc3QsIGFuZCBh IHF1ZXN0aW9uIG1hcmsgKD8pIGFwcGVhcnMgbmV4dCAKPiB0byB0aGVtLiAgT3RoZXIgdGhhbiBj aGVja2luZyAiaGlnaGx5IGF2YWlsYWJsZSIsIGlzIHRoZXJlIGFub3RoZXIgCj4gc3RlcCB0aGF0 IG5lZWRzIHRvIGJlIG1hZGUgZm9yIGEgVk0gdG8gYmUgcmVzdGFydGVkIG9uIGEgd29ya2luZyBo b3N0IAo+IHNob3VsZCB0aGUgaG9zdCBpdCBpcyBydW5uaW5nIG9uIGZhaWw/Cj4KTWlncmF0aW9u IGRvZXMgbm90IHdvcmsgd2hlbiB0aGUgaG9zdCBjcmFzaGVzLiBJZiB5b3UgaGF2ZSBmZW5jaW5n IAplbmFibGVkLCB0aGUgZW5naW5lIGNhbiByZXN0YXJ0IHRoZSBkZWFkIGhvc3QgYW5kIHN0YXJ0 IHRoZSB2bXMgb24gCmFub3RoZXIgaG9zdC4gVGhpcyB3b3JrcyBmb3IgdXMuIEJ1dCB5b3UgaGF2 ZSB0byBoYXZlIHBvd2VyIG1hbmFnZW1lbnQgCmZvciB0aGUgaG9zdHMuCi0tIApFcm5lc3QgQmVp bnJvaHIsIEFYT04gUFJPCkluZyA8aHR0cDovL3d3dy5iZWlucm9oci5zay9pbmcucGhwPiwgUkhD RSAKPGh0dHA6Ly93d3cuYmVpbnJvaHIuc2svcmhjZS5waHA+LCBSSENWQSAKPGh0dHA6Ly93d3cu YmVpbnJvaHIuc2svcmhjZS5waHA+LCBMUElDIAo8aHR0cDovL3d3dy5iZWlucm9oci5zay9scGlj LnBocD4sIFZDQSA8aHR0cDovL3d3dy5iZWlucm9oci5zay92Y2EucGhwPiwKKzQyMS0yLTYyNDEw MzYwICs0MjEtOTAzLTQ4MjYwMwoKLS0tLS0tLS0tLS0tLS0wOTA1MDgwMDA2MDQwMzA1MDEwNzA0 MDkKQ29udGVudC1UeXBlOiB0ZXh0L2h0bWw7IGNoYXJzZXQ9VVRGLTgKQ29udGVudC1UcmFuc2Zl ci1FbmNvZGluZzogOGJpdAoKPGh0bWw+CiAgPGhlYWQ+CiAgICA8bWV0YSBjb250ZW50PSJ0ZXh0 L2h0bWw7IGNoYXJzZXQ9VVRGLTgiIGh0dHAtZXF1aXY9IkNvbnRlbnQtVHlwZSI+CiAgPC9oZWFk PgogIDxib2R5IGJnY29sb3I9IiNGRkZGRkYiIHRleHQ9IiMwMDAwMDAiPgogICAgPGRpdiBjbGFz cz0ibW96LWNpdGUtcHJlZml4Ij5ExYhhIDE3LjA0LjIwMTUgbyAwMjowMSBSb24gVgogICAgICBu YXDDrXNhbChhKTo8YnI+CiAgICA8L2Rpdj4KICAgIDxibG9ja3F1b3RlIGNpdGU9Im1pZDo1NTMw NEQ2Mi43MDQwOTA1QGFiYWNvbS5jb20iIHR5cGU9ImNpdGUiPkhlbGxvLAogICAgICA8YnI+CiAg ICAgIDxicj4KICAgICAgSSBhbSB0ZXN0aW5nIGd1ZXN0IFZNIG1pZ3JhdGlvbiBpbiB0aGUgZXZl bnQgb2YgYSBob3N0IGNyYXNoLCBhbmQKICAgICAgSSBhbSBzdXJwcmlzZWQgdG8gc2VlIHRoYXQg Z3Vlc3RzIHRoYXQgYXJlIHNlbGVjdGVkIHRvIGJlIGhpZ2hseQogICAgICBhdmFpbGFibGUgZG8g bm90IG1pZ3JhdGUgd2hlbiBhIGhvc3QgaXMgZm9yY2libHkgdHVybmVkIG9mZi4KICAgICAgPGJy PgogICAgICA8YnI+CiAgICAgIEkgaGF2ZSBhIDIgaG9zdCBjbHVzdGVyIHVzaW5nIGlTQ1NJIGZv ciBzdG9yYWdlLCBhbmQgd2hlbiBvbmUgb2YKICAgICAgdGhlIGhvc3RzLCBlaXRoZXIgdGhlIFNQ TSBvciBub3JtYWwsIGlzIGZvcmNpYmx5IHR1cm5lZCBvZmYsCiAgICAgIGFsdGhvdWdoIHRoZSBl bmdpbmUgc2VlcyB0aGUgaG9zdCBhcyBub24tcmVzcG9uc2l2ZSwgdGhlIFZNcyB0aGF0CiAgICAg IHdlcmUgcnVubmluZyBvbiBpdCByZW1haW4gb24gdGhhdCBjcmFzaGVkIGhvc3QsIGFuZCBhIHF1 ZXN0aW9uCiAgICAgIG1hcmsgKD8pIGFwcGVhcnMgbmV4dCB0byB0aGVtLsKgIE90aGVyIHRoYW4g Y2hlY2tpbmcgImhpZ2hseQogICAgICBhdmFpbGFibGUiLCBpcyB0aGVyZSBhbm90aGVyIHN0ZXAg dGhhdCBuZWVkcyB0byBiZSBtYWRlIGZvciBhIFZNCiAgICAgIHRvIGJlIHJlc3RhcnRlZCBvbiBh IHdvcmtpbmcgaG9zdCBzaG91bGQgdGhlIGhvc3QgaXQgaXMgcnVubmluZyBvbgogICAgICBmYWls PwogICAgICA8YnI+CiAgICAgIDxicj4KICAgIDwvYmxvY2txdW90ZT4KICAgIE1pZ3JhdGlvbiBk b2VzIG5vdCB3b3JrIHdoZW4gdGhlIGhvc3QgY3Jhc2hlcy4gSWYgeW91IGhhdmUgZmVuY2luZwog ICAgZW5hYmxlZCwgdGhlIGVuZ2luZSBjYW4gcmVzdGFydCB0aGUgZGVhZCBob3N0IGFuZCBzdGFy dCB0aGUgdm1zIG9uCiAgICBhbm90aGVyIGhvc3QuIFRoaXMgd29ya3MgZm9yIHVzLiBCdXQgeW91 IGhhdmUgdG8gaGF2ZSBwb3dlcgogICAgbWFuYWdlbWVudCBmb3IgdGhlIGhvc3RzLjxicj4KICAg IDxkaXYgY2xhc3M9Im1vei1zaWduYXR1cmUiPi0tIDxicj4KICAgICAgPGRpdiBpZD0ib2Vybmlp X2Zvb3RlciIgc3R5bGU9ImNvbG9yOiBncmF5OyI+CiAgICAgICAgPHNwYW4gc3R5bGU9ImZvbnQt ZmFtaWx5OiBMdWNpZGEgQ29uc29sZSwgTHV4aSBNb25vLCBDb3VyaWVyLAogICAgICAgICAgbW9u b3NwYWNlOyBmb250LXNpemU6IDkwJTsiPgogICAgICAgICAgRXJuZXN0IEJlaW5yb2hyLCBBWE9O IFBSTzxicj4KICAgICAgICAgIDxhIHN0eWxlPSJ0ZXh0LWRlY29yYXRpb246IG5vbmU7IGNvbG9y OiBncmF5OyIKICAgICAgICAgICAgaHJlZj0iaHR0cDovL3d3dy5iZWlucm9oci5zay9pbmcucGhw Ij5Jbmc8L2E+LCA8YQogICAgICAgICAgICBzdHlsZT0idGV4dC1kZWNvcmF0aW9uOiBub25lOyBj b2xvcjogZ3JheTsiCiAgICAgICAgICAgIGhyZWY9Imh0dHA6Ly93d3cuYmVpbnJvaHIuc2svcmhj ZS5waHAiPlJIQ0U8L2E+LCA8YQogICAgICAgICAgICBzdHlsZT0idGV4dC1kZWNvcmF0aW9uOiBu b25lOyBjb2xvcjogZ3JheTsiCiAgICAgICAgICAgIGhyZWY9Imh0dHA6Ly93d3cuYmVpbnJvaHIu c2svcmhjZS5waHAiPlJIQ1ZBPC9hPiwgPGEKICAgICAgICAgICAgc3R5bGU9InRleHQtZGVjb3Jh dGlvbjogbm9uZTsgY29sb3I6IGdyYXk7IgogICAgICAgICAgICBocmVmPSJodHRwOi8vd3d3LmJl aW5yb2hyLnNrL2xwaWMucGhwIj5MUElDPC9hPiwgPGEKICAgICAgICAgICAgc3R5bGU9InRleHQt ZGVjb3JhdGlvbjogbm9uZTsgY29sb3I6IGdyYXk7IgogICAgICAgICAgICBocmVmPSJodHRwOi8v d3d3LmJlaW5yb2hyLnNrL3ZjYS5waHAiPlZDQTwvYT4sIDxicj4KICAgICAgICAgICs0MjEtMi02 MjQxMDM2MCArNDIxLTkwMy00ODI2MDMKICAgICAgICAgIDxicj4KICAgICAgICA8L3NwYW4+IDwv ZGl2PgogICAgICA8aW1nCiAgICAgICAgc3JjPSJodHRwOi8vbm9qc3N0YXRzLmFwcHNwb3QuY29t L1VBLTQ0NDk3MDk2LTEvZW1haWwuYmVpbnJvaHIuc2siCiAgICAgICAgbW96LWRvLW5vdC1zZW5k PSJ0cnVlIiBib3JkZXI9IjAiIGhlaWdodD0iMSIgd2lkdGg9IjEiPgogICAgPC9kaXY+CiAgPC9i b2R5Pgo8L2h0bWw+CgotLS0tLS0tLS0tLS0tLTA5MDUwODAwMDYwNDAzMDUwMTA3MDQwOS0tCg== --===============7005985096185594898==-- From patrick_russell at volusion.com Fri Apr 17 10:46:12 2015 Content-Type: multipart/mixed; boundary="===============5021710001851834868==" MIME-Version: 1.0 From: Patrick Russell To: users at ovirt.org Subject: Re: [ovirt-users] oVirt 3.5.2 Hypervisor crash, guest VMs not migrating Date: Fri, 17 Apr 2015 14:46:12 +0000 Message-ID: <416399B9-B5C5-43CE-8412-D4FDDB55B642@volusion.com> In-Reply-To: 55304D62.7040905@abacom.com --===============5021710001851834868== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: quoted-printable If you turn the host back on, do the VM then power up on the host that was = never down? If so, we have filed a bug around this in our 3.5.1 environment. https://bugzilla.redhat.com/show_bug.cgi?id=3D1192596 -Patrick > On Apr 16, 2015, at 7:01 PM, Ron V wrote: > = > Hello, > = > I am testing guest VM migration in the event of a host crash, and I am su= rprised to see that guests that are selected to be highly available do not = migrate when a host is forcibly turned off. > = > I have a 2 host cluster using iSCSI for storage, and when one of the host= s, either the SPM or normal, is forcibly turned off, although the engine se= es the host as non-responsive, the VMs that were running on it remain on th= at crashed host, and a question mark (?) appears next to them. Other than = checking "highly available", is there another step that needs to be made fo= r a VM to be restarted on a working host should the host it is running on f= ail? > = > Thanks, > = > R. > = > _______________________________________________ > Users mailing list > Users(a)ovirt.org > http://lists.ovirt.org/mailman/listinfo/users --===============5021710001851834868==-- From ronvach at abacom.com Fri Apr 17 11:00:18 2015 Content-Type: multipart/mixed; boundary="===============7094232510652397215==" MIME-Version: 1.0 From: Ron V To: users at ovirt.org Subject: Re: [ovirt-users] oVirt 3.5.2 Hypervisor crash, guest VMs not migrating Date: Fri, 17 Apr 2015 11:00:18 -0400 Message-ID: <55312002.4000509@abacom.com> In-Reply-To: 416399B9-B5C5-43CE-8412-D4FDDB55B642@volusion.com --===============7094232510652397215== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: quoted-printable On 4/17/2015 10:46 AM, Patrick Russell wrote: > If you turn the host back on, do the VM then power up on the host that wa= s never down? > > If so, we have filed a bug around this in our 3.5.1 environment. > > https://bugzilla.redhat.com/show_bug.cgi?id=3D1192596 > > -Patrick Hello, yes this is exactly what happens. The engine is reporting trying to = fence the down host, and while the host is marked non-responsive. In my = scenario I am testing by yanking the power on one of the hosts, and the = idrac5 that is used to fence becomes unresponsive as well as a result, = which might be a contributing factor. In any case, VMs remain in a (?) = state and so long as I don't click "host has been manually rebooted") = they don't come online elsewhere even after a few hours. R. --===============7094232510652397215==-- From ronvach at abacom.com Fri Apr 17 11:06:41 2015 Content-Type: multipart/mixed; boundary="===============1089421530167917444==" MIME-Version: 1.0 From: Ron V To: users at ovirt.org Subject: Re: [ovirt-users] oVirt 3.5.1 Hypervisor crash, guest VMs not migrating Date: Fri, 17 Apr 2015 11:06:41 -0400 Message-ID: <55312181.4070300@abacom.com> In-Reply-To: 55312002.4000509@abacom.com --===============1089421530167917444== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: quoted-printable On 4/17/2015 11:00 AM, Ron V wrote: > > > On 4/17/2015 10:46 AM, Patrick Russell wrote: >> If you turn the host back on, do the VM then power up on the host = >> that was never down? >> >> If so, we have filed a bug around this in our 3.5.1 environment. >> >> https://bugzilla.redhat.com/show_bug.cgi?id=3D1192596 >> >> -Patrick > I made a typo in the subject, I am also running 3.5.1. apologies. I = have edited the subject, hopefully thats allowed on this list. R. --===============1089421530167917444==--