From michal.skrivanek at redhat.com Tue Oct 4 12:06:07 2016 Content-Type: multipart/mixed; boundary="===============0661634334032152280==" MIME-Version: 1.0 From: Michal Skrivanek To: users at ovirt.org Subject: Re: [ovirt-users] VM pauses/hangs after migration Date: Tue, 04 Oct 2016 18:06:04 +0200 Message-ID: <10004F01-9488-4C9A-9970-85F6F47D9770@redhat.com> In-Reply-To: CAPU8Gx742RqzTAnxNkoMishbEna8i5ANhnhAGh2vuSj3s3cEbg@mail.gmail.com --===============0661634334032152280== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: quoted-printable --Apple-Mail=3D_C55C9920-F117-4D6C-BD94-0D2709735EBA Content-Transfer-Encoding: quoted-printable Content-Type: text/plain; charset=3Dutf-8 > On 3 Oct 2016, at 10:39, Davide Ferrari wrote: >=3D20 >=3D20 >=3D20 > 2016-09-30 15:35 GMT+02:00 Michal Skrivanek =3D >: >=3D20 >=3D20 > that is a very low level error really pointing at HW issues. It may or = =3D may not be detected by memtest=3DE2=3D80=3DA6but I would give it a try >=3D20 >=3D20 > I left memtest86 running for 2 days and no error detected :( > =3D20 >> The only difference that this host (vmhost01) has is that it was the =3D first host installed in my self-hosted engine installation. But I have =3D already reinstalled it from GUI and menawhile I've upgraded to 4.0.4 =3D from 4.0.3. >=3D20 > does it happen only for the big 96GB VM? The others which you said are = =3D working, are they all small? > Might be worth trying other system stability tests, playing with =3D safer/slower settings in BIOS, use lower CPU cluster, etc >=3D20 >=3D20 > Yep, it happens only for the 96GB VM. Other VMs with fewer RAM (16GB =3D for example) can be created on or migrated to that host flawlessly. I'll = =3D try to play a little with BIOS settings but otherwise I'll have the HW =3D replaced. I was only trying to rule out possible oVirt SW problems due =3D to that host being the first I deployed (from CLI) when I installed the =3D cluster. I understand. Unfortunately it really does look like some sort of =3D incompatibility rather than a sw issue:/ >=3D20 > Thanks! >=3D20 > --=3D20 > Davide Ferrari > Senior Systems Engineer > _______________________________________________ > Users mailing list > Users(a)ovirt.org > http://lists.ovirt.org/mailman/listinfo/users --Apple-Mail=3D_C55C9920-F117-4D6C-BD94-0D2709735EBA Content-Transfer-Encoding: quoted-printable Content-Type: text/html; charset=3Dutf-8
On 3 Oct 2016, at 10:39, Davide Ferrari <davide(a)billymob.com<= /a>>=3D wrote:


2016-09-30 15:35 GMT+02:00 Michal= =3D Skrivanek <michal.skrivanek(a)redhat.com>:


that is a very low level error really =3D pointing at HW issues. It may or may not be detected by memtest=3DE2=3D80= =3DA6bu=3D t I would give it a try


I left memtest86 running for 2 days an= d =3D no error detected :(
 
The=3D only difference that this host (vmhost01) has is that it was the first =3D host installed in my self-hosted engine installation. But I have already = =3D reinstalled it from GUI and menawhile I've upgraded to 4.0.4 from =3D 4.0.3.
does it happen only for the big= =3D 96GB VM? The others which you said are working, are they all =3D small?
Might be worth trying other system stability tests, playing =3D with safer/slower settings in BIOS, use lower CPU cluster, etc


Yep, it happens only for the 96GB VM. = =3D Other VMs with fewer RAM (16GB for example) can be created on or =3D migrated to that host flawlessly. I'll try to play a little with BIOS =3D settings but otherwise I'll have the HW replaced. I was only trying to =3D rule out possible oVirt SW problems due to that host being the first I =3D deployed (from CLI) when I installed the cluster.

I understand. Unfortunately it really does look like =3D some sort of incompatibility rather than a sw issue:/


Thanks!

--
Davide Ferrari
Senior Systems Engineer
_______________________________________________
Users =3D mailing list
Users(a)ovirt.org
http://lists.ovirt.org/mailman/listinfo/users

=3D --Apple-Mail=3D_C55C9920-F117-4D6C-BD94-0D2709735EBA-- --===============0661634334032152280== Content-Type: multipart/alternative MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: attachment; filename="attachment.bin" Ci0tQXBwbGUtTWFpbD1fQzU1Qzk5MjAtRjExNy00RDZDLUJEOTQtMEQyNzA5NzM1RUJBCkNvbnRl bnQtVHJhbnNmZXItRW5jb2Rpbmc6IHF1b3RlZC1wcmludGFibGUKQ29udGVudC1UeXBlOiB0ZXh0 L3BsYWluOwoJY2hhcnNldD11dGYtOAoKCj4gT24gMyBPY3QgMjAxNiwgYXQgMTA6MzksIERhdmlk ZSBGZXJyYXJpIDxkYXZpZGVAYmlsbHltb2IuY29tPiB3cm90ZToKPj0yMAo+PTIwCj49MjAKPiAy MDE2LTA5LTMwIDE1OjM1IEdNVCswMjowMCBNaWNoYWwgU2tyaXZhbmVrID0KPG1pY2hhbC5za3Jp dmFuZWtAcmVkaGF0LmNvbSA8bWFpbHRvOm1pY2hhbC5za3JpdmFuZWtAcmVkaGF0LmNvbT4+Ogo+ PTIwCj49MjAKPiB0aGF0IGlzIGEgdmVyeSBsb3cgbGV2ZWwgZXJyb3IgcmVhbGx5IHBvaW50aW5n IGF0IEhXIGlzc3Vlcy4gSXQgbWF5IG9yID0KbWF5IG5vdCBiZSBkZXRlY3RlZCBieSBtZW10ZXN0 PUUyPTgwPUE2YnV0IEkgd291bGQgZ2l2ZSBpdCBhIHRyeQo+PTIwCj49MjAKPiBJIGxlZnQgbWVt dGVzdDg2IHJ1bm5pbmcgZm9yIDIgZGF5cyBhbmQgbm8gZXJyb3IgZGV0ZWN0ZWQgOigKPiA9MjAK Pj4gVGhlIG9ubHkgZGlmZmVyZW5jZSB0aGF0IHRoaXMgaG9zdCAodm1ob3N0MDEpIGhhcyBpcyB0 aGF0IGl0IHdhcyB0aGUgPQpmaXJzdCBob3N0IGluc3RhbGxlZCBpbiBteSBzZWxmLWhvc3RlZCBl bmdpbmUgaW5zdGFsbGF0aW9uLiBCdXQgSSBoYXZlID0KYWxyZWFkeSByZWluc3RhbGxlZCBpdCBm cm9tIEdVSSBhbmQgbWVuYXdoaWxlIEkndmUgdXBncmFkZWQgdG8gNC4wLjQgPQpmcm9tIDQuMC4z Lgo+PTIwCj4gZG9lcyBpdCBoYXBwZW4gb25seSBmb3IgdGhlIGJpZyA5NkdCIFZNPyBUaGUgb3Ro ZXJzIHdoaWNoIHlvdSBzYWlkIGFyZSA9CndvcmtpbmcsIGFyZSB0aGV5IGFsbCBzbWFsbD8KPiBN aWdodCBiZSB3b3J0aCB0cnlpbmcgb3RoZXIgc3lzdGVtIHN0YWJpbGl0eSB0ZXN0cywgcGxheWlu ZyB3aXRoID0Kc2FmZXIvc2xvd2VyIHNldHRpbmdzIGluIEJJT1MsIHVzZSBsb3dlciBDUFUgY2x1 c3RlciwgZXRjCj49MjAKPj0yMAo+IFllcCwgaXQgaGFwcGVucyBvbmx5IGZvciB0aGUgOTZHQiBW TS4gT3RoZXIgVk1zIHdpdGggZmV3ZXIgUkFNICgxNkdCID0KZm9yIGV4YW1wbGUpIGNhbiBiZSBj cmVhdGVkIG9uIG9yIG1pZ3JhdGVkIHRvIHRoYXQgaG9zdCBmbGF3bGVzc2x5LiBJJ2xsID0KdHJ5 IHRvIHBsYXkgYSBsaXR0bGUgd2l0aCBCSU9TIHNldHRpbmdzIGJ1dCBvdGhlcndpc2UgSSdsbCBo YXZlIHRoZSBIVyA9CnJlcGxhY2VkLiBJIHdhcyBvbmx5IHRyeWluZyB0byBydWxlIG91dCBwb3Nz aWJsZSBvVmlydCBTVyBwcm9ibGVtcyBkdWUgPQp0byB0aGF0IGhvc3QgYmVpbmcgdGhlIGZpcnN0 IEkgZGVwbG95ZWQgKGZyb20gQ0xJKSB3aGVuIEkgaW5zdGFsbGVkIHRoZSA9CmNsdXN0ZXIuCgpJ IHVuZGVyc3RhbmQuIFVuZm9ydHVuYXRlbHkgaXQgcmVhbGx5IGRvZXMgbG9vayBsaWtlIHNvbWUg c29ydCBvZiA9CmluY29tcGF0aWJpbGl0eSByYXRoZXIgdGhhbiBhIHN3IGlzc3VlOi8KCj49MjAK PiBUaGFua3MhCj49MjAKPiAtLT0yMAo+IERhdmlkZSBGZXJyYXJpCj4gU2VuaW9yIFN5c3RlbXMg RW5naW5lZXIKPiBfX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19f Xwo+IFVzZXJzIG1haWxpbmcgbGlzdAo+IFVzZXJzQG92aXJ0Lm9yZwo+IGh0dHA6Ly9saXN0cy5v dmlydC5vcmcvbWFpbG1hbi9saXN0aW5mby91c2VycwoKCi0tQXBwbGUtTWFpbD1fQzU1Qzk5MjAt RjExNy00RDZDLUJEOTQtMEQyNzA5NzM1RUJBCkNvbnRlbnQtVHJhbnNmZXItRW5jb2Rpbmc6IHF1 b3RlZC1wcmludGFibGUKQ29udGVudC1UeXBlOiB0ZXh0L2h0bWw7CgljaGFyc2V0PXV0Zi04Cgo8 aHRtbD48aGVhZD48bWV0YSBodHRwLWVxdWl2PTNEIkNvbnRlbnQtVHlwZSIgY29udGVudD0zRCJ0 ZXh0L2h0bWwgPQpjaGFyc2V0PTNEdXRmLTgiPjwvaGVhZD48Ym9keSBzdHlsZT0zRCJ3b3JkLXdy YXA6IGJyZWFrLXdvcmQ7ID0KLXdlYmtpdC1uYnNwLW1vZGU6IHNwYWNlOyAtd2Via2l0LWxpbmUt YnJlYWs6IGFmdGVyLXdoaXRlLXNwYWNlOyIgPQpjbGFzcz0zRCIiPjxiciBjbGFzcz0zRCIiPjxk aXY+PGJsb2NrcXVvdGUgdHlwZT0zRCJjaXRlIiBjbGFzcz0zRCIiPjxkaXYgPQpjbGFzcz0zRCIi Pk9uIDMgT2N0IDIwMTYsIGF0IDEwOjM5LCBEYXZpZGUgRmVycmFyaSAmbHQ7PGEgPQpocmVmPTNE Im1haWx0bzpkYXZpZGVAYmlsbHltb2IuY29tIiBjbGFzcz0zRCIiPmRhdmlkZUBiaWxseW1vYi5j b208L2E+Jmd0Oz0KIHdyb3RlOjwvZGl2PjxiciBjbGFzcz0zRCJBcHBsZS1pbnRlcmNoYW5nZS1u ZXdsaW5lIj48ZGl2IGNsYXNzPTNEIiI+PGRpdiA9CmRpcj0zRCJsdHIiIGNsYXNzPTNEIiI+PGJy IGNsYXNzPTNEIiI+PGRpdiBjbGFzcz0zRCJnbWFpbF9leHRyYSI+PGJyID0KY2xhc3M9M0QiIj48 ZGl2IGNsYXNzPTNEImdtYWlsX3F1b3RlIj4yMDE2LTA5LTMwIDE1OjM1IEdNVCswMjowMCBNaWNo YWwgPQpTa3JpdmFuZWsgPHNwYW4gZGlyPTNEImx0ciIgY2xhc3M9M0QiIj4mbHQ7PGEgPQpocmVm PTNEIm1haWx0bzptaWNoYWwuc2tyaXZhbmVrQHJlZGhhdC5jb20iIHRhcmdldD0zRCJfYmxhbmsi ID0KY2xhc3M9M0QiIj5taWNoYWwuc2tyaXZhbmVrQHJlZGhhdC5jb208L2E+Jmd0Ozwvc3Bhbj46 PGJyID0KY2xhc3M9M0QiIj48YmxvY2txdW90ZSBjbGFzcz0zRCJnbWFpbF9xdW90ZSIgc3R5bGU9 M0QibWFyZ2luOjAgMCAwID0KLjhleDtib3JkZXItbGVmdDoxcHggI2NjYyBzb2xpZDtwYWRkaW5n LWxlZnQ6MWV4Ij48ZGl2ID0Kc3R5bGU9M0Qid29yZC13cmFwOmJyZWFrLXdvcmQiIGNsYXNzPTNE IiI+PGJyIGNsYXNzPTNEIiI+PGJyID0KY2xhc3M9M0QiIj48ZGl2IGNsYXNzPTNEIiI+dGhhdCBp cyBhIHZlcnkgbG93IGxldmVsIGVycm9yIHJlYWxseSA9CnBvaW50aW5nIGF0IEhXIGlzc3Vlcy4g SXQgbWF5IG9yIG1heSBub3QgYmUgZGV0ZWN0ZWQgYnkgbWVtdGVzdD1FMj04MD1BNmJ1PQp0IEkg d291bGQgZ2l2ZSBpdCBhIHRyeTwvZGl2PjxkaXYgY2xhc3M9M0QiIj48c3BhbiBjbGFzcz0zRCIi PjxiciA9CmNsYXNzPTNEIiI+PC9zcGFuPjwvZGl2PjwvZGl2PjwvYmxvY2txdW90ZT48ZGl2IGNs YXNzPTNEIiI+PGJyID0KY2xhc3M9M0QiIj48L2Rpdj48ZGl2IGNsYXNzPTNEIiI+SSBsZWZ0IG1l bXRlc3Q4NiBydW5uaW5nIGZvciAyIGRheXMgYW5kID0Kbm8gZXJyb3IgZGV0ZWN0ZWQgOig8YnIg Y2xhc3M9M0QiIj4mbmJzcDs8YnIgY2xhc3M9M0QiIj48L2Rpdj48YmxvY2txdW90ZSA9CmNsYXNz PTNEImdtYWlsX3F1b3RlIiBzdHlsZT0zRCJtYXJnaW46MCAwIDAgLjhleDtib3JkZXItbGVmdDox cHggI2NjYyA9CnNvbGlkO3BhZGRpbmctbGVmdDoxZXgiPjxkaXYgc3R5bGU9M0Qid29yZC13cmFw OmJyZWFrLXdvcmQiID0KY2xhc3M9M0QiIj48ZGl2IGNsYXNzPTNEIiI+PHNwYW4gY2xhc3M9M0Qi Ij48YmxvY2txdW90ZSB0eXBlPTNEImNpdGUiID0KY2xhc3M9M0QiIj48ZGl2IGNsYXNzPTNEIiI+ PGRpdiBkaXI9M0QibHRyIiBjbGFzcz0zRCIiPjxkaXYgY2xhc3M9M0QiIj5UaGU9CiBvbmx5IGRp ZmZlcmVuY2UgdGhhdCB0aGlzIGhvc3QgKHZtaG9zdDAxKSBoYXMgaXMgdGhhdCBpdCB3YXMgdGhl IGZpcnN0ID0KaG9zdCBpbnN0YWxsZWQgaW4gbXkgc2VsZi1ob3N0ZWQgZW5naW5lIGluc3RhbGxh dGlvbi4gQnV0IEkgaGF2ZSBhbHJlYWR5ID0KcmVpbnN0YWxsZWQgaXQgZnJvbSBHVUkgYW5kIG1l bmF3aGlsZSBJJ3ZlIHVwZ3JhZGVkIHRvIDQuMC40IGZyb20gPQo0LjAuMy48YnIgY2xhc3M9M0Qi Ij48L2Rpdj48L2Rpdj48L2Rpdj48L2Jsb2NrcXVvdGU+PGRpdiBjbGFzcz0zRCIiPjxiciA9CmNs YXNzPTNEIiI+PC9kaXY+PC9zcGFuPjxkaXYgY2xhc3M9M0QiIj5kb2VzIGl0IGhhcHBlbiBvbmx5 IGZvciB0aGUgYmlnID0KOTZHQiBWTT8gVGhlIG90aGVycyB3aGljaCB5b3Ugc2FpZCBhcmUgd29y a2luZywgYXJlIHRoZXkgYWxsID0Kc21hbGw/PC9kaXY+TWlnaHQgYmUgd29ydGggdHJ5aW5nIG90 aGVyIHN5c3RlbSBzdGFiaWxpdHkgdGVzdHMsIHBsYXlpbmcgPQp3aXRoIHNhZmVyL3Nsb3dlciBz ZXR0aW5ncyBpbiBCSU9TLCB1c2UgbG93ZXIgQ1BVIGNsdXN0ZXIsIGV0YzwvZGl2PjxkaXYgPQpj bGFzcz0zRCIiPjxkaXYgY2xhc3M9M0QiaDUiPjxiciA9CmNsYXNzPTNEIiI+PC9kaXY+PC9kaXY+ PC9kaXY+PC9ibG9ja3F1b3RlPjxkaXYgY2xhc3M9M0QiIj48YnIgPQpjbGFzcz0zRCIiPjwvZGl2 PjxkaXYgY2xhc3M9M0QiIj5ZZXAsIGl0IGhhcHBlbnMgb25seSBmb3IgdGhlIDk2R0IgVk0uID0K T3RoZXIgVk1zIHdpdGggZmV3ZXIgUkFNICgxNkdCIGZvciBleGFtcGxlKSBjYW4gYmUgY3JlYXRl ZCBvbiBvciA9Cm1pZ3JhdGVkIHRvIHRoYXQgaG9zdCBmbGF3bGVzc2x5LiBJJ2xsIHRyeSB0byBw bGF5IGEgbGl0dGxlIHdpdGggQklPUyA9CnNldHRpbmdzIGJ1dCBvdGhlcndpc2UgSSdsbCBoYXZl IHRoZSBIVyByZXBsYWNlZC4gSSB3YXMgb25seSB0cnlpbmcgdG8gPQpydWxlIG91dCBwb3NzaWJs ZSBvVmlydCBTVyBwcm9ibGVtcyBkdWUgdG8gdGhhdCBob3N0IGJlaW5nIHRoZSBmaXJzdCBJID0K ZGVwbG95ZWQgKGZyb20gQ0xJKSB3aGVuIEkgaW5zdGFsbGVkIHRoZSBjbHVzdGVyLjxiciA9CmNs YXNzPTNEIiI+PC9kaXY+PC9kaXY+PC9kaXY+PC9kaXY+PC9kaXY+PC9ibG9ja3F1b3RlPjxkaXY+ PGJyID0KY2xhc3M9M0QiIj48L2Rpdj5JIHVuZGVyc3RhbmQuIFVuZm9ydHVuYXRlbHkgaXQgcmVh bGx5IGRvZXMgbG9vayBsaWtlID0Kc29tZSBzb3J0IG9mIGluY29tcGF0aWJpbGl0eSByYXRoZXIg dGhhbiBhIHN3IGlzc3VlOi88L2Rpdj48ZGl2PjxiciA9CmNsYXNzPTNEIiI+PGJsb2NrcXVvdGUg dHlwZT0zRCJjaXRlIiBjbGFzcz0zRCIiPjxkaXYgY2xhc3M9M0QiIj48ZGl2ID0KZGlyPTNEImx0 ciIgY2xhc3M9M0QiIj48ZGl2IGNsYXNzPTNEImdtYWlsX2V4dHJhIj48ZGl2ID0KY2xhc3M9M0Qi Z21haWxfcXVvdGUiPjxkaXYgY2xhc3M9M0QiIj48YnIgY2xhc3M9M0QiIj48L2Rpdj48ZGl2ID0K Y2xhc3M9M0QiIj5UaGFua3MhPGJyIGNsYXNzPTNEIiI+PC9kaXY+PC9kaXY+PGJyIGNsYXNzPTNE IiI+LS0gPGJyID0KY2xhc3M9M0QiIj48ZGl2IGNsYXNzPTNEImdtYWlsX3NpZ25hdHVyZSIgPQpk YXRhLXNtYXJ0bWFpbD0zRCJnbWFpbF9zaWduYXR1cmUiPjxkaXYgZGlyPTNEImx0ciIgY2xhc3M9 M0QiIj48ZGl2ID0KY2xhc3M9M0QiIj5EYXZpZGUgRmVycmFyaTxiciBjbGFzcz0zRCIiPjwvZGl2 PlNlbmlvciBTeXN0ZW1zIEVuZ2luZWVyPGJyID0KY2xhc3M9M0QiIj48L2Rpdj48L2Rpdj4KPC9k aXY+PC9kaXY+Cl9fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19f PGJyIGNsYXNzPTNEIiI+VXNlcnMgPQptYWlsaW5nIGxpc3Q8YnIgY2xhc3M9M0QiIj48YSBocmVm PTNEIm1haWx0bzpVc2Vyc0BvdmlydC5vcmciID0KY2xhc3M9M0QiIj5Vc2Vyc0BvdmlydC5vcmc8 L2E+PGJyID0KY2xhc3M9M0QiIj5odHRwOi8vbGlzdHMub3ZpcnQub3JnL21haWxtYW4vbGlzdGlu Zm8vdXNlcnM8YnIgPQpjbGFzcz0zRCIiPjwvZGl2PjwvYmxvY2txdW90ZT48L2Rpdj48YnIgY2xh c3M9M0QiIj48L2JvZHk+PC9odG1sPj0KCi0tQXBwbGUtTWFpbD1fQzU1Qzk5MjAtRjExNy00RDZD LUJEOTQtMEQyNzA5NzM1RUJBLS0K --===============0661634334032152280==--