<html><head><meta http-equiv="Content-Type" content="text/html; charset=utf-8"></head><body style="word-wrap: break-word; -webkit-nbsp-mode: space; line-break: after-white-space;" class=""><br class=""><div><br class=""><blockquote type="cite" class=""><div class="">On 17 Apr 2018, at 11:28, Stefano Stagnaro <<a href="mailto:stefanos@prismatelecomtesting.com" class="">stefanos@prismatelecomtesting.com</a>> wrote:</div><br class="Apple-interchange-newline"><div class=""><meta http-equiv="Content-Type" content="text/html; charset=utf-8" class=""><div class=""><div class="">On Thu, 2018-04-12 at 20:20 +0200, Michal Skrivanek wrote:</div><blockquote type="cite" class=""><br class=""><div class=""><br class=""><blockquote type="cite" class=""><div class="">On 12 Apr 2018, at 18:26, Stefano Stagnaro <<a href="mailto:stefanos@prismatelecomtesting.com" class="">stefanos@prismatelecomtesting.com</a>> wrote:</div><div class=""><div class="">Hi,<br class=""><br class="">I recently upgraded an oVirt deployment from 3.6 to 4.0 and then 4.1.9 (my actual release). Since then, when migrating many hosts simultaneously I always experience few migrations failure like 1 on 10 vms. The failure can occur on any host; moreover, after a couple of failure the destination host fall in Error status and I have to manually re-activate or wait 30 min.<br class=""></div></div></blockquote></div></blockquote></div></div></blockquote><div><br class=""></div>Are they all in a 3.6 cluster? </div><div><br class=""><blockquote type="cite" class=""><div class=""><div class=""><blockquote type="cite" class=""><div class=""><blockquote type="cite" class=""><div class=""><div class=""><br class="">Tipical error found on vdsm log is (from the source host):<br class="">2018-04-12 17:01:32,097+0200 ERROR (migsrc/3192dfe7) [virt.vm] (vmId='3192dfe7-eeac-4626-8c86-e49facc9006f') migration destination error: Fatal error during migration (migration:287)<br class=""><br class="">Please find the logs of source host (v15.ovirt), destination host (v14.ovirt) and engine here: <a href="https://www.dropbox.com/sh/xhf8ry4ih40poxd/AABxiFCIxDe14HSx2DqLE61ya?dl=0" class="">https://www.dropbox.com/sh/xhf8ry4ih40poxd/AABxiFCIxDe14HSx2DqLE61ya?dl=0</a><br class=""><br class="">Some of the vm affected from the migration failure are:<br class="">svn        3192dfe7-eeac-4626-8c86-e49facc9006f<br class="">wood        a8e83ff0-dfed-4074-b6b6-e947b8ebb952<br class="">qnx66        5697c4a4-9e40-4dd6-aba2-c8ab9904a584<br class=""></div></div></blockquote><div class=""><br class=""></div>can you also include qemu log from /var/log/libvirt/qemu/<vmname>?</div></blockquote><div class=""><br class=""></div><div class="">Hi Michal, I've added libvirt logs for relevant VMs on the previous Dropbox share.</div></div></div></blockquote><div><br class=""></div>I do not see anything wrong. It’s a bit too much data to go through, can you pinpoint the time and VM name when you see a failure?</div><div><br class=""></div><div>Thanks,</div><div>michal<br class=""><blockquote type="cite" class=""><div class=""><div class=""><div class=""><br class=""></div><blockquote type="cite" class=""><div class=""><br class=""></div><div class="">btw you seem to be using the legacy migration policy throttling the speed significantly. Please read into the migration enhancements in 4.0</div><div class=""><a href="https://www.ovirt.org/develop/release-management/features/virt/migration-enhancements/" class="">https://www.ovirt.org/develop/release-management/features/virt/migration-enhancements/</a></div></blockquote><div class=""><br class=""></div><div class="">I've already moved to Minimal Downtime and then to Post-copy with same results. VM migrations continue to fail randomly.</div><div class=""><br class=""></div><blockquote type="cite" class=""><div class=""><br class=""></div><div class="">Thanks,</div><div class="">michal</div></blockquote><div class=""><br class=""></div><div class="">Thanks,</div><div class="">Stefano.</div><div class=""><br class=""></div><div class=""><br class=""></div><blockquote type="cite" class=""><div class=""><br class=""><blockquote type="cite" class=""><div class=""><div class=""><br class="">Thank you very much for your help.<br class=""><br class="">-- <br class="">Stefano Stagnaro<br class=""><br class="">Prisma Telecom Testing S.r.l.<br class="">Via Petrocchi, 4<br class="">20127 Milano – Italy<br class=""><br class="">Tel. 02 26113507 int 339<br class=""><a href="mailto:stefanos@prismatelecomtesting.com" class="">e-mail: stefanos@prismatelecomtesting.com</a><br class="">skype: stefano.stagnaro<br class="">_______________________________________________<br class="">Users mailing list<br class=""><a href="mailto:Users@ovirt.org" class="">Users@ovirt.org</a><br class="">http://lists.ovirt.org/mailman/listinfo/users<br class=""></div></div></blockquote></div><br class=""></blockquote><div class=""><span class=""><pre class=""><pre class=""><br class=""></pre></pre></span></div></div></div></blockquote></div><br class=""></body></html>