
This is a multi-part message in MIME format. --------------C1E6C0A57787BAEC7E6BD4CD Content-Type: text/plain; charset=windows-1252; format=flowed Content-Transfer-Encoding: 7bit Hi Johan, On 07/18/2016 09:53 AM, Johan Kooijman wrote:
Hi Jeff,
was the issue ever resolved? Don't have permissions to view the bugzilla.
There are proposal patches in the bugzilla, I have requested more information about upstream status. As soon I have updates, I will reply here. For now, if you have the hardware and want to give a test against our latest upstream build jobs, links below: ovirt-node 3.6: http://jenkins.ovirt.org/job/ovirt-node_ovirt-3.6_create-iso-el7_merged/ ovirt-node 4.0 (next): http://jenkins.ovirt.org/job/ovirt-node-ng_ovirt-4.0-snapshot_build-artifact... Thanks!
On Thu, Mar 17, 2016 at 4:34 PM, Jeff Spahr <spahrj@gmail.com <mailto:spahrj@gmail.com>> wrote:
I had the same issue, and I also have a support case open. They referenced https://bugzilla.redhat.com/show_bug.cgi?id=1288237 which is private. I didn't have any success getting that bugzilla changed to public. We couldn't keep waiting for the issue to be fixed so we replaced the NICs with Broadcom/Qlogic that we knew had no issues in other hosts.
On Thu, Mar 17, 2016 at 11:27 AM, Sigbjorn Lie <sigbjorn@nixtra.com <mailto:sigbjorn@nixtra.com>> wrote:
Hi,
Is this on CentOS/RHEL 7.2?
Log in as root as see if you can see any messages from ixgbe about "tx queue hung" in dmesg. I currently have an open support case for RHEL7.2 and the ixgbe driver, where there is a driver issue causing the network adapter to reset continuously when there are network traffic.
Regards, Siggi
On Thu, March 17, 2016 12:52, Nir Soffer wrote: > On Thu, Mar 17, 2016 at 10:49 AM, Johan Kooijman <mail@johankooijman.com <mailto:mail@johankooijman.com>> wrote: > >> Hi all, >> >> >> Since we upgraded to the latest ovirt node running 7.2, we're seeing that >> nodes become unavailable after a while. It's running fine, with a couple of VM's on it, untill it >> becomes non responsive. At that moment it doesn't even respond to ICMP. It'll come back by >> itself after a while, but oVirt fences the machine before that time and restarts VM's elsewhere. >> >> >> Engine tells me this message: >> >> >> VDSM host09 command failed: Message timeout which can be caused by >> communication issues >> >> Is anyone else experiencing these issues with ixgbe drivers? I'm running on >> Intel X540-AT2 cards. >> > > We will need engine and vdsm logs to understand this issue. > > > Can you file a bug and attach ful logs? > > > Nir > _______________________________________________ > Users mailing list > Users@ovirt.org <mailto:Users@ovirt.org> > http://lists.ovirt.org/mailman/listinfo/users > >
_______________________________________________ Users mailing list Users@ovirt.org <mailto:Users@ovirt.org> http://lists.ovirt.org/mailman/listinfo/users
_______________________________________________ Users mailing list Users@ovirt.org <mailto:Users@ovirt.org> http://lists.ovirt.org/mailman/listinfo/users
-- Met vriendelijke groeten / With kind regards, Johan Kooijman
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
--------------C1E6C0A57787BAEC7E6BD4CD Content-Type: text/html; charset=windows-1252 Content-Transfer-Encoding: 8bit <html> <head> <meta content="text/html; charset=windows-1252" http-equiv="Content-Type"> </head> <body bgcolor="#FFFFFF" text="#000000"> <p>Hi Johan,<br> </p> <br> <div class="moz-cite-prefix">On 07/18/2016 09:53 AM, Johan Kooijman wrote:<br> </div> <blockquote cite="mid:CAHvs-HX2=e9qR61g9vu9inVikaspxFJgR8aRZwGOpUWN0wC1rQ@mail.gmail.com" type="cite"> <div dir="ltr">Hi Jeff, <div><br> </div> <div>was the issue ever resolved? Don't have permissions to view the bugzilla.</div> </div> </blockquote> <br> There are proposal patches in the bugzilla, I have requested more information about upstream status.<br> As soon I have updates, I will reply here. <br> <br> For now, if you have the hardware and want to give a test against our latest upstream build jobs, links below:<br> <br> ovirt-node 3.6:<br> <a class="moz-txt-link-freetext" href="http://jenkins.ovirt.org/job/ovirt-node_ovirt-3.6_create-iso-el7_merged/">http://jenkins.ovirt.org/job/ovirt-node_ovirt-3.6_create-iso-el7_merged/</a><br> <br> ovirt-node 4.0 (next):<br> <a class="moz-txt-link-freetext" href="http://jenkins.ovirt.org/job/ovirt-node-ng_ovirt-4.0-snapshot_build-artifacts-fc23-x86_64/">http://jenkins.ovirt.org/job/ovirt-node-ng_ovirt-4.0-snapshot_build-artifacts-fc23-x86_64/</a><br> <br> Thanks!<br> <br> <blockquote cite="mid:CAHvs-HX2=e9qR61g9vu9inVikaspxFJgR8aRZwGOpUWN0wC1rQ@mail.gmail.com" type="cite"> <div class="gmail_extra"><br> <div class="gmail_quote">On Thu, Mar 17, 2016 at 4:34 PM, Jeff Spahr <span dir="ltr"><<a moz-do-not-send="true" href="mailto:spahrj@gmail.com" target="_blank">spahrj@gmail.com</a>></span> wrote:<br> <blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"> <div dir="ltr">I had the same issue, and I also have a support case open. They referenced <a moz-do-not-send="true" href="https://bugzilla.redhat.com/show_bug.cgi?id=1288237" target="_blank">https://bugzilla.redhat.com/show_bug.cgi?id=1288237</a> which is private. I didn't have any success getting that bugzilla changed to public. We couldn't keep waiting for the issue to be fixed so we replaced the NICs with Broadcom/Qlogic that we knew had no issues in other hosts.<br> </div> <div class="HOEnZb"> <div class="h5"> <div class="gmail_extra"><br> <div class="gmail_quote">On Thu, Mar 17, 2016 at 11:27 AM, Sigbjorn Lie <span dir="ltr"><<a moz-do-not-send="true" href="mailto:sigbjorn@nixtra.com" target="_blank">sigbjorn@nixtra.com</a>></span> wrote:<br> <blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">Hi,<br> <br> Is this on CentOS/RHEL 7.2?<br> <br> Log in as root as see if you can see any messages from ixgbe about "tx queue hung" in dmesg. I<br> currently have an open support case for RHEL7.2 and the ixgbe driver, where there is a driver<br> issue causing the network adapter to reset continuously when there are network traffic.<br> <br> <br> Regards,<br> Siggi<br> <br> <br> <br> On Thu, March 17, 2016 12:52, Nir Soffer wrote:<br> > On Thu, Mar 17, 2016 at 10:49 AM, Johan Kooijman <<a moz-do-not-send="true" href="mailto:mail@johankooijman.com" target="_blank">mail@johankooijman.com</a>> wrote:<br> ><br> >> Hi all,<br> >><br> >><br> >> Since we upgraded to the latest ovirt node running 7.2, we're seeing that<br> >> nodes become unavailable after a while. It's running fine, with a couple of VM's on it, untill it<br> >> becomes non responsive. At that moment it doesn't even respond to ICMP. It'll come back by<br> >> itself after a while, but oVirt fences the machine before that time and restarts VM's elsewhere.<br> >><br> >><br> >> Engine tells me this message:<br> >><br> >><br> >> VDSM host09 command failed: Message timeout which can be caused by<br> >> communication issues<br> >><br> >> Is anyone else experiencing these issues with ixgbe drivers? I'm running on<br> >> Intel X540-AT2 cards.<br> >><br> ><br> > We will need engine and vdsm logs to understand this issue.<br> ><br> ><br> > Can you file a bug and attach ful logs?<br> ><br> ><br> > Nir<br> > _______________________________________________<br> > Users mailing list<br> > <a moz-do-not-send="true" href="mailto:Users@ovirt.org" target="_blank">Users@ovirt.org</a><br> > <a moz-do-not-send="true" href="http://lists.ovirt.org/mailman/listinfo/users" rel="noreferrer" target="_blank">http://lists.ovirt.org/mailman/listinfo/users</a><br> ><br> ><br> <br> <br> _______________________________________________<br> Users mailing list<br> <a moz-do-not-send="true" href="mailto:Users@ovirt.org" target="_blank">Users@ovirt.org</a><br> <a moz-do-not-send="true" href="http://lists.ovirt.org/mailman/listinfo/users" rel="noreferrer" target="_blank">http://lists.ovirt.org/mailman/listinfo/users</a><br> </blockquote> </div> <br> </div> </div> </div> <br> _______________________________________________<br> Users mailing list<br> <a moz-do-not-send="true" href="mailto:Users@ovirt.org">Users@ovirt.org</a><br> <a moz-do-not-send="true" href="http://lists.ovirt.org/mailman/listinfo/users" rel="noreferrer" target="_blank">http://lists.ovirt.org/mailman/listinfo/users</a><br> <br> </blockquote> </div> <br> <br clear="all"> <div><br> </div> -- <br> <div class="gmail_signature" data-smartmail="gmail_signature"> <div dir="ltr">Met vriendelijke groeten / With kind regards,<br> Johan Kooijman<br> </div> </div> </div> <br> <fieldset class="mimeAttachmentHeader"></fieldset> <br> <pre wrap="">_______________________________________________ Users mailing list <a class="moz-txt-link-abbreviated" href="mailto:Users@ovirt.org">Users@ovirt.org</a> <a class="moz-txt-link-freetext" href="http://lists.ovirt.org/mailman/listinfo/users">http://lists.ovirt.org/mailman/listinfo/users</a> </pre> </blockquote> <br> </body> </html> --------------C1E6C0A57787BAEC7E6BD4CD--