I had the same issue and also have a support case open. Support referenced https://bugzilla.redhat.com/show_bug.cgi?id=1288237, which is private, and I had no success getting that Bugzilla entry made public. We couldn't keep waiting for a fix, so we replaced the NICs with Broadcom/QLogic cards that we knew had no issues in other hosts.

On Thu, Mar 17, 2016 at 11:27 AM, Sigbjorn Lie <sigbjorn@nixtra.com> wrote:
Hi,

Is this on CentOS/RHEL 7.2?

Log in as root and check dmesg for any messages from ixgbe about "tx queue hung". I
currently have an open support case for RHEL 7.2 and the ixgbe driver, where a driver
issue causes the network adapter to reset continuously whenever there is network traffic.
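
For anyone who wants to script the dmesg check above, here is a minimal sketch.
It assumes Python 3.7+ on the host and root access to the kernel log. The
function names and the keyword list are made up for illustration, and the exact
wording the driver logs may differ from "tx queue hung", so the filter is
deliberately loose.

```python
import subprocess


def ixgbe_lines(log_text, needle="ixgbe"):
    """Return the log lines that mention the driver name (case-insensitive)."""
    needle = needle.lower()
    return [line for line in log_text.splitlines() if needle in line.lower()]


def hang_lines(lines, keywords=("hung", "hang", "timed out", "reset")):
    """Keep lines containing any keyword. The keyword list is an assumption,
    not the driver's documented message set."""
    return [line for line in lines if any(k in line.lower() for k in keywords)]


if __name__ == "__main__":
    # Reading the kernel ring buffer usually requires root.
    result = subprocess.run(
        ["dmesg"], capture_output=True, text=True, check=True
    )
    for line in hang_lines(ixgbe_lines(result.stdout)):
        print(line)
```

If this prints nothing while the host is dropping off the network, the problem
may not be the same driver issue.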


Regards,
Siggi



On Thu, March 17, 2016 12:52, Nir Soffer wrote:
> On Thu, Mar 17, 2016 at 10:49 AM, Johan Kooijman <mail@johankooijman.com> wrote:
>
>> Hi all,
>>
>>
>> Since we upgraded to the latest oVirt node running 7.2, we're seeing that
>> nodes become unavailable after a while. A node runs fine, with a couple of VMs on it, until it
>> becomes unresponsive. At that moment it doesn't even respond to ICMP. It comes back by
>> itself after a while, but oVirt fences the machine before then and restarts the VMs elsewhere.
>>
>>
>> Engine tells me this message:
>>
>>
>> VDSM host09 command failed: Message timeout which can be caused by
>> communication issues
>>
>> Is anyone else experiencing these issues with ixgbe drivers? I'm running on
>> Intel X540-AT2 cards.
>>
>
> We will need engine and vdsm logs to understand this issue.
>
>
> Can you file a bug and attach full logs?
>
>
> Nir
> _______________________________________________
> Users mailing list
> Users@ovirt.org
> http://lists.ovirt.org/mailman/listinfo/users
>
>

