Gents,

I'm sending this email for archiving purposes:
It's been a while since my last update on this topic. It turned out that although only one node, randomly, at a time lost connection to storage, the issue was not at all with ovirt, but with the storage.

I'd like to refer to these 2 topics for more information: http://lists.freebsd.org/pipermail/freebsd-net/2014-March/038061.html and http://lists.freebsd.org/pipermail/freebsd-net/2014-February/037967.html.

Keywords: ovirt freebsd 9.2 zfs ixgbe intel 10gbit NFS


On Mon, Feb 24, 2014 at 3:55 PM, Ronen Hod <rhod@redhat.com> wrote:
On 02/24/2014 11:48 AM, Nir Soffer wrote:
----- Original Message -----
From: "Johan Kooijman" <mail@johankooijman.com>
To: "Nir Soffer" <nsoffer@redhat.com>
Cc: "users" <users@ovirt.org>
Sent: Monday, February 24, 2014 2:45:59 AM
Subject: Re: [Users] Nodes lose storage at random

Interestingly enough - same thing happened today, around the same time.
Logs from this host are attached.

Around 1:10 AM stuff starts to go wrong again. Same pattern - we reboot the
node and the node is fine again.
So we made some progress, we know that it is not a problem with old kernel.

In messages we see the same picture:

1. sanlock fail to renew the lease
2. after 80 secodns, kill vdsm
3. sanlock and vdsm cannot access the storage
4. kernel complain about nfs server timeouts
    (explains why sanlock failed to renew the lease)
5. after reboot, nfs is accessible again
6. after few days goto step 1

This looks like kernel nfs issue.

Could be also kvm issue (running bsd on the one of the vm?)

Could be also some incompatibility with the nfs server - maybe you are using
esoteric configuration options?

CCing Ronen, in case this is related to kvm.

Not seems to be related to KVM.
Adding Ric Wheeler.

Ronen.


thread: http://lists.ovirt.org/pipermail/users/2014-February/021507.html

Nir




--
Met vriendelijke groeten / With kind regards,
Johan Kooijman

T +31(0) 6 43 44 45 27
F +31(0) 162 82 00 01
E mail@johankooijman.com