[Users] Nodes lose storage at random

Nir Soffer nsoffer at redhat.com
Wed Feb 19 04:38:03 EST 2014


----- Original Message -----
> From: "Johan Kooijman" <mail at johankooijman.com>
> To: "users" <users at ovirt.org>
> Sent: Tuesday, February 18, 2014 1:32:56 PM
> Subject: [Users] Nodes lose storage at random
> 
> Hi All,
> 
> We're seeing some weird issues in our ovirt setup. We have 4 nodes connected
> and an NFS (v3) filestore (FreeBSD/ZFS).
> 
> Once in a while, it seems at random, a node loses their connection to
> storage, recovers it a minute later. The other nodes usually don't lose
> their storage at that moment. Just one, or two at a time.
> 
> We've setup extra tooling to verify the storage performance at those moments
> and the availability for other systems. It's always online, just the nodes
> don't think so.

In the logs, we see that vdsm was restarted:
MainThread::DEBUG::2014-02-18 10:48:35,809::vdsm::45::vds::(sigtermHandler) Received signal 15

But we don't know why it happened.

Please attach also /var/log/messages and /var/log/sanlock.log around the time that
vdsm was restarted.

Thanks,
Nir


More information about the Users mailing list