[Users] Problem with VDSM becoming unresponsive

Hi,

I have an issue with a one-node oVirt setup with Gluster storage. Every so often (twice a week or so) the management console shows the node as unresponsive. I can ssh to the node fine and it is indeed responsive. I can see the VM processes taking CPU in top, and I can see the VDSM process as well.

Restarting vdsmd brings the node back to the Up state, but I then have to restart all the VMs that were running.

In the engine log file (attached) there is a "VDS::handleNetworkException Server failed to respond" error.

I am not sure how to fix this, so any help is appreciated.

Attached is the engine log.

Regards,
Daniel

Please attach the vdsm log, libvirtd.log, the sanlock log, and messages.

What is the status of the following services: vdsmd, libvirtd, sanlock?

----- Original Message -----
From: "Daniel Rowe" <daniel.fathom13@gmail.com>
To: users@ovirt.org
Sent: Monday, December 17, 2012 2:43:34 AM
Subject: [Users] Problem with VDSM becoming unresponsive
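For anyone gathering the same information, here is a minimal Python sketch of the check requested above: it reports the state of vdsmd, libvirtd and sanlock and lists which log files are present to attach. It assumes a systemd host (it calls `systemctl is-active`), and the log paths are assumed default locations that may differ by distribution and oVirt version. The function names are only illustrative.

```python
import os
import subprocess

# Service names taken from the request in this thread.
SERVICES = ["vdsmd", "libvirtd", "sanlock"]

# Assumed default log locations; adjust for your distribution and version.
LOG_PATHS = [
    "/var/log/vdsm/vdsm.log",
    "/var/log/libvirt/libvirtd.log",
    "/var/log/sanlock.log",
    "/var/log/messages",
]


def systemctl_state(service):
    """Return the state printed by `systemctl is-active` (assumes systemd)."""
    try:
        result = subprocess.run(
            ["systemctl", "is-active", service],
            stdout=subprocess.PIPE,
            stderr=subprocess.DEVNULL,
            universal_newlines=True,
        )
    except OSError:
        # systemctl is not installed or cannot be executed on this host.
        return "unknown (systemctl not available)"
    return result.stdout.strip() or "unknown"


def service_report(services=SERVICES, probe=systemctl_state):
    """Map each service name to the state string returned by probe()."""
    return {name: probe(name) for name in services}


def split_logs(paths=LOG_PATHS):
    """Split paths into (found, missing) so absent logs are easy to spot."""
    found = [p for p in paths if os.path.isfile(p)]
    missing = [p for p in paths if not os.path.isfile(p)]
    return found, missing


if __name__ == "__main__":
    for name, state in service_report().items():
        print("{0}: {1}".format(name, state))
    found, missing = split_logs()
    print("logs to attach:", ", ".join(found) or "none found")
    print("not found:", ", ".join(missing) or "none")
```

Run it as root on the node while the engine shows the host as unresponsive, before restarting vdsmd, so the reported states reflect the failure rather than the recovered system.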
participants (2): Daniel Rowe, Haim Ateya