Hi
I have an issue with a one node ovirt setup with gluster storage.
Every so often (twice a week or) the management console show the node
as unresponsive.
I can ssh to the node fine and it is indeed responsive. I can see the
VM processes taking CPU with top. I can see the VDSM process as well.
Restarting vsdmd causes the node to become in the up state again, but
I have to restart all the VMs that are running.
In the engine log file (attached) there is
"VDS::handleNetworkException Server failed to respond" error.
I am not sure how I can fix this so any help appreciated.
Attached is the engine log.
Regards
Daniel