Hello All,
I have recently had the pleasure of testing oVirt 3.4.2 in our network. I have two hypervisors running a HA hosted engine and four Gluster servers. The four Gluster servers run two separate replicated storage domains. After a fresh install everything seemed to work, but when I rebooted Gluster server #4 it would not move to the UP state. After some investigation I realized that the glusterd daemon was not set to start upon boot. My resolution was to run "chkconfig glusterd on && service glusterd start". Now I encountered a new error. The server keeps moving from an UP status to DOWN status and vice versa. This seems to be the related error message in /var/log/messages:
"Jun 24 07:59:59 CTI-SAN06 vdsm vds ERROR vdsm exception occured#012Traceback (most recent call last):#012 File "/usr/share/vdsm/BindingXMLRPC.py", line 1070, in wrapper#012 res = f(*args, **kwargs)#012 File "/usr/share/vdsm/gluster/api.py", line 54, in wrapper#012 rv = func(*args, **kwargs)#012 File "/usr/share/vdsm/gluster/api.py", line 240, in hostsList#012 return {'hosts': self.svdsmProxy.glusterPeerStatus()}#012 File "/usr/share/vdsm/supervdsm.py", line 50, in __call__#012 return callMethod()#012 File "/usr/share/vdsm/supervdsm.py", line 48, in <lambda>#012 **kwargs)#012 File "<string>", line 2, in glusterPeerStatus#012 File "/usr/lib64/python2.6/multiprocessing/managers.py", line 740, in _callmethod#012 raise convert_to_error(kind, result)#012GlusterCmdExecFailedException: Command execution failed#012error: Connection failed. Please check if gluster daemon is operational.#012return code: 1".
The last line seems to be a dead giveaway "Please check
if gluster daemon is operational". When I run "server glusterd
status" it prints "glusterd (pid xxxxx) is running...".
So I did some Internet searches and found this bug
http://gerrit.ovirt.org/#/c/23982/, but it says that it has been fixed. So I
thought maybe my packages are out of date. Furthermore I updated my packages
using "yum -y update". All my Gluster servers are showing that they
are running Gluster 3.5.0. I am still having no success getting my fourth
Gluster server to remain in the UP state.
Any advice/help would be greatly appreciated!