Host failed to attach one of the Storage Domains attached to it

Hello,

We have a small oVirt deployment with 2 Intel hosts and 1 AMD host, therefore we defined 2 clusters. One of the Intel machines also runs the management (we call this host SOL). The deployment is connected to a NAS through NFS shares.

Suddenly (no more than 2 days ago) SOL started being reported as "Non operational", and its Action Items say "Host failed to attach one of the Storage Domains attached to it." All other machines seem to be fine.

In the past we encountered similar errors after an (accidental) reboot of the hosts, but that is not the case now:

    [root@sol ~]# uptime
     08:10:48 up 9 days, 15:48, 1 user, load average: 0.08, 0.05, 0.12

The "funny" part is that the virtual machines already started on SOL are still running without any problems.

I am not sure which logs to check.

In /var/log/vdsm/vdsm.log we have this error, but I do not think it is relevant:

    Thread-38964::ERROR::2015-07-05 08:37:24,882::__init__::506::jsonrpc.JsonRpcServer::(_serveRequest) Internal server error
    Traceback (most recent call last):
      File "/usr/lib/python2.6/site-packages/yajsonrpc/__init__.py", line 501, in _serveRequest
        res = method(**params)
      File "/usr/share/vdsm/rpc/Bridge.py", line 271, in _dynamicMethod
        result = fn(*methodArgs)
      File "/usr/share/vdsm/API.py", line 1330, in getStats
        stats.update(self._cif.mom.getKsmStats())
      File "/usr/share/vdsm/momIF.py", line 60, in getKsmStats
        stats = self._mom.getStatistics()['host']
      File "/usr/lib/python2.6/site-packages/mom/MOMFuncs.py", line 75, in getStatistics
        host_stats = self.threads['host_monitor'].interrogate().statistics[-1]
    AttributeError: 'NoneType' object has no attribute 'statistics'

Thank you!
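For anyone triaging the same log, here is a minimal sketch for pulling only the ERROR records out of vdsm.log. It assumes every record follows the `::`-separated layout of the single ERROR line quoted above (thread, level, timestamp, module, line number, logger, then "(function) message"); that layout is inferred from that one line only, and the names `VdsmLogRecord`, `parse_vdsm_line` and `error_records` are hypothetical helpers, not part of VDSM.

```python
from typing import Iterable, Iterator, NamedTuple, Optional


class VdsmLogRecord(NamedTuple):
    """One parsed log record (hypothetical structure, not a VDSM type)."""
    thread: str
    level: str
    timestamp: str
    module: str
    lineno: int
    logger: str
    function: str
    message: str


def parse_vdsm_line(line: str) -> Optional[VdsmLogRecord]:
    """Parse one line; return None for lines that do not match the assumed layout."""
    # The timestamp contains only single colons, so splitting on "::" is safe.
    parts = line.rstrip("\n").split("::", 6)
    if len(parts) != 7 or not parts[4].isdigit():
        return None  # e.g. a traceback continuation line
    thread, level, timestamp, module, lineno, logger, rest = parts
    function, message = "", rest
    if rest.startswith("("):
        head, sep, tail = rest.partition(") ")
        if sep:
            function, message = head[1:], tail
    return VdsmLogRecord(thread, level, timestamp, module,
                         int(lineno), logger, function, message)


def error_records(lines: Iterable[str]) -> Iterator[VdsmLogRecord]:
    """Yield only the records whose level is ERROR."""
    for line in lines:
        record = parse_vdsm_line(line)
        if record is not None and record.level == "ERROR":
            yield record


# The ERROR line from the message above, followed by a traceback line.
sample = [
    "Thread-38964::ERROR::2015-07-05 08:37:24,882::__init__::506::"
    "jsonrpc.JsonRpcServer::(_serveRequest) Internal server error",
    "Traceback (most recent call last):",
]

for rec in error_records(sample):
    print(rec.timestamp, rec.logger, rec.function, rec.message)
```

On the host itself, the same generator could be fed an open file handle for /var/log/vdsm/vdsm.log instead of the `sample` list; lines that do not match the assumed layout are skipped rather than raising.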
participants (1)
-
Alin Ilie