This is a multi-part message in MIME format.
--------------050503030109030902030106
Content-Type: text/plain; charset=utf-8; format=flowed
Content-Transfer-Encoding: 7bit
Hello,
We have a small oVirt deplopyment, with 2 INTEL CPUs and 1 AMD, therefor
we defined 2 clusters.
One of the Intel machines is running the management as well (We called
this SOL).
This deployment is connected to a NAS through NFS shares.
Suddenly (no more than 2 days) SOL is reported as "Non operational" and
in the Action Items is saying "
Host failed to attach one of the Storage Domains attached to it."
All other machines seems to be fine.
In the past we encountered similar errors after (accidental) reboot of
the hosts, but now it is not the case:
[root@sol ~]# uptime
08:10:48 up 9 days, 15:48, 1 user, load average: 0.08, 0.05, 0.12
The "funny" is that the virtual machines already started on SOL are
still running without any problems.
I am not sure what logs to check.
In /var/log/vdsm/vdsm.log we have this error, but I do not think it is
relevant:
Thread-38964::ERROR::2015-07-05
08:37:24,882::__init__::506::jsonrpc.JsonRpcServer::(_serveRequest)
Internal server error
Traceback (most recent call last):
File "/usr/lib/python2.6/site-packages/yajsonrpc/__init__.py", line
501, in _serveRequest
res = method(**params)
File "/usr/share/vdsm/rpc/Bridge.py", line 271, in _dynamicMethod
result = fn(*methodArgs)
File "/usr/share/vdsm/API.py", line 1330, in getStats
stats.update(self._cif.mom.getKsmStats())
File "/usr/share/vdsm/momIF.py", line 60, in getKsmStats
stats = self._mom.getStatistics()['host']
File "/usr/lib/python2.6/site-packages/mom/MOMFuncs.py", line 75, in
getStatistics
host_stats = self.threads['host_monitor'].interrogate().statistics[-1]
AttributeError: 'NoneType' object has no attribute 'statistics'
Thank you !
--------------050503030109030902030106
Content-Type: text/html; charset=utf-8
Content-Transfer-Encoding: 8bit
<html>
<head>
<meta http-equiv="content-type" content="text/html;
charset=utf-8">
</head>
<body bgcolor="#FFFFFF" text="#000000">
Hello,<br>
<br>
We have a small oVirt deplopyment, with 2 INTEL CPUs and 1 AMD,
therefor we defined 2 clusters.<br>
One of the Intel machines is running the management as well (We
called this SOL).<br>
This deployment is connected to a NAS through NFS shares.<br>
<br>
Suddenly (no more than 2 days) SOL is reported as "Non operational"
and in the Action Items is saying "<img class="gwt-Image"
onload='this.__gwtLastUnhandledEvent="load";'
src="cid:part1.08080605.07050106@matrixrom.com" style="width:
14px; height: 12px; background: transparent
url("data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAA4AAAAMCAYAAABSgIzaAAAAKklEQVR42mNgwAL+n0n7/3+mMRifSTP+z0AsGNVIC40gxciYgeZgcPoRAJrmd4n8GZZPAAAAAElFTkSuQmCC")
no-repeat scroll 0px 0px; display: inline;" border="0">
<div>
<div style="display: inline;" class="gwt-Label">Host
failed to
attach one of the Storage Domains attached to it."<br>
All other machines seems to be fine.<br>
<br>
In the past we encountered similar errors after (accidental)
reboot of the hosts, but now it is not the case:<br>
[root@sol ~]# uptime<br>
08:10:48 up 9 days, 15:48, 1 user, load average: 0.08, 0.05,
0.12<br>
<br>
The "funny" is that the virtual machines already started on SOL
are still running without any problems.<br>
<br>
I am not sure what logs to check.<br>
<br>
In /var/log/vdsm/vdsm.log we have this error, but I do not think
it is relevant:<br>
Thread-38964::ERROR::2015-07-05
08:37:24,882::__init__::506::jsonrpc.JsonRpcServer::(_serveRequest)
Internal server error<br>
Traceback (most recent call last):<br>
File "/usr/lib/python2.6/site-packages/yajsonrpc/__init__.py",
line 501, in _serveRequest<br>
res = method(**params)<br>
File "/usr/share/vdsm/rpc/Bridge.py", line 271, in
_dynamicMethod<br>
result = fn(*methodArgs)<br>
File "/usr/share/vdsm/API.py", line 1330, in getStats<br>
stats.update(self._cif.mom.getKsmStats())<br>
File "/usr/share/vdsm/momIF.py", line 60, in getKsmStats<br>
stats = self._mom.getStatistics()['host']<br>
File "/usr/lib/python2.6/site-packages/mom/MOMFuncs.py", line
75, in getStatistics<br>
host_stats =
self.threads['host_monitor'].interrogate().statistics[-1]<br>
AttributeError: 'NoneType' object has no attribute
'statistics'<br>
<br>
Thank you !<br>
</div>
</div>
</body>
</html>
--------------050503030109030902030106--