Hello,
oVirt 3.2.1 on F18
One mgmt server + one host.
At some point the host had some problem and was rebooted
at reboot ther was a problem because I got
detected unhandled Python exception in '/usr/share/vdsm/vdsm'
watching at
http://www.ovirt.org/Troubleshooting
I verified that for some reason vdsmd.log was root.root.
So I set correct permissions and restarted the node.
Now it stays at "unassigned" status from an engine point of view, that
I don't know what means...
In node messages
Jul 14 16:55:02 management02 vdsm vds WARNING Unable to load the json
rpc server module. Please make sure it is installed.
[root@management02 ~]# systemctl status vdsmd
vdsmd.service - Virtual Desktop Server Manager
Loaded: loaded (/usr/lib/systemd/system/vdsmd.service; enabled)
Active: active (running) since Sun 2013-07-14 16:54:58 CEST; 6min ago
Process: 1015 ExecStart=/lib/systemd/systemd-vdsmd start
(code=exited, status=0/SUCCESS)
Main PID: 2128 (respawn)
CGroup: name=systemd:/system/vdsmd.service
├─2128 /bin/bash -e /usr/share/vdsm/respawn --minlifetime 10
--daemon --masterpid /var/run/vdsm/respawn....
├─2131 /usr/bin/python /usr/share/vdsm/vdsm
├─2151 /usr/bin/sudo -n /usr/bin/python
/usr/share/vdsm/supervdsmServer.py 62a4a000-a73a-4e2c-b1a8-25e24...
└─2152 /usr/bin/python /usr/share/vdsm/supervdsmServer.py
62a4a000-a73a-4e2c-b1a8-25e24587b558 2131 /var...
Jul 14 16:54:58 management02.localdomain.local runuser[2125]:
pam_unix(runuser:session): session opened for user vdsm ...d=0)
Jul 14 16:54:58 management02.localdomain.local runuser[2125]:
pam_unix(runuser:session): session closed for user vdsm
Jul 14 16:54:58 management02.localdomain.local systemd-vdsmd[1015]:
[27B blob data]
Jul 14 16:54:58 management02.localdomain.local systemd[1]: Started
Virtual Desktop Server Manager.
Jul 14 16:54:59 management02.localdomain.local python[2131]:
DIGEST-MD5 client step 2
Jul 14 16:54:59 management02.localdomain.local python[2131]:
DIGEST-MD5 client step 2
Jul 14 16:54:59 management02.localdomain.local python[2131]:
DIGEST-MD5 client step 3
Jul 14 16:54:59 management02.localdomain.local vdsm[2131]: vdsm
fileUtils WARNING Dir /rhev/data-center/mnt already exists
Jul 14 16:55:02 management02.localdomain.local vdsm[2131]: vdsm vds
WARNING Unable to load the json rpc server module....led.
Jul 14 16:55:04 management02.localdomain.local vdsm[2131]: vdsm
fileUtils WARNING Dir /rhev/data-center/hsm-tasks alre...ists
In vdsm.log
torageRefresh::DEBUG::2013-07-14
16:55:01,919::misc::1064::SamplingMethod::(__call__) Returning last
result
MainProcess|storageRefresh::DEBUG::2013-07-14
16:55:01,921::misc::84::Storage.Misc.excCmd::(<lambda>) '/usr/bin/dd
of=/sys/class
/scsi_host/host0/scan' (cwd None)
MainProcess|storageRefresh::DEBUG::2013-07-14
16:55:01,932::iscsi::402::Storage.ISCSI::(forceIScsiScan) Performing
SCSI scan, th
is will take up to 30 seconds
MainThread::INFO::2013-07-14 16:55:02,085::momIF::36::MOM::(__init__)
Starting up MOM
MainThread::INFO::2013-07-14
16:55:02,091::vmChannels::140::vds::(settimeout) Setting channels'
timeout to 30 seconds.
VM Channels Listener::INFO::2013-07-14
16:55:02,091::vmChannels::128::vds::(run) Starting VM channels
listener thread.
MainThread::WARNING::2013-07-14
16:55:02,139::clientIF::175::vds::(_prepareBindings) Unable to load
the json rpc server module. Please make sure it is installed.
storageRefresh::DEBUG::2013-07-14
16:55:04,290::misc::84::Storage.Misc.excCmd::(<lambda>) '/usr/bin/sudo
-n /sbin/multipath' (cwd None)
storageRefresh::DEBUG::2013-07-14
16:55:04,341::misc::84::Storage.Misc.excCmd::(<lambda>) SUCCESS: <err>
= ''; <rc> = 0
storageRefresh::DEBUG::2013-07-14
16:55:04,342::lvm::477::OperationMutex::(_invalidateAllPvs) Operation
'lvm invalidate operation' got the operation mutex
storageRefresh::DEBUG::2013-07-14
16:55:04,342::lvm::479::OperationMutex::(_invalidateAllPvs) Operation
'lvm invalidate operation' released the operation mutex
storageRefresh::DEBUG::2013-07-14
16:55:04,342::lvm::488::OperationMutex::(_invalidateAllVgs) Operation
'lvm invalidate operation' got the operation mutex
storageRefresh::DEBUG::2013-07-14
16:55:04,343::lvm::490::OperationMutex::(_invalidateAllVgs) Operation
'lvm invalidate operation' released the operation mutex
storageRefresh::DEBUG::2013-07-14
16:55:04,343::lvm::508::OperationMutex::(_invalidateAllLvs) Operation
'lvm invalidate operation' got the operation mutex
storageRefresh::DEBUG::2013-07-14
16:55:04,343::lvm::510::OperationMutex::(_invalidateAllLvs) Operation
'lvm invalidate operation' released the operation mutex
storageRefresh::DEBUG::2013-07-14
16:55:04,344::misc::1064::SamplingMethod::(__call__) Returning last
result
storageRefresh::WARNING::2013-07-14
16:55:04,344::fileUtils::185::fileUtils::(createdir) Dir
/rhev/data-center/hsm-tasks already exists
full engine logs and vdsm logs here
https://docs.google.com/file/d/0BwoPbcrMv8mvTXpFNmw3UG9NVzg/edit?usp=sharing
and
https://docs.google.com/file/d/0BwoPbcrMv8mvRDViZGo1S1hRcUU/edit?usp=sharing
thanks for any help.
I should use them tomorrow, they kept ok for many weeks, but Murphy is
always alive ;-)
Gianluca