[Users] host remains in status unassigned

14 Jul 2013

      Hello,

oVirt 3.2.1 on F18
One mgmt server + one host.

At some point the host had some problem and was rebooted
at reboot ther was a problem because I got

detected unhandled Python exception in '/usr/share/vdsm/vdsm'

watching at
http://www.ovirt.org/Troubleshooting

I verified that for some reason vdsmd.log was root.root.
So I set correct permissions and restarted the node.

Now it stays at "unassigned" status from an engine point of view, that
I don't know what means...
In node messages

Jul 14 16:55:02 management02 vdsm vds WARNING Unable to load the json
rpc server module. Please make sure it is installed.

[root@management02 ~]# systemctl status vdsmd
vdsmd.service - Virtual Desktop Server Manager
      Loaded: loaded (/usr/lib/systemd/system/vdsmd.service; enabled)
      Active: active (running) since Sun 2013-07-14 16:54:58 CEST; 6min ago
     Process: 1015 ExecStart=/lib/systemd/systemd-vdsmd start
(code=exited, status=0/SUCCESS)
    Main PID: 2128 (respawn)
      CGroup: name=systemd:/system/vdsmd.service
          ├─2128 /bin/bash -e /usr/share/vdsm/respawn --minlifetime 10
--daemon --masterpid /var/run/vdsm/respawn....
          ├─2131 /usr/bin/python /usr/share/vdsm/vdsm
          ├─2151 /usr/bin/sudo -n /usr/bin/python
/usr/share/vdsm/supervdsmServer.py 62a4a000-a73a-4e2c-b1a8-25e24...
          └─2152 /usr/bin/python /usr/share/vdsm/supervdsmServer.py
62a4a000-a73a-4e2c-b1a8-25e24587b558 2131 /var...

Jul 14 16:54:58 management02.localdomain.local runuser[2125]:
pam_unix(runuser:session): session opened for user vdsm ...d=0)
Jul 14 16:54:58 management02.localdomain.local runuser[2125]:
pam_unix(runuser:session): session closed for user vdsm
Jul 14 16:54:58 management02.localdomain.local systemd-vdsmd[1015]:
[27B blob data]
Jul 14 16:54:58 management02.localdomain.local systemd[1]: Started
Virtual Desktop Server Manager.
Jul 14 16:54:59 management02.localdomain.local python[2131]:
DIGEST-MD5 client step 2
Jul 14 16:54:59 management02.localdomain.local python[2131]:
DIGEST-MD5 client step 2
Jul 14 16:54:59 management02.localdomain.local python[2131]:
DIGEST-MD5 client step 3
Jul 14 16:54:59 management02.localdomain.local vdsm[2131]: vdsm
fileUtils WARNING Dir /rhev/data-center/mnt already exists
Jul 14 16:55:02 management02.localdomain.local vdsm[2131]: vdsm vds
WARNING Unable to load the json rpc server module....led.
Jul 14 16:55:04 management02.localdomain.local vdsm[2131]: vdsm
fileUtils WARNING Dir /rhev/data-center/hsm-tasks alre...ists

In vdsm.log

torageRefresh::DEBUG::2013-07-14
16:55:01,919::misc::1064::SamplingMethod::(__call__) Returning last
result
MainProcess|storageRefresh::DEBUG::2013-07-14
16:55:01,921::misc::84::Storage.Misc.excCmd::(<lambda>) '/usr/bin/dd
of=/sys/class
/scsi_host/host0/scan' (cwd None)
MainProcess|storageRefresh::DEBUG::2013-07-14
16:55:01,932::iscsi::402::Storage.ISCSI::(forceIScsiScan) Performing
SCSI scan, th
is will take up to 30 seconds
MainThread::INFO::2013-07-14 16:55:02,085::momIF::36::MOM::(__init__)
Starting up MOM
MainThread::INFO::2013-07-14
16:55:02,091::vmChannels::140::vds::(settimeout) Setting channels'
timeout to 30 seconds.
VM Channels Listener::INFO::2013-07-14
16:55:02,091::vmChannels::128::vds::(run) Starting VM channels
listener thread.
MainThread::WARNING::2013-07-14
16:55:02,139::clientIF::175::vds::(_prepareBindings) Unable to load
the json rpc server module. Please make sure it is installed.
storageRefresh::DEBUG::2013-07-14
16:55:04,290::misc::84::Storage.Misc.excCmd::(<lambda>) '/usr/bin/sudo
-n /sbin/multipath' (cwd None)
storageRefresh::DEBUG::2013-07-14
16:55:04,341::misc::84::Storage.Misc.excCmd::(<lambda>) SUCCESS: <err>
= ''; <rc> = 0
storageRefresh::DEBUG::2013-07-14
16:55:04,342::lvm::477::OperationMutex::(_invalidateAllPvs) Operation
'lvm invalidate operation' got the operation mutex
storageRefresh::DEBUG::2013-07-14
16:55:04,342::lvm::479::OperationMutex::(_invalidateAllPvs) Operation
'lvm invalidate operation' released the operation mutex
storageRefresh::DEBUG::2013-07-14
16:55:04,342::lvm::488::OperationMutex::(_invalidateAllVgs) Operation
'lvm invalidate operation' got the operation mutex
storageRefresh::DEBUG::2013-07-14
16:55:04,343::lvm::490::OperationMutex::(_invalidateAllVgs) Operation
'lvm invalidate operation' released the operation mutex
storageRefresh::DEBUG::2013-07-14
16:55:04,343::lvm::508::OperationMutex::(_invalidateAllLvs) Operation
'lvm invalidate operation' got the operation mutex
storageRefresh::DEBUG::2013-07-14
16:55:04,343::lvm::510::OperationMutex::(_invalidateAllLvs) Operation
'lvm invalidate operation' released the operation mutex
storageRefresh::DEBUG::2013-07-14
16:55:04,344::misc::1064::SamplingMethod::(__call__) Returning last
result
storageRefresh::WARNING::2013-07-14
16:55:04,344::fileUtils::185::fileUtils::(createdir) Dir
/rhev/data-center/hsm-tasks already exists

full engine logs and vdsm logs here
https://docs.google.com/file/d/0BwoPbcrMv8mvTXpFNmw3UG9NVzg/edit?usp=sharing
and
https://docs.google.com/file/d/0BwoPbcrMv8mvRDViZGo1S1hRcUU/edit?usp=sharing

thanks for any help.

I should use them tomorrow, they kept ok for many weeks, but Murphy is
always alive ;-)

Gianluca

Gianluca Cecchi

Joop

Gianluca Cecchi

Roy Golan

Gianluca Cecchi

Roy Golan

tags

participants (3)