[Users] Host stuck in unresponsive state

Frank Wall fw at moov.de
Sun Sep 1 14:39:36 UTC 2013


On 01.09.2013 01:28, Frank Wall wrote:
> OK, for some reason it got stuck trying to start "iscsid" and
> "multipathd". I was able to solve the issues with these services and
> now the real error message is visible:

Did some more fiddling... I removed my /etc/multipath.conf and started 
with the new file. Apparently there is a syntax error in this 
auto-generated config:

[root at aio ~]# multipath -ll
Sep 01 00:32:27 | multipath.conf +5, invalid keyword: getuid_callout
Sep 01 00:32:27 | multipath.conf +18, invalid keyword: getuid_callout

OK, I removed lines 5 and 18 and now multipathd is working again. This 
time it was possible to successfully start vdsmd afterwards:

[root at aio ~]# systemctl status vdsmd.service
vdsmd.service - Virtual Desktop Server Manager
    Loaded: loaded (/usr/lib/systemd/system/vdsmd.service; enabled)
    Active: active (running) since So 2013-09-01 16:25:45 CEST; 1min 30s 
ago
   Process: 3138 ExecStart=/lib/systemd/systemd-vdsmd start (code=exited, 
status=0/SUCCESS)
  Main PID: 3285 (respawn)
    CGroup: name=systemd:/system/vdsmd.service
            ├─3285 /bin/bash -e /usr/share/vdsm/respawn --minlifetime 10 
--daemon --masterpid /var/run/vdsm/respawn.pid /us...
            └─3288 /usr/bin/python /usr/share/vdsm/vdsm

Sep 01 16:25:45 aio.exmaple.com python[3288]: DIGEST-MD5 client step 2
Sep 01 16:25:45 aio.exmaple.com python[3288]: DIGEST-MD5 
parse_server_challenge()
Sep 01 16:25:45 aio.exmaple.com python[3288]: DIGEST-MD5 ask_user_info()
Sep 01 16:25:45 aio.exmaple.com vdsm[3288]: vdsm vds WARNING Unable to 
load the json rpc server module. Please make su...alled.
Sep 01 16:25:45 aio.exmaple.com python[3288]: DIGEST-MD5 client step 2
Sep 01 16:25:45 aio.exmaple.com python[3288]: DIGEST-MD5 ask_user_info()
Sep 01 16:25:45 aio.exmaple.com python[3288]: DIGEST-MD5 
make_client_response()
Sep 01 16:25:45 aio.exmaple.com python[3288]: DIGEST-MD5 client step 3
Sep 01 16:25:54 aio.exmaple.com vdsm[3288]: vdsm TaskManager.Task ERROR 
Task=`7fc3840c-1518-4260-9f27-ee20434b5a7a`::U... error
Sep 01 16:25:54 aio.exmaple.com vdsm[3288]: vdsm TaskManager.Task ERROR 
Task=`82f757b5-a669-40fa-b09d-9cad90c971e1`::U... error


Still, this doesn't feel right. I think vdsmd is just too unstable and 
vulnerable. Why did vdsmd core dump with another multipathd config in 
place? Why does it even have this strict dependency on multipathd?

There have been severel similar reports in the last months and I wonder 
if there is a way to make vdsmd just more stable. It would be better to 
have vdsmd started and report an error to ovirt-engine, instead of 
failing to start the vdsmd service all the time. The current behaviour 
makes it hard to debug.


Thanks
- Frank



More information about the Users mailing list