[ovirt-users] VDSM Won't Start

Charles Tassell ctassell at gmail.com
Wed Apr 13 11:06:05 UTC 2016


Hi Everyone,

   I'm having an issue with one of my hosts.  I was performing a yum 
update when the box crashed and rebooted, which left a lot of packages 
broken.  I resolved the issue by running "yum reinstall" on all the 
packages that were in the update, but since then VDSM won't start.  I 
can't find any error message as to why it won't start, just that it 
won't.  IE, I see this in /var/log/messages:

Apr 13 07:57:09 ovirt-01 ovirt-ha-agent: 
WARNING:ovirt_hosted_engine_ha.agent.agent.Agent:Restarting agent, 
attempt '5'
Apr 13 07:57:09 ovirt-01 ovirt-ha-agent: 
INFO:ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine:Found 
certificate common name: ovirt-01.local.domain
Apr 13 07:57:09 ovirt-01 ovirt-ha-agent: 
INFO:ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine:Initializing VDSM
Apr 13 07:57:09 ovirt-01 journal: ovirt-ha-agent 
ovirt_hosted_engine_ha.agent.agent.Agent ERROR Service vdsmd is not 
running and the admin is responsible for starting it. Waiting...
Apr 13 07:57:09 ovirt-01 ovirt-ha-agent: 
ERROR:ovirt_hosted_engine_ha.agent.agent.Agent:Service vdsmd is not 
running and the admin is responsible for starting it. Waiting...

And this in /var/log/vdsmd/mom.log (vdsmd.log is a 0 byte file)

2016-04-13 07:58:07,958 - mom - INFO - MOM starting
2016-04-13 07:58:07,992 - mom.HostMonitor - INFO - Host Monitor starting
2016-04-13 07:58:07,992 - mom - INFO - hypervisor interface vdsmxmlrpc
2016-04-13 07:58:08,089 - mom.vdsmInterface - ERROR - Cannot connect to 
VDSM! [Errno 111] Connection refused
2016-04-13 07:58:08,090 - mom.GuestManager - INFO - Guest Manager 
starting: multi-thread
2016-04-13 07:58:08,105 - mom.Policy - INFO - Loaded policy '00-defines'
2016-04-13 07:58:08,109 - mom.vdsmInterface - ERROR - Cannot connect to 
VDSM! [Errno 111] Connection refused
2016-04-13 07:58:08,169 - mom.Policy - INFO - Loaded policy '02-balloon'
2016-04-13 07:58:08,263 - mom.Policy - INFO - Loaded policy '03-ksm'
2016-04-13 07:58:08,339 - mom.HostMonitor - INFO - HostMonitor is ready
2016-04-13 07:58:08,380 - mom.Policy - INFO - Loaded policy '04-cputune'
2016-04-13 07:58:08,381 - mom.PolicyEngine - INFO - Policy Engine starting
2016-04-13 07:58:08,383 - mom.RPCServer - INFO - Using unix socket 
/var/run/vdsm/mom-vdsm.sock
2016-04-13 07:58:08,383 - mom.RPCServer - INFO - RPC Server starting
2016-04-13 07:58:08,884 - mom.RPCServer - INFO - RPC Server ending

Can anyone give me some ideas of how to debug this?  Or even some docs 
on how to properly remove the host from the manager app and re-register 
the guest to see if that fixes things up.





More information about the Users mailing list