On Wed, Apr 13, 2016 at 1:06 PM, Charles Tassell <ctassell(a)gmail.com> wrote:
Hi Everyone,
I'm having an issue with one of my hosts. I was performing a yum update
when the box crashed and rebooted, which left a lot of packages broken. I
resolved the issue by running "yum reinstall" on all the packages that were
in the update, but since then VDSM won't start. I can't find any error
message as to why it won't start, just that it won't. IE, I see this in
/var/log/messages:
Apr 13 07:57:09 ovirt-01 ovirt-ha-agent:
WARNING:ovirt_hosted_engine_ha.agent.agent.Agent:Restarting agent, attempt
'5'
Apr 13 07:57:09 ovirt-01 ovirt-ha-agent:
INFO:ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine:Found
certificate common name: ovirt-01.local.domain
Apr 13 07:57:09 ovirt-01 ovirt-ha-agent:
INFO:ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine:Initializing
VDSM
Apr 13 07:57:09 ovirt-01 journal: ovirt-ha-agent
ovirt_hosted_engine_ha.agent.agent.Agent ERROR Service vdsmd is not running
and the admin is responsible for starting it. Waiting...
Apr 13 07:57:09 ovirt-01 ovirt-ha-agent:
ERROR:ovirt_hosted_engine_ha.agent.agent.Agent:Service vdsmd is not running
and the admin is responsible for starting it. Waiting...
And this in /var/log/vdsmd/mom.log (vdsmd.log is a 0 byte file)
2016-04-13 07:58:07,958 - mom - INFO - MOM starting
2016-04-13 07:58:07,992 - mom.HostMonitor - INFO - Host Monitor starting
2016-04-13 07:58:07,992 - mom - INFO - hypervisor interface vdsmxmlrpc
2016-04-13 07:58:08,089 - mom.vdsmInterface - ERROR - Cannot connect to
VDSM! [Errno 111] Connection refused
2016-04-13 07:58:08,090 - mom.GuestManager - INFO - Guest Manager starting:
multi-thread
2016-04-13 07:58:08,105 - mom.Policy - INFO - Loaded policy '00-defines'
2016-04-13 07:58:08,109 - mom.vdsmInterface - ERROR - Cannot connect to
VDSM! [Errno 111] Connection refused
2016-04-13 07:58:08,169 - mom.Policy - INFO - Loaded policy '02-balloon'
2016-04-13 07:58:08,263 - mom.Policy - INFO - Loaded policy '03-ksm'
2016-04-13 07:58:08,339 - mom.HostMonitor - INFO - HostMonitor is ready
2016-04-13 07:58:08,380 - mom.Policy - INFO - Loaded policy '04-cputune'
2016-04-13 07:58:08,381 - mom.PolicyEngine - INFO - Policy Engine starting
2016-04-13 07:58:08,383 - mom.RPCServer - INFO - Using unix socket
/var/run/vdsm/mom-vdsm.sock
2016-04-13 07:58:08,383 - mom.RPCServer - INFO - RPC Server starting
2016-04-13 07:58:08,884 - mom.RPCServer - INFO - RPC Server ending
Can anyone give me some ideas of how to debug this? Or even some docs on
how to properly remove the host from the manager app and re-register the
guest to see if that fixes things up.
You can run hosted-engine --deploy again and choose to redeploy using
the same host-id: it will reconfigure VDSM for you.
_______________________________________________
Users mailing list
Users(a)ovirt.org
http://lists.ovirt.org/mailman/listinfo/users