[ovirt-users] repeating EngineUnexpectedlyDown/EngineDown/EngineStart/EngineStarting

Simone Tiraboschi stirabos at redhat.com
Thu Oct 29 13:08:22 UTC 2015


Hi Robert,
it seams that two hosts are fighting fir the same host ID:

MainThread::INFO::2015-10-27
09:14:56,764::hosted_engine::562::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(_initialize_sanlock)
Ensuring lease for lockspace hosted-engine, host id 1 is acquired (file:
/var/run/vdsm/storage/2daba0ab-2b3d-4026-bcfc-1cd071c30038/04b08c8e-657f-4bac-9ddf-c9c57373409c/2d7f5020-42c1-442d-8237-fba9d6787080)
MainThread::ERROR::2015-10-27
09:14:56,766::hosted_engine::578::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(_initialize_sanlock)
cannot get lock on host id 1: host already holds lock on a different host id
MainThread::ERROR::2015-10-27
09:14:56,767::agent::177::ovirt_hosted_engine_ha.agent.agent.Agent::(_run_agent)
Error: '(22, 'Sanlock lockspace add failure', 'Invalid argument')' - trying
to restart agent
MainThread::WARNING::2015-10-27
09:15:01,772::agent::180::ovirt_hosted_engine_ha.agent.agent.Agent::(_run_agent)
Restarting agent, attempt '9'
MainThread::ERROR::2015-10-27
09:15:01,772::agent::182::ovirt_hosted_engine_ha.agent.agent.Agent::(_run_agent)
Too many errors occurred, giving up. Please review the log and consider
filing a bug.
MainThread::INFO::2015-10-27
09:15:01,773::agent::121::ovirt_hosted_engine_ha.agent.agent.Agent::(run)
Agent shutting down

can you please share the output of: hosted-engine --vm-status


On Thu, Oct 29, 2015 at 1:57 PM, Robert Story <rstory at tislabs.com> wrote:

> On Tue, 27 Oct 2015 09:45:28 -0400 Robert wrote:
> RS> I have oVirt 3.5.4 on CentOS 7.1 hosts, and everyone once in a while
> RS> one of my hosts starts sending me the 4 engine status messages above
> RS> about every 10-15 minutes.
>
> I upgraded the engine and all hosts to 3.5.5, and then 2 hosts started
> sending me 4 emails every 10-15 minutes. Currently I'm running with the
> engine in global maintenance to keep my inbox from overflowing with these
> messages.
>
> Any suggestions on how to get this under control appreciated...
>
>
> Robert
>
> --
> Senior Software Engineer @ Parsons
>
> _______________________________________________
> Users mailing list
> Users at ovirt.org
> http://lists.ovirt.org/mailman/listinfo/users
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.ovirt.org/pipermail/users/attachments/20151029/249d9dc0/attachment-0001.html>


More information about the Users mailing list