<html><head></head><body><div style="font-family:Helvetica Neue, Helvetica, Arial, sans-serif;font-size:10px;"><div>Hello all,</div><div><br></div><div>I just upgraded my oVirt instance to 4.2. The engine upgrade completed successfully, but after I upgraded the hosts the HA broker will not start.&nbsp; The 2 hosts run CentOS 7.4 with Gluster and CTDB.&nbsp; The VIPs are up and reachable from both hosts, and I can mount the Gluster storage. &nbsp; </div><div><br></div><div>The error from agent.log: </div><div><br></div><div>MainThread::INFO::2017-12-20 21:02:19,219::agent::67::ovirt_hosted_engine_ha.agent.agent.Agent::(run) ovirt-hosted-engine-ha agent 2.2.2 started<br>MainThread::INFO::2017-12-20 21:02:19,346::hosted_engine::243::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(_get_hostname) Found certificate common name: hm3svr01.hm3.loc<br>MainThread::INFO::2017-12-20 21:02:20,478::hosted_engine::525::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(_initialize_broker) Initializing ha-broker connection<br>MainThread::INFO::2017-12-20 21:02:20,482::brokerlink::77::ovirt_hosted_engine_ha.lib.brokerlink.BrokerLink::(start_monitor) Starting monitor ping, options {'addr': '192.168.3.1'}<br>MainThread::ERROR::2017-12-20 21:02:20,483::hosted_engine::538::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(_initialize_broker) Failed to start necessary monitors<br>MainThread::ERROR::2017-12-20 21:02:20,485::agent::144::ovirt_hosted_engine_ha.agent.agent.Agent::(_run_agent) Traceback (most recent call last):<br>&nbsp; File "/usr/lib/python2.7/site-packages/ovirt_hosted_engine_ha/agent/agent.py", line 131, in _run_agent<br>&nbsp;&nbsp;&nbsp; return action(he)<br>&nbsp; File "/usr/lib/python2.7/site-packages/ovirt_hosted_engine_ha/agent/agent.py", line 55, in action_proper<br>&nbsp;&nbsp;&nbsp; return he.start_monitoring()<br>&nbsp; File "/usr/lib/python2.7/site-packages/ovirt_hosted_engine_ha/agent/hosted_engine.py", line 416, 
in start_monitoring<br>&nbsp;&nbsp;&nbsp; self._initialize_broker()<br>&nbsp; File "/usr/lib/python2.7/site-packages/ovirt_hosted_engine_ha/agent/hosted_engine.py", line 535, in _initialize_broker<br>&nbsp;&nbsp;&nbsp; m.get('options', {}))<br>&nbsp; File "/usr/lib/python2.7/site-packages/ovirt_hosted_engine_ha/lib/brokerlink.py", line 83, in start_monitor<br>&nbsp;&nbsp;&nbsp; .format(type, options, e))<br>RequestError: Failed to start monitor ping, options {'addr': '192.168.x.x'}: [Errno 2] No such file or directory<br><div><br></div><div><br></div><div>The broker.log:</div><div><br></div><div>MainThread::INFO::2017-12-20 23:06:19,405::monitor::50::ovirt_hosted_engine_ha.broker.monitor.Monitor::(_discover_submonitors) Finished loading submonitors<br>MainThread::INFO::2017-12-20 23:06:20,324::storage_backends::346::ovirt_hosted_engine_ha.lib.storage_backends::(connect) Connecting the storage<br>MainThread::INFO::2017-12-20 23:06:20,325::storage_server::252::ovirt_hosted_engine_ha.lib.storage_server.StorageServer::(connect_storage_server) Connecting storage server<br>MainThread::INFO::2017-12-20 23:06:20,849::storage_server::259::ovirt_hosted_engine_ha.lib.storage_server.StorageServer::(connect_storage_server) Connecting storage server<br>MainThread::WARNING::2017-12-20 23:06:20,913::storage_broker::96::ovirt_hosted_engine_ha.broker.storage_broker.StorageBroker::(__init__) Can't connect vdsm storage: Connection to storage server failed <br>MainThread::INFO::2017-12-20 23:06:22,087::broker::45::ovirt_hosted_engine_ha.broker.broker.Broker::(run) ovirt-hosted-engine-ha broker 2.2.2 started<br>MainThread::INFO::2017-12-20 23:06:22,088::monitor::40::ovirt_hosted_engine_ha.broker.monitor.Monitor::(_discover_submonitors) Searching for submonitors in /usr/lib/python2.7/site-packages/ovirt_hosted_engine_ha/broker/submonitors<br>MainThread::INFO::2017-12-20 23:06:22,089::monitor::49::ovirt_hosted_engine_ha.broker.monitor.Monitor::(_discover_submonitors) Loaded submonitor 
cpu-load<br>MainThread::INFO::2017-12-20 23:06:22,093::monitor::49::ovirt_hosted_engine_ha.broker.monitor.Monitor::(_discover_submonitors) Loaded submonitor cpu-load-no-engine<br>MainThread::INFO::2017-12-20 23:06:22,146::monitor::49::ovirt_hosted_engine_ha.broker.monitor.Monitor::(_discover_submonitors) Loaded submonitor engine-health<br>MainThread::INFO::2017-12-20 23:06:22,147::monitor::49::ovirt_hosted_engine_ha.broker.monitor.Monitor::(_discover_submonitors) Loaded submonitor mem-free<br>MainThread::INFO::2017-12-20 23:06:22,147::monitor::49::ovirt_hosted_engine_ha.broker.monitor.Monitor::(_discover_submonitors) Loaded submonitor mem-load<br>MainThread::INFO::2017-12-20 23:06:22,148::monitor::49::ovirt_hosted_engine_ha.broker.monitor.Monitor::(_discover_submonitors) Loaded submonitor mgmt-bridge<br>MainThread::INFO::2017-12-20 23:06:22,149::monitor::49::ovirt_hosted_engine_ha.broker.monitor.Monitor::(_discover_submonitors) Loaded submonitor ping<br>MainThread::INFO::2017-12-20 23:06:22,149::monitor::49::ovirt_hosted_engine_ha.broker.monitor.Monitor::(_discover_submonitors) Loaded submonitor storage-domain<br>MainThread::INFO::2017-12-20 23:06:22,150::monitor::49::ovirt_hosted_engine_ha.broker.monitor.Monitor::(_discover_submonitors) Loaded submonitor cpu-load<br>MainThread::INFO::2017-12-20 23:06:22,151::monitor::49::ovirt_hosted_engine_ha.broker.monitor.Monitor::(_discover_submonitors) Loaded submonitor cpu-load-no-engine<br>MainThread::INFO::2017-12-20 23:06:22,152::monitor::49::ovirt_hosted_engine_ha.broker.monitor.Monitor::(_discover_submonitors) Loaded submonitor engine-health<br>MainThread::INFO::2017-12-20 23:06:22,153::monitor::49::ovirt_hosted_engine_ha.broker.monitor.Monitor::(_discover_submonitors) Loaded submonitor mem-free<br>MainThread::INFO::2017-12-20 23:06:22,153::monitor::49::ovirt_hosted_engine_ha.broker.monitor.Monitor::(_discover_submonitors) Loaded submonitor mem-load<br>MainThread::INFO::2017-12-20 
23:06:22,154::monitor::49::ovirt_hosted_engine_ha.broker.monitor.Monitor::(_discover_submonitors) Loaded submonitor mgmt-bridge<br>MainThread::INFO::2017-12-20 23:06:22,154::monitor::49::ovirt_hosted_engine_ha.broker.monitor.Monitor::(_discover_submonitors) Loaded submonitor ping<br>MainThread::INFO::2017-12-20 23:06:22,155::monitor::49::ovirt_hosted_engine_ha.broker.monitor.Monitor::(_discover_submonitors) Loaded submonitor storage-domain<br><br><div><br></div><div>The VDSM log has a lot of JSON-RPC errors, with the storage failing:<br>2017-12-20 23:13:00,311-0500 INFO&nbsp; (jsonrpc/6) [vdsm.api] FINISH getStorageDomainInfo error=Storage domain does not exist: (u'1cc6cc89-571e-4b6a-9d41-c742d763e1cc',) from=::1,54630, task_id=ff009157-48f3-480c-b8fe-b8d0a791c922 (api:50)<br>2017-12-20 23:13:00,312-0500 ERROR (jsonrpc/6) [storage.TaskManager.Task] (Task='ff009157-48f3-480c-b8fe-b8d0a791c922') Unexpected error (task:875)<br>2017-12-20 23:13:00,314-0500 ERROR (jsonrpc/6) [storage.Dispatcher] FINISH getStorageDomainInfo error=Storage domain does not exist: (u'1cc6cc89-571e-4b6a-9d41-c742d763e1cc',) (dispatcher:82)<br>2017-12-20 23:13:00,314-0500 INFO&nbsp; (jsonrpc/6) [jsonrpc.JsonRpcServer] RPC call StorageDomain.getInfo failed (error 358) in 0.48 seconds (__init__:573)<br>&nbsp;&nbsp;&nbsp; raise convert_to_error(kind, result)<br>2017-12-20 23:13:03,092-0500 INFO&nbsp; (jsonrpc/3) [vdsm.api] FINISH getStorageDomainInfo error=Storage domain does not exist: (u'1cc6cc89-571e-4b6a-9d41-c742d763e1cc',) from=::1,54632, task_id=39e022e5-db99-4bc4-88e1-9a218104b3c7 (api:50)<br>2017-12-20 23:13:03,093-0500 ERROR (jsonrpc/3) [storage.TaskManager.Task] (Task='39e022e5-db99-4bc4-88e1-9a218104b3c7') Unexpected error (task:875)<br>2017-12-20 23:13:03,095-0500 ERROR (jsonrpc/3) [storage.Dispatcher] FINISH getStorageDomainInfo error=Storage domain does not exist: (u'1cc6cc89-571e-4b6a-9d41-c742d763e1cc',) (dispatcher:82)<br>2017-12-20 23:13:03,095-0500 INFO&nbsp; (jsonrpc/3) 
[jsonrpc.JsonRpcServer] RPC call StorageDomain.getInfo failed (error 358) in 0.49 seconds (__init__:573)<br>&nbsp;&nbsp;&nbsp; raise convert_to_error(kind, result)<br>2017-12-20 23:13:07,568-0500 INFO&nbsp; (jsonrpc/4) [vdsm.api] FINISH getStorageDomainInfo error=Storage domain does not exist: (u'1cc6cc89-571e-4b6a-9d41-c742d763e1cc',) from=::1,54640, task_id=c1b1b1a1-a7e6-494a-bda6-19c617820dec (api:50)<br>2017-12-20 23:13:07,569-0500 ERROR (jsonrpc/4) [storage.TaskManager.Task] (Task='c1b1b1a1-a7e6-494a-bda6-19c617820dec') Unexpected error (task:875)<br>2017-12-20 23:13:07,571-0500 ERROR (jsonrpc/4) [storage.Dispatcher] FINISH getStorageDomainInfo error=Storage domain does not exist: (u'1cc6cc89-571e-4b6a-9d41-c742d763e1cc',) (dispatcher:82)<br>2017-12-20 23:13:07,571-0500 INFO&nbsp; (jsonrpc/4) [jsonrpc.JsonRpcServer] RPC call StorageDomain.getInfo failed (error 358) in 0.48 seconds (__init__:573)<br>&nbsp;&nbsp;&nbsp; raise convert_to_error(kind, result)<br>2017-12-20 23:13:10,323-0500 INFO&nbsp; (jsonrpc/0) [vdsm.api] FINISH getStorageDomainInfo error=Storage domain does not exist: (u'1cc6cc89-571e-4b6a-9d41-c742d763e1cc',) from=::1,54642, task_id=6354fa3d-933c-4fd0-9301-00f8abd29ec7 (api:50)<br>2017-12-20 23:13:10,323-0500 ERROR (jsonrpc/0) [storage.TaskManager.Task] (Task='6354fa3d-933c-4fd0-9301-00f8abd29ec7') Unexpected error (task:875)<br>2017-12-20 23:13:10,325-0500 ERROR (jsonrpc/0) [storage.Dispatcher] FINISH getStorageDomainInfo error=Storage domain does not exist: (u'1cc6cc89-571e-4b6a-9d41-c742d763e1cc',) (dispatcher:82)<br>2017-12-20 23:13:10,326-0500 INFO&nbsp; (jsonrpc/0) [jsonrpc.JsonRpcServer] RPC call StorageDomain.getInfo failed (error 358) in 0.48 seconds (__init__:573)<br><br><div><br></div><div><br></div><div>Any help is appreciated.&nbsp; </div><div><br></div><div>Thanks, Andy<br></div></div><div><br></div><div><br></div>&nbsp; </div><div><br></div><div><br></div><br></div></div></body></html>