So the only two things that jump out are:
1) ovirt-ha-agent is not starting... back to Python silliness that I have no idea how to debug
[root@medusa ~]# systemctl status ovirt-ha-agent.service
● ovirt-ha-agent.service - oVirt Hosted Engine High Availability Monitoring Agent
Loaded: loaded (/usr/lib/systemd/system/ovirt-ha-agent.service; enabled; vendor preset:
disabled)
Active: activating (auto-restart) (Result: exit-code) since Fri 2021-01-15 11:54:52
EST; 6s ago
Process: 16116 ExecStart=/usr/share/ovirt-hosted-engine-ha/ovirt-ha-agent (code=exited,
status=157)
Main PID: 16116 (code=exited, status=157)
[root@medusa ~]# tail /var/log/messages
Jan 15 11:55:02 medusa systemd[1]: Started oVirt Hosted Engine High Availability
Monitoring Agent.
Jan 15 11:55:02 medusa journal[16137]: ovirt-ha-agent
ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine ERROR Failed to start necessary
monitors
Jan 15 11:55:02 medusa journal[16137]: ovirt-ha-agent
ovirt_hosted_engine_ha.agent.agent.Agent ERROR Traceback (most recent call last):#012
File
"/usr/lib/python3.6/site-packages/ovirt_hosted_engine_ha/lib/brokerlink.py",
line 85, in start_monitor#012 response = self._proxy.start_monitor(type, options)#012
File "/usr/lib64/python3.6/xmlrpc/client.py", line 1112, in __call__#012
return self.__send(self.__name, args)#012 File
"/usr/lib64/python3.6/xmlrpc/client.py", line 1452, in __request#012
verbose=self.__verbose#012 File "/usr/lib64/python3.6/xmlrpc/client.py", line
1154, in request#012 return self.single_request(host, handler, request_body,
verbose)#012 File "/usr/lib64/python3.6/xmlrpc/client.py", line 1166, in
single_request#012 http_conn = self.send_request(host, handler, request_body,
verbose)#012 File "/usr/lib64/python3.6/xmlrpc/client.py", line 1279, in
send_request#012 self.send_content(connection, request_body)#012 File
"/usr/lib64/python3.6/xmlrpc/client.py", line 1309, in send_content#012
connection.endheaders(request_body)#012 File
"/usr/lib64/python3.6/http/client.py", line 1264, in endheaders#012
self._send_output(message_body, encode_chunked=encode_chunked)#012 File
"/usr/lib64/python3.6/http/client.py", line 1040, in _send_output#012
self.send(msg)#012 File "/usr/lib64/python3.6/http/client.py", line 978, in
send#012 self.connect()#012 File
"/usr/lib/python3.6/site-packages/ovirt_hosted_engine_ha/lib/unixrpc.py", line
74, in connect#012 self.sock.connect(base64.b16decode(self.host))#012FileNotFoundError:
[Errno 2] No such file or directory#012#012During handling of the above exception, another
exception occurred:#012#012Traceback (most recent call last):#012 File
"/usr/lib/python3.6/site-packages/ovirt_hosted_engine_ha/agent/agent.py", line
131, in _run_agent#012 return action(he)#012 File
"/usr/lib/python3.6/site-packages/ovirt_hosted_engine_ha/agent/agent.py", line
55, in action_proper#012 return he.start_monitoring()#012 File
"/usr/lib/python3.6/site-packages/ovirt_hosted_engine_ha/agent/hosted_engine.py",
line 437, in start_monitoring#012 self._initialize_broker()#012 File
"/usr/lib/python3.6/site-packages/ovirt_hosted_engine_ha/agent/hosted_engine.py",
line 561, in _initialize_broker#012 m.get('options', {}))#012 File
"/usr/lib/python3.6/site-packages/ovirt_hosted_engine_ha/lib/brokerlink.py",
line 91, in start_monitor#012 ).format(t=type, o=options,
e=e)#012ovirt_hosted_engine_ha.lib.exceptions.RequestError: brokerlink - failed to start
monitor via ovirt-ha-broker: [Errno 2] No such file or directory, [monitor:
'network', options: {'addr': '172.16.100.1',
'network_test': 'dns', 'tcp_t_address': '',
'tcp_t_port': ''}]
Jan 15 11:55:02 medusa journal[16137]: ovirt-ha-agent
ovirt_hosted_engine_ha.agent.agent.Agent ERROR Trying to restart agent
Jan 15 11:55:02 medusa systemd[1]: ovirt-ha-agent.service: Main process exited,
code=exited, status=157/n/a
Jan 15 11:55:02 medusa systemd[1]: ovirt-ha-agent.service: Failed with result
'exit-code'.
Jan 15 11:55:05 medusa upsmon[1530]: Poll UPS [nutmonitor@172.16.100.102] failed -
[nutmonitor] does not exist on server 172.16.100.102
Jan 15 11:55:06 medusa vdsm[14589]: WARN unhandled write event
Jan 15 11:55:08 medusa vdsm[14589]: WARN unhandled close event
Jan 15 11:55:10 medusa upsmon[1530]: Poll UPS [nutmonitor@172.16.100.102] failed -
[nutmonitor] does not exist on server 172.16.100.102
2) vdsmd keeps warning "is Hosted Engine setup finished?"... but this may just be a
symptom of the ha-agent failure above
[root@medusa ~]# systemctl status vdsmd.service
● vdsmd.service - Virtual Desktop Server Manager
Loaded: loaded (/usr/lib/systemd/system/vdsmd.service; enabled; vendor preset:
disabled)
Active: active (running) since Fri 2021-01-15 11:49:27 EST; 6min ago
Main PID: 14589 (vdsmd)
Tasks: 72 (limit: 410161)
Memory: 77.8M
CGroup: /system.slice/vdsmd.service
├─14589 /usr/bin/python3 /usr/share/vdsm/vdsmd
├─14686 /usr/libexec/ioprocess --read-pipe-fd 43 --write-pipe-fd 42
--max-threads 10 --max-queued-requests 10
├─14691 /usr/libexec/ioprocess --read-pipe-fd 46 --write-pipe-fd 43
--max-threads 10 --max-queued-requests 10
├─14698 /usr/libexec/ioprocess --read-pipe-fd 51 --write-pipe-fd 50
--max-threads 10 --max-queued-requests 10
├─14705 /usr/libexec/ioprocess --read-pipe-fd 57 --write-pipe-fd 56
--max-threads 10 --max-queued-requests 10
└─14717 /usr/libexec/ioprocess --read-pipe-fd 64 --write-pipe-fd 63
--max-threads 10 --max-queued-requests 10
Jan 15 11:55:43 medusa.penguinpages.local vdsm[14589]: WARN Failed to retrieve Hosted
Engine HA info, is Hosted Engine setup finished?
Jan 15 11:55:45 medusa.penguinpages.local vdsm[14589]: WARN unhandled close event
Jan 15 11:55:55 medusa.penguinpages.local vdsm[14589]: WARN unhandled write event
Jan 15 11:55:57 medusa.penguinpages.local vdsm[14589]: WARN unhandled close event
Jan 15 11:55:58 medusa.penguinpages.local vdsm[14589]: WARN Failed to retrieve Hosted
Engine HA info, is Hosted Engine setup finished?
Jan 15 11:56:07 medusa.penguinpages.local vdsm[14589]: WARN unhandled write event
Jan 15 11:56:09 medusa.penguinpages.local vdsm[14589]: WARN unhandled close event
Jan 15 11:56:14 medusa.penguinpages.local vdsm[14589]: WARN Failed to retrieve Hosted
Engine HA info, is Hosted Engine setup finished?
Jan 15 11:56:19 medusa.penguinpages.local vdsm[14589]: WARN unhandled write event
Jan 15 11:56:21 medusa.penguinpages.local vdsm[14589]: WARN unhandled close event
[root@medusa ~]#
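Side note on reading the traceback under 1): syslog writes embedded newlines as #012 (octal for a line feed), which is why the whole Python traceback ends up on one line in /var/log/messages. Below is a minimal sketch for turning it back into something readable, assuming the raw line has been copied into a string. The function name unescape_syslog is hypothetical (not part of any oVirt tool), and the sample is a shortened excerpt of the line above.

```python
import re


def unescape_syslog(line: str) -> str:
    """Replace syslog-style #NNN octal escapes (e.g. #012 = newline) with the real character."""
    # Assumption: every '#' followed by three octal digits is an escape, not literal text.
    return re.sub(r"#([0-7]{3})", lambda m: chr(int(m.group(1), 8)), line)


# Shortened excerpt of the ovirt-ha-agent traceback from /var/log/messages
sample = (
    "ERROR Traceback (most recent call last):#012  File "
    '"/usr/lib/python3.6/site-packages/ovirt_hosted_engine_ha/lib/unixrpc.py", '
    "line 74, in connect#012    self.sock.connect(base64.b16decode(self.host))"
    "#012FileNotFoundError: [Errno 2] No such file or directory"
)

decoded = unescape_syslog(sample)
print(decoded)
```

Read this way, the innermost failure is the FileNotFoundError raised by sock.connect() in unixrpc.py, i.e. the agent could not find the socket it uses to talk to ovirt-ha-broker. That would fit the "failed to start monitor via ovirt-ha-broker" message, and may mean the broker itself is not running, though I have not confirmed that.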
Is there a command-line way to tell the engine to start, or to start a virtual machine
directly?