Hi,
Just run into the issue during cluster upgrade from 4.24 to 4.2.6.1. I'm
running small cluster with 2 hosts and gluster storage. Once I upgraded one
of the hosts to 4.2.6.1 something went wrong (looks like it tried to start
HE instance) and I can't connect to hosted-engine any longer.
As I can see HostedEngine is still running on the second host (and another
yet 7 VM's) , but I can't stop it.
ovirt-ha-agent and ovirt-ha-broker are failing to start. hosted-engine
--vm-status gives nothing but error message
"The hosted engine configuration has not been retrieved from shared
storage. Please ensure that ovirt-ha-agent is running and the storage
server is reachable."
ps -ef shows plenty of vdsm processes in defunc state thats probably the
reason why agent and brocker can't start. Just wondering that is the good
way to start problem resolution here to minimize downtime for running VM's?
Restart vdsm and try again restarting agent and broker or just reboot the
whole host?
Regards,
Artem