
Hello all, I have an ovirt 4.4 installation whit self-hosted engine where the agent seems to have died. The VMs are still running, so I assume that the engine itself is still running (is this a wrong assumption ?). Can I restart the agent without affecting the running VMs, that is how will restarting the agent affect the running VMs ? If I can restart the agent, what's the correct way of doing it ? Thanks, -- Valerio Luccio High Performance Computing 10 Astor Place, Room 416D New York University New York, NY 10003 "In an open world, who needs windows or gates ?"

On Wed, 15 Jun 2022, Valerio Luccio wrote:
I have an ovirt 4.4 installation whit self-hosted engine where the agent seems to have died. The VMs are still running, so I assume that the engine itself is still running (is this a wrong assumption ?). Can I restart the agent without affecting the running VMs, that is how will restarting the agent affect the running VMs ? If I can restart the agent, what's the correct way of doing it ?
If the engine is down, the VMs will continue to run. You just won't be able to start/migrate/configure/etc them. If the engine VM is still running, you may want to SSH into it and look at the state of the system to see if you can see what went wrong. systemctl status 'ovirt*' may tell you if just a single service is down (such as ovirt-engine). You might also check the logs in /var/log/ovirt-engine If the VM is down or inaccessible, SSH into one of the hosts capable of running the engine and run the following to check VM status: hosted-engine --vm-status This should tell you if and whre it's running. If it isn't dead, You can stop it with: hosted-engine --vm-shutdown Check the status and wait until it's actually down, then you can start it up again: hosted-engine --vm-start

Thanks for the reply. I had already checked what you suggested. I didn't realize that the engine was used only for management, I had assumed that if it crashes it would take down all the VMs. ovirt-ha-broker seems to have crashed. In the borker.log I see: RuntimeError: Couldn't connect to VDSM within 60 seconds Thread-3::ERROR::2022-06-16 08:35:02,857::submonitor_base::119::ovirt_hosted_engine_ha.broker.submonitor_base.SubmonitorBase::(_worker) Error executing submonitor mem-free, args {'address': '0', 'use_ssl': 'true'} hosted-engine --vm-status tells me that the agent has stopped. If I do "hosted-engine --vm-start" what will happen to the running VMs ? Since it's only used for management, if it detects that they are already running, will it ignore them and go on it's merry way ? I'm trying to avoid shutting down all the servers if I can help it. On 6/15/22 11:07 PM, Sketch wrote:
On Wed, 15 Jun 2022, Valerio Luccio wrote:
I have an ovirt 4.4 installation whit self-hosted engine where the agent seems to have died. The VMs are still running, so I assume that the engine itself is still running (is this a wrong assumption ?). Can I restart the agent without affecting the running VMs, that is how will restarting the agent affect the running VMs ? If I can restart the agent, what's the correct way of doing it ?
If the engine is down, the VMs will continue to run. You just won't be able to start/migrate/configure/etc them.
If the engine VM is still running, you may want to SSH into it and look at the state of the system to see if you can see what went wrong.
systemctl status 'ovirt*' may tell you if just a single service is down (such as ovirt-engine). You might also check the logs in /var/log/ovirt-engine
If the VM is down or inaccessible, SSH into one of the hosts capable of running the engine and run the following to check VM status:
hosted-engine --vm-status
This should tell you if and whre it's running. If it isn't dead, You can stop it with:
hosted-engine --vm-shutdown
Check the status and wait until it's actually down, then you can start it up again:
hosted-engine --vm-start
-- Valerio Luccio High Performance Computing 10 Astor Place, Room 416D New York University New York, NY 10003 "In an open world, who needs windows or gates ?"
participants (2)
-
Sketch
-
Valerio Luccio