How to restore oVirt nodes to UP from NonResponsive while VMs are running

Hello,
I have an oVirt setup with two nodes that are NonResponsive. All the VMs are running properly, but I can't manage them because they are in the Unknown state. It seems that the nodes lost the connection to their gateway for a while.
My plan is to first restart the node where the engine is not running and try to bring it to UP, then restart the engine from within the VM to see if it starts up on this node.
What is the proper way of restoring management?
Thanks, Best Regards
--
Regards, José Pascual Gallud Martínez | Engineering Dept. <http://telfy.com/>
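Since the symptom points at a network interruption, one early check is whether the engine machine can open a TCP connection to VDSM on each node again. Below is a minimal sketch of that check, not an official oVirt tool. It assumes VDSM listens on its default management port, TCP 54321; the function names and the example host names are hypothetical.

```python
import socket

# Assumption: VDSM listens on its default management port (TCP 54321).
VDSM_PORT = 54321


def can_connect(host, port=VDSM_PORT, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and name-resolution failures.
        return False


def report(hosts, port=VDSM_PORT):
    """Map each host name to its reachability result."""
    return {host: can_connect(host, port) for host in hosts}
```

Run from the engine machine, `report(["node1.example.com", "node2.example.com"])` (placeholder names) returns a dictionary of host to True/False. A False result only says the port is unreachable from that machine; it does not tell you whether the cause is the network, a firewall, or the VDSM service itself.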

Hi, here is my video tutorial with a solution to this problem: https://www.youtube.com/watch?v=vm55caHxRj8
In reply to José Pascual <josepascual@telfy.com>, 27 Jul 2023, 18:53.

Hello,
The node (ovirt2), however, is having consistent problems. The following sequence of events is reproducible and causes the host to enter a "NonOperational" state on the cluster:
* Host ovirt2 installed
* VDSM ovirt2 command ConnectStorageServerVDS failed: Message timeout which can be caused by communication issues
* Host ovirt2 is not responding. Host cannot be fenced automatically because power management for the host is disabled.
* Host ovirt2 cannot access the Storage Domain(s) <UNKNOWN> attached to the Data Center DataCenter1. Setting Host state to Non-Operational. (5/27/19 12:43:22 PM)
* (Banner appears in GUI) Failed Activating Host ovirt2.witsconsult.com
* Failed to connect Host ovirt2 to Storage Pool DataCenter1 (5/27/19 12:47:07 PM)
* Host ovirt2 cannot access the Storage Domain(s) <UNKNOWN> attached to the Data Center DataCenter1. Setting Host state to Non-Operational. (5/27/19 12:47:07 PM)
* Host ovirt2 is not responding. Host cannot be fenced automatically because power management for the host is disabled. (5/27/19 12:47:07 PM)
* VDSM ovirt2 command ConnectStorageServerVDS failed: Message timeout which can be caused by communication issues (5/27/19 12:47:07 PM)
I can then re-activate ovirt2, which appears green for approximately 5 minutes, and then all of the above issues repeat. What can I do to troubleshoot this?
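To put a number on the repeat cycle, the two "Setting Host state to Non-Operational" timestamps from the event list can be subtracted. This is only a small sketch: it assumes the timestamps read as month/day/two-digit-year followed by a 12-hour clock time (5/27/19 12:43:22 PM and 5/27/19 12:47:07 PM), and the helper name is made up for illustration.

```python
from datetime import datetime

# Assumed timestamp layout: month/day/2-digit year, 12-hour clock with AM/PM.
FMT = "%m/%d/%y %I:%M:%S %p"


def seconds_between(start, end):
    """Return the whole number of seconds from start to end."""
    delta = datetime.strptime(end, FMT) - datetime.strptime(start, FMT)
    return int(delta.total_seconds())


first_non_operational = "5/27/19 12:43:22 PM"
second_non_operational = "5/27/19 12:47:07 PM"

# Prints 225, i.e. 3 minutes 45 seconds between the two state changes.
print(seconds_between(first_non_operational, second_non_operational))
```

The two state changes are just under four minutes apart, roughly in line with the "approximately 5 minutes" of green status described above, so the failure looks periodic rather than random.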
Participants (3): Andrei Verovski, carlos.mendes@mgo.cv, José Pascual