I have a two-node cluster with one host (ovirt1) running the oVirt engine and the other
running the most recent oVirt Node (4.3.2 from March 2019) software. The engine host has
the following information:
OS Version:
RHEL - 7 - 6.1810.2.el7.centos
OS Description:
CentOS Linux 7 (Core)
Kernel Version: 3.10.0 - 957.12.2.el7.x86_64
KVM Version: 2.12.0 - 18.el7_6.5.1
LIBVIRT Version: libvirt-4.5.0-10.el7_6.9
VDSM Version: vdsm-4.30.13-1.el7
SPICE Version: 0.14.0 - 6.el7_6.1
GlusterFS Version: [N/A]
CEPH Version: librbd1-10.2.5-4.el7
Open vSwitch Version: openvswitch-2.10.1-3.el7
Kernel Features: PTI: 1, IBRS: 0, RETP: 1, SSBD: 3
VNC Encryption: Enabled
Aside from the other issue I raised of seeing the "Uncaught exception occurred.
Please try reloading the page. Details: (TypeError) : Cannot read property 'a' of
null
Please have your administrator check the UI logs" error on the main host, ovirt1
appears to functioning normally.
The node (ovirt2) however is having consistent problems. The follow sequence of events is
reproducible and is causing the host to enter a "NonOperational" state on the
cluster:
* Host ovirt2 installed
* VDSM ovirt2 command ConnectStorageServerVDS failed: Message timeout which can be caused
by communication issues
* Host ovirt2 is not responding. Host cannot be fenced automatically because power
management for the host is disabled.
* Host ovirt2 cannot access the Storage Domain(s) <UNKNOWN> attached to the Data
Center DataCenter1. Setting Host state to Non-Operational. (5/27/1912:43:22 PM)
* (Banner appears in GUI) Failed Activating Host
ovirt2.witsconsult.com
* Failed to connect Host ovirt2 to Storage Pool DataCenter1 (5/27/1912:47:07 PM)
* Host ovirt2 cannot access the Storage Domain(s) <UNKNOWN> attached to the Data
Center DataCenter1. Setting Host state to Non-Operational. (5/27/1912:47:07 PM)
* Host ovirt2 is not responding. Host cannot be fenced automatically because power
management for the host is disabled. (5/27/1912:47:07 PM)
* VDSM ovirt2 command ConnectStorageServerVDS failed: Message timeout which can be caused
by communication issues (5/27/1912:47:07 PM)
I can then re-activate ovirt2, which appears as green for approximately 5 minutes and then
repeats all of the above issues.
What can I do to troubleshoot this?