Hi,
I'm running an oVirt hosted-engine environment on 3 hosts. To test VMs'
HA functionality, I shutdown host02's link, where one of my HA VMs is
running on, using this command:
2016-05-10 09:59:19 ICT [root@host02 ~]# ip link set bond0 down
Few seconds later, an attempt to fence host02 was issued, and this entry
appears in the web UI event tab "May 10, 2016 10:00:34 ... Executing
power management status on Host hosted_engine_2 using Proxy Host
hosted_engine_1 and Fence Agent ipmilan:172.16.3.5.". The IP
"172.16.3.5" was correct the Fence Agent "ipmilan" was not.
Even though a failure message "May 10, 2016 10:00:36 ... Execution of
power management status on Host hosted_engine_2 using Proxy Host
hosted_engine_1 and Fence Agent ipmilan:172.16.3.5 failed." appears in
the web UI event tab, host02 was successfully powered off.
The last message in the web GUI event tab is "May 10, 2016 10:00:40 AM
... Host hosted_engine_2 is rebooting.", but the host wasn't actually
rebooted - I have to boot it manually using iLo web UI.
How can fix this issue in order to make VMs' HA work?
Thank you.
Here is my power management settings:
hosted_engine_1 -> ilo4 : 172.16.3.4
hosted_engine_2 -> ilo4 : 172.16.3.5
hosted_engine_3 -> ilo4 : 172.16.3.6
Here are the log files:
https://app.box.com/s/fs5let8955rjbcuxuy0p42ixj4dzou6m
[root@engine ~]# rpm -qa | grep ovirt
ovirt-engine-wildfly-8.2.1-1.el7.x86_64
ovirt-engine-setup-plugin-ovirt-engine-common-3.6.5.3-1.el7.centos.noarch
ovirt-vmconsole-1.0.0-1.el7.centos.noarch
ovirt-engine-cli-3.6.2.0-1.el7.centos.noarch
ovirt-engine-setup-plugin-ovirt-engine-3.6.5.3-1.el7.centos.noarch
ovirt-engine-backend-3.6.5.3-1.el7.centos.noarch
ovirt-iso-uploader-3.6.0-1.el7.centos.noarch
ovirt-engine-extensions-api-impl-3.6.5.3-1.el7.centos.noarch
ovirt-host-deploy-1.4.1-1.el7.centos.noarch
ovirt-release36-007-1.noarch
ovirt-engine-sdk-python-3.6.5.0-1.el7.centos.noarch
ovirt-image-uploader-3.6.0-1.el7.centos.noarch
ovirt-engine-extension-aaa-jdbc-1.0.6-1.el7.noarch
ovirt-setup-lib-1.0.1-1.el7.centos.noarch
ovirt-host-deploy-java-1.4.1-1.el7.centos.noarch
ovirt-engine-setup-base-3.6.5.3-1.el7.centos.noarch
ovirt-engine-setup-plugin-websocket-proxy-3.6.5.3-1.el7.centos.noarch
ovirt-engine-tools-backup-3.6.5.3-1.el7.centos.noarch
ovirt-vmconsole-proxy-1.0.0-1.el7.centos.noarch
ovirt-engine-vmconsole-proxy-helper-3.6.5.3-1.el7.centos.noarch
ovirt-engine-setup-3.6.5.3-1.el7.centos.noarch
ovirt-engine-webadmin-portal-3.6.5.3-1.el7.centos.noarch
ovirt-engine-tools-3.6.5.3-1.el7.centos.noarch
ovirt-engine-restapi-3.6.5.3-1.el7.centos.noarch
ovirt-engine-3.6.5.3-1.el7.centos.noarch
ovirt-guest-agent-common-1.0.11-1.el7.noarch
ovirt-engine-wildfly-overlay-8.0.5-1.el7.noarch
ovirt-engine-lib-3.6.5.3-1.el7.centos.noarch
ovirt-engine-websocket-proxy-3.6.5.3-1.el7.centos.noarch
ovirt-engine-setup-plugin-vmconsole-proxy-helper-3.6.5.3-1.el7.centos.noarch
ovirt-engine-userportal-3.6.5.3-1.el7.centos.noarch
ovirt-engine-dbscripts-3.6.5.3-1.el7.centos.noarch
[root@host03 ~]# rpm -qa | grep ovirt
ovirt-vmconsole-1.0.0-1.el7.centos.noarch
ovirt-host-deploy-1.4.1-1.el7.centos.noarch
ovirt-vmconsole-host-1.0.0-1.el7.centos.noarch
ovirt-engine-sdk-python-3.6.5.0-1.el7.centos.noarch
ovirt-hosted-engine-ha-1.3.5.3-1.1.el7.noarch
libgovirt-0.3.3-1.el7_2.1.x86_64
ovirt-hosted-engine-setup-1.3.5.0-1.1.el7.noarch
ovirt-setup-lib-1.0.1-1.el7.centos.noarch
[root@host03 ~]# rpm -qa | grep vdsm
vdsm-cli-4.17.26-1.el7.noarch
vdsm-4.17.26-1.el7.noarch
vdsm-infra-4.17.26-1.el7.noarch
vdsm-xmlrpc-4.17.26-1.el7.noarch
vdsm-yajsonrpc-4.17.26-1.el7.noarch
vdsm-hook-vmfex-dev-4.17.26-1.el7.noarch
vdsm-python-4.17.26-1.el7.noarch
vdsm-jsonrpc-4.17.26-1.el7.noarch
--
Wee