It seems my first post didn't go through correctly, probably my fault.
Here is the information again, just in case it failed completely.
I tend to go through and check everything possible. I check all the
features of a fresh system before I start to add new VMs. So here is
the trouble I am having.
I have a 3-node cluster. Each node is running the oVirt Node ISO,
version 4.3.0 upgraded to version 4.3.1. The hosted engine matches that
version. The only VM on the entire cluster is the hosted engine. I have
over 140GB of memory free and 1.2TB of HDD.
If I put a host into Maintenance, the hosted engine will automatically
migrate; it takes about 5 minutes. If I try a manual migration 1 hour
later, it fails at 99%.
Here is the event log from the host.
Migrate Hosted Engine: Started @ 6:34:25 AM From example1.com to example2.com
@ 7:23 AM it still has not completed. Has been @ 99% for 20 minutes.
Migration failed (VM: HostedEngine, Source: example1.com, Destination: example2.com).
ovirt-ha-agent.service Mon Mar 11 2019 07:37:03 GMT-0500 (Central Daylight Time)
ovirt-ha-agent ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine ERROR
Migration failed: {u'status': {u'message': u'Migration canceled', u'code': 47}, u'progress': 99}
PRIORITY 3
SYSLOG_FACILITY 1
_BOOT_ID c0369e82837644219ec7846b1a26e552
_CAP_EFFECTIVE 0
_CMDLINE /usr/bin/python /usr/share/ovirt-hosted-engine-ha/ovirt-ha-agent
_COMM ovirt-ha-agent
_EXE /usr/bin/python2.7
_GID 36
_HOSTNAME example1.com
_MACHINE_ID 630bb52e187843d3b73684c3704397c4
_PID 21721
_SELINUX_CONTEXT system_u:system_r:unconfined_service_t:s0
_SOURCE_REALTIME_TIMESTAMP 1552307823958694
_SYSTEMD_CGROUP /system.slice/ovirt-ha-agent.service
_SYSTEMD_SLICE system.slice
_SYSTEMD_UNIT ovirt-ha-agent.service
_TRANSPORT syslog
_UID 36
__CURSOR s=705d2472c3764cc9aeca1a60b752dd1a;i=3513d;b=c0369e82837644219ec7846b1a26e552;m=36be855e64;t=583d0d46de6ea;x=ad229344b96ba703
__MONOTONIC_TIMESTAMP 235124645476
__REALTIME_TIMESTAMP 1552307823961834
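For anyone who wants to cross-check the entry above, here is a minimal
Python sketch that decodes the two pieces that are not directly readable:
the journal timestamps (microseconds since the Unix epoch) and the error
payload logged by ovirt-ha-agent. The helper names journal_time and
parse_error are mine, not part of oVirt, and the fixed UTC-5 offset is an
assumption taken from the "GMT-0500" shown in the log.

```python
import ast
from datetime import datetime, timedelta, timezone

# Values copied verbatim from the journal entry above.
SOURCE_REALTIME_US = 1552307823958694  # _SOURCE_REALTIME_TIMESTAMP
REALTIME_US = 1552307823961834         # __REALTIME_TIMESTAMP
ERROR_PAYLOAD = (
    "{u'status': {u'message': u'Migration canceled', u'code': 47}, "
    "u'progress': 99}"
)

EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)
# Assumption: fixed UTC-5 offset, matching "GMT-0500" in the log line.
LOG_TZ = timezone(timedelta(hours=-5))


def journal_time(microseconds):
    """Convert a journal microsecond timestamp to an aware datetime."""
    return (EPOCH + timedelta(microseconds=microseconds)).astimezone(LOG_TZ)


def parse_error(payload):
    """Parse the Python-literal dict from the log message into a dict."""
    return ast.literal_eval(payload)


if __name__ == "__main__":
    print("logged by agent :", journal_time(SOURCE_REALTIME_US).isoformat())
    print("received by jrnl:", journal_time(REALTIME_US).isoformat())
    error = parse_error(ERROR_PAYLOAD)
    print("message :", error["status"]["message"])
    print("code    :", error["status"]["code"])
    print("progress:", error["progress"])
```

The decoded time agrees with the 07:37:03 shown on the ovirt-ha-agent
line, so the journal entry and the failure event refer to the same
moment.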
I have repeated this attempt several times, with the same result.
Thanks.
Pollard