After rebooting a hardware node, it can no longer connect in oVirt. The engine log shows:
2016-03-17 23:02:46,547 ERROR [org.ovirt.engine.core.uutils.ssh.SSHDialog] (org.ovirt.thread.pool-8-thread-9) [7389eed3] SSH error running command root@10.176.30.97:'umask 0077; MYTMP="$(TMPDIR="${OVIRT_TMPDIR}" mktemp -d -t ovirt-XXXXXXXXXX)"; trap "chmod -R u+rwX \"${MYTMP}\" > /dev/null 2>&1; rm -fr \"${MYTMP}\" > /dev/null 2>&1" 0; tar --warning=no-timestamp -C "${MYTMP}" -x && "${MYTMP}"/ovirt-host-deploy DIALOG/dialect=str:machine DIALOG/customization=bool:True': Command returned failure code 1 during SSH session 'root@10.176.30.97'
2016-03-17 23:02:46,547 ERROR [org.ovirt.engine.core.uutils.ssh.SSHDialog] (org.ovirt.thread.pool-8-thread-9) [7389eed3] Exception: java.io.IOException: Command returned failure code 1 during SSH session 'root@10.176.30.97'
    at org.ovirt.engine.core.uutils.ssh.SSHClient.executeCommand(SSHClient.java:527) [uutils.jar:]
I can run this and it works fine:
[root@ovirt test host-deploy]# HOST="10.176.30.97"; ssh -i /etc/pki/ovirt-engine/keys/engine_id_rsa "$HOST" 'umask 0077;echo $OVIRT_TMPDIR; MYTMP="$(TMPDIR="${OVIRT_TMPDIR}" mktemp -d -t ovirt-XXXXXXXXXX)"; echo MYTMP=$MYTMP; trap "chmod -R u+rwX \"${MYTMP}\"" 0
'
MYTMP=/tmp/ovirt-IhxDxcDd8y
[root@ovirt test host-deploy]# echo $?
0
When I add the tar part it just waits forever.
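A possible explanation for the hang, offered as an assumption rather than a confirmed diagnosis: the tar step in the logged command has no -f option, and on many builds GNU tar then reads the archive from standard input. The engine presumably streams the host-deploy bundle over the SSH session's stdin, so a manual run with nothing piped in would simply wait for input. The sketch below runs entirely locally (no SSH) and only illustrates that pattern. The stand-in script and the SRC directory are hypothetical, and "-f -" is written out to make the stdin/stdout use explicit.

```shell
#!/bin/sh
# Local sketch only: mimics "stream a tar archive over stdin, unpack, run".
# Nothing here talks to the engine or to a real host.
set -eu

SRC="$(mktemp -d)"      # hypothetical directory holding the bundle contents
MYTMP="$(mktemp -d)"    # plays the role of MYTMP in the logged command
trap 'rm -rf "$SRC" "$MYTMP"' 0

# Hypothetical stand-in for the real ovirt-host-deploy script.
printf '#!/bin/sh\necho "stand-in deploy script ran"\n' > "$SRC/ovirt-host-deploy"

# Producer writes the archive to stdout; consumer unpacks it from stdin.
# Without the producer side of the pipe, the consumer would block on stdin.
# The script is run via "sh" so the sketch does not depend on exec permissions.
RESULT="$(tar -C "$SRC" -c -f - ovirt-host-deploy \
  | { tar -C "$MYTMP" -x -f - && sh "$MYTMP/ovirt-host-deploy"; })"
echo "$RESULT"
```

If this is what is happening, the hang in the manual test is expected and is separate from the failure code 1 that the engine reports.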
The only change I made before the reboot was uncommenting 'user = "root"' in
/etc/libvirt/qemu.conf so that import-to-ovirt.pl would work.
Another host had the same issue. I tried putting it into maintenance and
reinstalling, and also removing and re-adding it. Neither helped.
Finally I just re-kickstarted that host, and it works fine now,
complete with the qemu.conf change.
Attached are the engine.log, vdsm.log and supervdsm.log.
I have made some modifications while trying to get a vdsm hook to work (see
the "multiple NICs VLAN ID conflict" thread), but I don't think that is
related, especially since re-kickstarting one host fixed it.
ovirt-engine-3.6.3.4-1.el7.centos.noarch
Thanks!