On Tue, Sep 6, 2016 at 6:08 AM, James <mailinglists(a)mooash.com> wrote:
Hey,
I moved our standalone engine from a C6 host to a C7 host and now it
wont install/setup oVirt nodes. It just responds with messages like
this:
Host xxx.xxx.xxx installation failed. Unexpected connection termination.
Failed to install Host xxx.xxx.xxx. Command returned failure code 1
during SSH session 'root(a)xxx.xx.xx.x'.
And from engine.log
2016-09-06 12:20:08,011 ERROR
[org.ovirt.engine.core.uutils.ssh.SSHDialog]
(org.ovirt.thread.pool-8-thread-41) [55397a53] SSH error running command
root@xxx.xx.xx.x:'umask 0077; MYTMP="$(TMPDIR="${OVIRT_TMPDIR}"
mktemp
-d -t ovirt-XXXXXXXXXX)"; trap "chmod -R u+rwX \"${MY
2016-09-06 12:20:08,011 ERROR
[org.ovirt.engine.core.uutils.ssh.SSHDialog]
(org.ovirt.thread.pool-8-thread-41) [55397a53] SSH error running command
root@xxx.xx.xx.x:'umask 0077; MYTMP="$(TMPDIR="${OVIRT_TMPDIR}"
mktemp
-d -t ovirt-XXXXXXXXXX)"; trap "chmod -R u+rwX \"${MY
TMP}\" > /dev/null 2>&1; rm -fr \"${MYTMP}\" > /dev/null
2>&1" 0; tar
--warning=no-timestamp -C "${MYTMP}" -x &&
"${MYTMP}"/ovirt-host-deploy
DIALOG/dialect=str:machine DIALOG/customization=bool:True': Command
returned failure code 1 during SSH session 'root(a)xxx.xx.xx.x'
2016-09-06 12:20:08,007 ERROR
[org.ovirt.engine.core.bll.hostdeploy.VdsDeployBase] (VdsDeploy)
[55397a53] Error during deploy dialog: java.io.IOException: Unexpected
connection termination
at
org.ovirt.otopi.dialog.MachineDialogParser.nextEvent(MachineDialogParser.java:387)
[otopi.jar:]
at
org.ovirt.otopi.dialog.MachineDialogParser.nextEvent(MachineDialogParser.java:404)
[otopi.jar:]
at
org.ovirt.engine.core.bll.hostdeploy.VdsDeployBase._threadMain(VdsDeployBase.java:304)
[bll.jar:]
at
org.ovirt.engine.core.bll.hostdeploy.VdsDeployBase.access$800(VdsDeployBase.java:45)
[bll.jar:]
at
org.ovirt.engine.core.bll.hostdeploy.VdsDeployBase$12.run(VdsDeployBase.java:386)
[bll.jar:]
at java.lang.Thread.run(Thread.java:745) [rt.jar:1.8.0_101]
2016-09-06 12:20:08,013 ERROR
[org.ovirt.engine.core.uutils.ssh.SSHDialog]
(org.ovirt.thread.pool-8-thread-41) [55397a53] Exception:
java.io.IOException: Command returned failure code 1 during SSH session
'root(a)xxx.xx.xx.x'
at
org.ovirt.engine.core.uutils.ssh.SSHClient.executeCommand(SSHClient.java:527)
[uutils.jar:]
at
org.ovirt.engine.core.uutils.ssh.SSHDialog.executeCommand(SSHDialog.java:312)
[uutils.jar:]
at
org.ovirt.engine.core.bll.hostdeploy.VdsDeployBase.execute(VdsDeployBase.java:567)
[bll.jar:]
at
org.ovirt.engine.core.bll.hostdeploy.InstallVdsInternalCommand.installHost(InstallVdsInternalCommand.java:189)
[bll.jar:]
at
org.ovirt.engine.core.bll.hostdeploy.InstallVdsInternalCommand.executeCommand(InstallVdsInternalCommand.java:93)
[bll.jar:]
at
org.ovirt.engine.core.bll.CommandBase.executeWithoutTransaction(CommandBase.java:1215)
[bll.jar:]
at
org.ovirt.engine.core.bll.CommandBase.executeActionInTransactionScope(CommandBase.java:1359)
[bll.jar:]
at
org.ovirt.engine.core.bll.CommandBase.runInTransaction(CommandBase.java:1982)
[bll.jar:]
at
org.ovirt.engine.core.utils.transaction.TransactionSupport.executeInSuppressed(TransactionSupport.java:174)
[utils.jar:]
at
org.ovirt.engine.core.utils.transaction.TransactionSupport.executeInScope(TransactionSupport.java:116)
[utils.jar:]
at
org.ovirt.engine.core.bll.CommandBase.execute(CommandBase.java:1396)
[bll.jar:]
at
org.ovirt.engine.core.bll.CommandBase.executeAction(CommandBase.java:378)
[bll.jar:]
at
org.ovirt.engine.core.bll.MultipleActionsRunner.executeValidatedCommand(MultipleActionsRunner.java:207)
[bll.jar:]
at
org.ovirt.engine.core.bll.MultipleActionsRunner.runCommands(MultipleActionsRunner.java:172)
[bll.jar:]
at
org.ovirt.engine.core.bll.MultipleActionsRunner$2.run(MultipleActionsRunner.java:181)
[bll.jar:]
at
org.ovirt.engine.core.utils.threadpool.ThreadPoolUtil$InternalWrapperRunnable.run(ThreadPoolUtil.java:89)
[utils.jar:]
at
java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
[rt.jar:1.8.0_101]
at java.util.concurrent.FutureTask.run(FutureTask.java:266)
[rt.jar:1.8.0_101]
at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
[rt.jar:1.8.0_101]
at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
[rt.jar:1.8.0_101]
at java.lang.Thread.run(Thread.java:745) [rt.jar:1.8.0_101]
2016-09-06 12:20:08,014 ERROR
[org.ovirt.engine.core.bll.hostdeploy.VdsDeployBase]
(org.ovirt.thread.pool-8-thread-41) [55397a53] Error during host
xxx.xx.xx.x install: java.io.IOException: Command returned failure code
1 during SSH session 'root(a)xxx.xx.xx.x'
at
org.ovirt.engine.core.uutils.ssh.SSHClient.executeCommand(SSHClient.java:527)
[uutils.jar:]
at
org.ovirt.engine.core.uutils.ssh.SSHDialog.executeCommand(SSHDialog.java:312)
[uutils.jar:]
at
org.ovirt.engine.core.bll.hostdeploy.VdsDeployBase.execute(VdsDeployBase.java:567)
[bll.jar:]
at
org.ovirt.engine.core.bll.hostdeploy.InstallVdsInternalCommand.installHost(InstallVdsInternalCommand.java:189)
[bll.jar:]
at
org.ovirt.engine.core.bll.hostdeploy.InstallVdsInternalCommand.executeCommand(InstallVdsInternalCommand.java:93)
[bll.jar:]
at
org.ovirt.engine.core.bll.CommandBase.executeWithoutTransaction(CommandBase.java:1215)
[bll.jar:]
at
org.ovirt.engine.core.bll.CommandBase.executeActionInTransactionScope(CommandBase.java:1359)
[bll.jar:]
at
org.ovirt.engine.core.bll.CommandBase.runInTransaction(CommandBase.java:1982)
[bll.jar:]
at
org.ovirt.engine.core.utils.transaction.TransactionSupport.executeInSuppressed(TransactionSupport.java:174)
[utils.jar:]
at
org.ovirt.engine.core.utils.transaction.TransactionSupport.executeInScope(TransactionSupport.java:116)
[utils.jar:]
at
org.ovirt.engine.core.bll.CommandBase.execute(CommandBase.java:1396)
[bll.jar:]
at
org.ovirt.engine.core.bll.CommandBase.executeAction(CommandBase.java:378)
[bll.jar:]
at
org.ovirt.engine.core.bll.MultipleActionsRunner.executeValidatedCommand(MultipleActionsRunner.java:207)
[bll.jar:]
at
org.ovirt.engine.core.bll.MultipleActionsRunner.runCommands(MultipleActionsRunner.java:172)
[bll.jar:]
at
org.ovirt.engine.core.bll.MultipleActionsRunner$2.run(MultipleActionsRunner.java:181)
[bll.jar:]
at
org.ovirt.engine.core.utils.threadpool.ThreadPoolUtil$InternalWrapperRunnable.run(ThreadPoolUtil.java:89)
[utils.jar:]
at
java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
[rt.jar:1.8.0_101]
at java.util.concurrent.FutureTask.run(FutureTask.java:266)
[rt.jar:1.8.0_101]
at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
[rt.jar:1.8.0_101]
at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
[rt.jar:1.8.0_101]
at java.lang.Thread.run(Thread.java:745) [rt.jar:1.8.0_101]
I've confirmed connectivity from the engine to the hosts themselves and
it can SSH fine, even running the above command manually works fine.
To confirm, I migrated the oVirt engine by taking a backup, shutting
down the old host, changing the hostname/IP of the new host to be the
same as the old and restoring the backup. This is happening on oVirt 3.6
and is currently stopping us from being able to upgrade to oVirt 4.0
since we cannot convert one of our clusters to 3.6 compatibility mode
(its currently on 3.5) without migrating the VM hosts. It happens on a
fresh install or a reinstall, so not sure whats happening.
Can you please check/attach more of engine.log from before the error? Start
with the line having 'Before Installation'. Thanks.
--
Didi