--Apple-Mail=_5C6B714A-9630-480A-A42C-A8B11D48E3FB
Content-Transfer-Encoding: quoted-printable
Content-Type: text/plain;
charset=utf-8
Try restarting vdsmd from a shell on the non-responsive host: “systemctl restart vdsmd”.
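If the restart does not help, it may be worth checking from the engine VM whether a TCP connection to the VDSM port on the host can be opened at all. Working ping and ssh do not prove that this particular port is reachable, and the "No route to host" in the quoted log is raised on connect. Below is a rough Python sketch, not an oVirt tool: probe_tcp is a hypothetical helper name, and 54321 is assumed to be the VDSM port (the usual default; adjust if your setup differs).

```python
import errno
import socket


def probe_tcp(host: str, port: int, timeout: float = 3.0) -> str:
    """Attempt one TCP connect and describe the outcome with a short label."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return "open"        # something accepted the connection
    except socket.timeout:
        return "timeout"         # no answer at all, packets probably dropped
    except ConnectionRefusedError:
        return "refused"         # host reachable, nothing listening on the port
    except OSError as exc:
        if exc.errno == errno.EHOSTUNREACH:
            # The OS-level error that Java surfaces as NoRouteToHostException.
            return "no-route"
        return f"error: {exc}"   # e.g. name resolution failure
```

Running probe_tcp("ovirt2.telia.ru", 54321) and probe_tcp("ovirt2.telia.ru", 22) from the engine VM should show whether only the VDSM port is affected. A "no-route" result on 54321 while 22 is "open" would point at a firewall rule on the host rejecting that port rather than at routing. That is one common cause, not a certainty.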
From: Artem Tambovskiy <artem.tambovskiy(a)gmail.com>
Subject: [ovirt-users] Non-responsive host, VM's are still running - how to resolve?
Date: November 14, 2017 at 11:23:32 AM CST
To: users

Apparently, I lost the host that was running hosted-engine and another 4
VMs, exactly while migrating the second host from bare metal to become the
second host in the cluster. For some reason the first host entered the
"Non responsive" state. The interesting thing is that hosted-engine and all
the other VMs are still up and running, so it looks like a communication
problem between hosted-engine and the host.

The engine.log on hosted-engine is full of the following messages:

2017-11-14 17:06:43,158Z INFO [org.ovirt.vdsm.jsonrpc.client.reactors.ReactorClient] (SSL Stomp Reactor) [] Connecting to ovirt2/80.239.162.106
2017-11-14 17:06:43,159Z ERROR [org.ovirt.engine.core.vdsbroker.vdsbroker.GetAllVmStatsVDSCommand] (DefaultQuartzScheduler9) [50938c3] Command 'GetAllVmStatsVDSCommand(HostName = ovirt2.telia.ru, VdsIdVDSCommandParametersBase:{runAsync='true', hostId='3970247c-69eb-4bd8-b263-9100703a8243'})' execution failed: java.net.NoRouteToHostException: No route to host
2017-11-14 17:06:43,159Z INFO [org.ovirt.engine.core.vdsbroker.monitoring.PollVmStatsRefresher] (DefaultQuartzScheduler9) [50938c3] Failed to fetch vms info for host 'ovirt2.telia.ru' - skipping VMs monitoring.
2017-11-14 17:06:45,929Z INFO [org.ovirt.vdsm.jsonrpc.client.reactors.ReactorClient] (SSL Stomp Reactor) [] Connecting to ovirt2/80.239.162.106
2017-11-14 17:06:45,930Z ERROR [org.ovirt.engine.core.vdsbroker.vdsbroker.GetCapabilitiesVDSCommand] (DefaultQuartzScheduler2) [6080f1cc] Command 'GetCapabilitiesVDSCommand(HostName = ovirt2.telia.ru, VdsIdAndVdsVDSCommandParametersBase:{runAsync='true', hostId='3970247c-69eb-4bd8-b263-9100703a8243', vds='Host[ovirt2.telia.ru,3970247c-69eb-4bd8-b263-9100703a8243]'})' execution failed: java.net.NoRouteToHostException: No route to host
2017-11-14 17:06:45,930Z ERROR [org.ovirt.engine.core.vdsbroker.monitoring.HostMonitoring] (DefaultQuartzScheduler2) [6080f1cc] Failure to refresh host 'ovirt2.telia.ru' runtime info: java.net.NoRouteToHostException: No route to host
2017-11-14 17:06:48,933Z INFO [org.ovirt.vdsm.jsonrpc.client.reactors.ReactorClient] (SSL Stomp Reactor) [] Connecting to ovirt2/80.239.162.106
2017-11-14 17:06:48,934Z ERROR [org.ovirt.engine.core.vdsbroker.vdsbroker.GetCapabilitiesVDSCommand] (DefaultQuartzScheduler6) [1a64dfea] Command 'GetCapabilitiesVDSCommand(HostName = ovirt2.telia.ru, VdsIdAndVdsVDSCommandParametersBase:{runAsync='true', hostId='3970247c-69eb-4bd8-b263-9100703a8243', vds='Host[ovirt2.telia.ru,3970247c-69eb-4bd8-b263-9100703a8243]'})' execution failed: java.net.NoRouteToHostException: No route to host
2017-11-14 17:06:48,934Z ERROR [org.ovirt.engine.core.vdsbroker.monitoring.HostMonitoring] (DefaultQuartzScheduler6) [1a64dfea] Failure to refresh host 'ovirt2.telia.ru' runtime info: java.net.NoRouteToHostException: No route to host
2017-11-14 17:06:50,931Z INFO [org.ovirt.vdsm.jsonrpc.client.reactors.ReactorClient] (SSL Stomp Reactor) [] Connecting to ovirt2/80.239.162.106
2017-11-14 17:06:50,932Z ERROR [org.ovirt.engine.core.vdsbroker.vdsbroker.SpmStatusVDSCommand] (DefaultQuartzScheduler4) [6b19d168] Command 'SpmStatusVDSCommand(HostName = ovirt2.telia.ru, SpmStatusVDSCommandParameters:{runAsync='true', hostId='3970247c-69eb-4bd8-b263-9100703a8243', storagePoolId='5a044257-02ec-0382-0243-0000000001f2'})' execution failed: java.net.NoRouteToHostException: No route to host
2017-11-14 17:06:50,939Z INFO [org.ovirt.vdsm.jsonrpc.client.reactors.ReactorClient] (SSL Stomp Reactor) [] Connecting to ovirt2/80.239.162.106
2017-11-14 17:06:50,940Z ERROR [org.ovirt.engine.core.vdsbroker.irsbroker.IrsBrokerCommand] (DefaultQuartzScheduler4) [6b19d168] IrsBroker::Failed::GetStoragePoolInfoVDS
2017-11-14 17:06:50,940Z ERROR [org.ovirt.engine.core.vdsbroker.irsbroker.GetStoragePoolInfoVDSCommand] (DefaultQuartzScheduler4) [6b19d168] Command 'GetStoragePoolInfoVDSCommand( GetStoragePoolInfoVDSCommandParameters:{runAsync='true', storagePoolId='5a044257-02ec-0382-0243-0000000001f2', ignoreFailoverLimit='true'})' execution failed: IRSProtocolException:
2017-11-14 17:06:51,937Z INFO [org.ovirt.vdsm.jsonrpc.client.reactors.ReactorClient] (SSL Stomp Reactor) [] Connecting to ovirt2/80.239.162.106
2017-11-14 17:06:51,938Z ERROR [org.ovirt.engine.core.vdsbroker.vdsbroker.GetCapabilitiesVDSCommand] (DefaultQuartzScheduler7) [7f23a3bd] Command 'GetCapabilitiesVDSCommand(HostName = ovirt2.telia.ru, VdsIdAndVdsVDSCommandParametersBase:{runAsync='true', hostId='3970247c-69eb-4bd8-b263-9100703a8243', vds='Host[ovirt2.telia.ru,3970247c-69eb-4bd8-b263-9100703a8243]'})' execution failed: java.net.NoRouteToHostException: No route to host
2017-11-14 17:06:51,938Z ERROR [org.ovirt.engine.core.vdsbroker.monitoring.HostMonitoring] (DefaultQuartzScheduler7) [7f23a3bd] Failure to refresh host 'ovirt2.telia.ru' runtime info: java.net.NoRouteToHostException: No route to host
2017-11-14 17:06:54,941Z INFO [org.ovirt.vdsm.jsonrpc.client.reactors.ReactorClient] (SSL Stomp Reactor) [] Connecting to ovirt2/80.239.162.106
2017-11-14 17:06:54,942Z ERROR [org.ovirt.engine.core.vdsbroker.vdsbroker.GetCapabilitiesVDSCommand] (DefaultQuartzScheduler2) [7a769f6c] Command 'GetCapabilitiesVDSCommand(HostName = ovirt2.telia.ru, VdsIdAndVdsVDSCommandParametersBase:{runAsync='true', hostId='3970247c-69eb-4bd8-b263-9100703a8243', vds='Host[ovirt2.telia.ru,3970247c-69eb-4bd8-b263-9100703a8243]'})' execution failed: java.net.NoRouteToHostException: No route to host
2017-11-14 17:06:54,942Z ERROR [org.ovirt.engine.core.vdsbroker.monitoring.HostMonitoring] (DefaultQuartzScheduler2) [7a769f6c] Failure to refresh host 'ovirt2.telia.ru' runtime info: java.net.NoRouteToHostException: No route to host

It's a bit weird, since I can ping the host and log in to it via ssh from
hosted-engine with no problem. I have added the second host to the cluster,
but it is not running hosted-engine. Any suggestions for further steps?
Just reboot the host and hope for the best?

Regards,
Artem
_______________________________________________
Users mailing list
Users(a)ovirt.org
http://lists.ovirt.org/mailman/listinfo/users
--Apple-Mail=_5C6B714A-9630-480A-A42C-A8B11D48E3FB--