<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 TRANSITIONAL//EN">
<HTML>
<HEAD>
  <META HTTP-EQUIV="Content-Type" CONTENT="text/html; CHARSET=UTF-8">
  <META NAME="GENERATOR" CONTENT="GtkHTML/4.6.4">
</HEAD>
<BODY>
Good afternoon,<BR>
<BR>
I'm having a problem with my hosts: at least once a week, the host that has all the VMs running restarts and becomes unresponsive.<BR>
<BR>
After the oVirt engine sends the restart to the iLO, the host becomes unresponsive and the fans on the enclosure spin up like crazy.<BR>
<BR>
Then the only way to get the blade up again is to stop it using the iLO or the Onboard Administrator, remove it from the enclosure, put it back in, and then issue the start from the oVirt GUI.<BR>
If I only use stop/start on the iLO or the Onboard Administrator, the blade powers up but stays unresponsive: it doesn't show any image or any POST messages.<BR>
<BR>
<BR>
Has anyone else seen this problem before?<BR>
<BR>
BLADE ENCLOSURE: HP BladeSystem c3000<BR>
BLADES: HP BL460c G6<BR>
OS: CentOS 6.4 (64-bit)<BR>
OVIRT: 3.2<BR>
<BR>
<BR>
engine.log:<BR>
<BR>
2013-08-07 14:38:47,256 INFO&nbsp; [org.ovirt.engine.core.bll.storage.SetStoragePoolStatusCommand] (DefaultQuartzScheduler_Worker-7) [3b761f63] Running command: SetStoragePoolStatusCommand internal: true. Entities affected :&nbsp; ID: 06951dba-556b-4323-9356-819c9160fe8e Type: StoragePool<BR>
2013-08-07 14:38:47,257 ERROR [org.ovirt.engine.core.vdsbroker.VdsUpdateRunTimeInfo] (DefaultQuartzScheduler_Worker-8) vds::refreshVdsStats Failed getVdsStats,&nbsp; vds = 44d77dcb-b775-4aef-ae59-1dea8d5c691a : blade5, error = VDSNetworkException: java.net.NoRouteToHostException: No route to host<BR>
2013-08-07 14:38:47,263 WARN&nbsp; [org.ovirt.engine.core.vdsbroker.VdsManager] (DefaultQuartzScheduler_Worker-8) ResourceManager::refreshVdsRunTimeInfo::Failed to refresh VDS , vds = 44d77dcb-b775-4aef-ae59-1dea8d5c691a : blade5, VDS Network Error, continuing.<BR>
java.net.NoRouteToHostException: No route to host<BR>
2013-08-07 14:38:50,252 ERROR [org.ovirt.engine.core.vdsbroker.irsbroker.IrsBrokerCommand] (DefaultQuartzScheduler_Worker-7) [3b761f63] IrsBroker::Failed::GetStoragePoolInfoVDS due to: NoRouteToHostException: No route to host<BR>
2013-08-07 14:38:50,253 WARN&nbsp; [org.ovirt.engine.core.vdsbroker.VdsManager] (DefaultQuartzScheduler_Worker-10) [2b4cb7c5] ResourceManager::refreshVdsRunTimeInfo::Failed to refresh VDS , vds = 44d77dcb-b775-4aef-ae59-1dea8d5c691a : blade5, VDS Network Error, continuing.<BR>
java.net.NoRouteToHostException: No route to host<BR>
2013-08-07 14:38:53,252 INFO&nbsp; [org.ovirt.engine.core.vdsbroker.irsbroker.IrsBrokerCommand] (DefaultQuartzScheduler_Worker-7) [3b761f63] Irs placed on server 44d77dcb-b775-4aef-ae59-1dea8d5c691a failed. Proceed Failover<BR>
2013-08-07 14:38:53,254 ERROR [org.ovirt.engine.core.vdsbroker.VdsManager] (DefaultQuartzScheduler_Worker-4) VDS::handleNetworkException Server failed to respond,&nbsp; vds_id = 44d77dcb-b775-4aef-ae59-1dea8d5c691a, vds_name = blade5, error = java.net.NoRouteToHostException: No route to host<BR>
2013-08-07 14:38:53,296 INFO&nbsp; [org.ovirt.engine.core.bll.VdsEventListener] (pool-3-thread-47) ResourceManager::vdsNotResponding entered for Host 44d77dcb-b775-4aef-ae59-1dea8d5c691a, 192.168.10.25<BR>
2013-08-07 14:38:53,299 INFO&nbsp; [org.ovirt.engine.core.vdsbroker.irsbroker.IrsBrokerCommand] (DefaultQuartzScheduler_Worker-7) [3b761f63] hostFromVds::selectedVds - blade6, spmStatus Free, storage pool VI-DataCenter<BR>
2013-08-07 14:38:53,308 ERROR [org.ovirt.engine.core.vdsbroker.irsbroker.IrsBrokerCommand] (DefaultQuartzScheduler_Worker-7) [3b761f63] SPM Init: could not find reported vds or not up - pool:VI-DataCenter vds_spm_id: 1<BR>
2013-08-07 14:38:53,346 INFO&nbsp; [org.ovirt.engine.core.vdsbroker.irsbroker.IrsBrokerCommand] (DefaultQuartzScheduler_Worker-7) [3b761f63] SPM selection - vds seems as spm blade5<BR>
2013-08-07 14:38:53,355 WARN&nbsp; [org.ovirt.engine.core.vdsbroker.irsbroker.IrsBrokerCommand] (DefaultQuartzScheduler_Worker-7) [3b761f63] spm vds is non responsive, stopping spm selection.<BR>
2013-08-07 14:38:53,438 INFO&nbsp; [org.ovirt.engine.core.bll.FenceExecutor] (pool-3-thread-47) Using Host blade6 from CLUSTER as proxy to execute Restart command on Host blade5<BR>
2013-08-07 14:38:53,438 INFO&nbsp; [org.ovirt.engine.core.bll.FenceExecutor] (pool-3-thread-47) Executing &lt;Status&gt; Power Management command, Proxy Host:blade6, Agent:ilo, Target Host:blade5, Management IP:ilo5.vi.pt, User:Administrator, Options:secure=true<BR>
2013-08-07 14:38:53,457 INFO&nbsp; [org.ovirt.engine.core.vdsbroker.vdsbroker.FenceVdsVDSCommand] (pool-3-thread-47) START, FenceVdsVDSCommand(HostName = blade6, HostId = 2530f498-6029-496a-ab42-924ca2e3eb7f, targetVdsId = 44d77dcb-b775-4aef-ae59-1dea8d5c691a, action = Status, ip = ilo5.vi.pt, port = , type = ilo, user = Administrator, password = ******, options = 'secure=true'), log id: 41a729f3<BR>
2013-08-07 14:39:02,533 INFO&nbsp; [org.ovirt.engine.core.vdsbroker.vdsbroker.FenceVdsVDSCommand] (pool-3-thread-47) FINISH, FenceVdsVDSCommand, return: Test Succeeded, Host Status is: on, log id: 41a729f3<BR>
2013-08-07 14:39:02,541 INFO&nbsp; [org.ovirt.engine.core.bll.VdsNotRespondingTreatmentCommand] (pool-3-thread-47) Running command: VdsNotRespondingTreatmentCommand internal: true. Entities affected :&nbsp; ID: 44d77dcb-b775-4aef-ae59-1dea8d5c691a Type: VDS<BR>
2013-08-07 14:39:02,598 INFO&nbsp; [org.ovirt.engine.core.bll.StopVdsCommand] (pool-3-thread-47) [56fa00a1] Running command: StopVdsCommand internal: true. Entities affected :&nbsp; ID: 44d77dcb-b775-4aef-ae59-1dea8d5c691a Type: VDS<BR>
2013-08-07 14:39:02,619 INFO&nbsp; [org.ovirt.engine.core.vdsbroker.SetVdsStatusVDSCommand] (pool-3-thread-47) [56fa00a1] START, SetVdsStatusVDSCommand(HostName = blade5, HostId = 44d77dcb-b775-4aef-ae59-1dea8d5c691a, status=Reboot, nonOperationalReason=NONE), log id: 20a49440<BR>
2013-08-07 14:39:02,622 INFO&nbsp; [org.ovirt.engine.core.vdsbroker.SetVdsStatusVDSCommand] (pool-3-thread-47) [56fa00a1] VDS blade5 is spm and moved from up calling ResetIrs.<BR>
2013-08-07 14:39:02,622 INFO&nbsp; [org.ovirt.engine.core.vdsbroker.irsbroker.ResetIrsVDSCommand] (pool-3-thread-47) [56fa00a1] START, ResetIrsVDSCommand( storagePoolId = 06951dba-556b-4323-9356-819c9160fe8e, ignoreFailoverLimit = false, compatabilityVersion = null, vdsId = 44d77dcb-b775-4aef-ae59-1dea8d5c691a, ignoreStopFailed = false), log id: 3b546d0a<BR>
2013-08-07 14:39:02,643 INFO&nbsp; [org.ovirt.engine.core.vdsbroker.vdsbroker.SpmStopVDSCommand] (pool-3-thread-47) [56fa00a1] START, SpmStopVDSCommand(HostName = blade5, HostId = 44d77dcb-b775-4aef-ae59-1dea8d5c691a, storagePoolId = 06951dba-556b-4323-9356-819c9160fe8e), log id: 283e8812<BR>
2013-08-07 14:39:02,644 INFO&nbsp; [org.ovirt.engine.core.vdsbroker.vdsbroker.SpmStopVDSCommand] (pool-3-thread-47) [56fa00a1] SpmStopVDSCommand:: vds blade5 is in Reboot status - not performing spm stop, pool id 06951dba-556b-4323-9356-819c9160fe8e<BR>
2013-08-07 14:39:02,644 INFO&nbsp; [org.ovirt.engine.core.vdsbroker.vdsbroker.SpmStopVDSCommand] (pool-3-thread-47) [56fa00a1] FINISH, SpmStopVDSCommand, log id: 283e8812<BR>
2013-08-07 14:39:02,645 INFO&nbsp; [org.ovirt.engine.core.vdsbroker.irsbroker.ResetIrsVDSCommand] (pool-3-thread-47) [56fa00a1] FINISH, ResetIrsVDSCommand, log id: 3b546d0a<BR>
2013-08-07 14:39:02,645 INFO&nbsp; [org.ovirt.engine.core.vdsbroker.SetVdsStatusVDSCommand] (pool-3-thread-47) [56fa00a1] FINISH, SetVdsStatusVDSCommand, log id: 20a49440<BR>
2013-08-07 14:39:02,699 INFO&nbsp; [org.ovirt.engine.core.bll.FenceExecutor] (pool-3-thread-47) [56fa00a1] Using Host blade6 from CLUSTER as proxy to execute Stop command on Host blade5<BR>
2013-08-07 14:39:02,735 INFO&nbsp; [org.ovirt.engine.core.vdsbroker.vdsbroker.SpmStopVDSCommand] (pool-3-thread-47) [56fa00a1] START, SpmStopVDSCommand(HostName = blade5, HostId = 44d77dcb-b775-4aef-ae59-1dea8d5c691a, storagePoolId = 06951dba-556b-4323-9356-819c9160fe8e), log id: 6bbeb9ca<BR>
2013-08-07 14:39:02,736 INFO&nbsp; [org.ovirt.engine.core.vdsbroker.vdsbroker.SpmStopVDSCommand] (pool-3-thread-47) [56fa00a1] SpmStopVDSCommand:: vds blade5 is in Reboot status - not performing spm stop, pool id 06951dba-556b-4323-9356-819c9160fe8e<BR>
2013-08-07 14:39:02,736 INFO&nbsp; [org.ovirt.engine.core.vdsbroker.vdsbroker.SpmStopVDSCommand] (pool-3-thread-47) [56fa00a1] FINISH, SpmStopVDSCommand, log id: 6bbeb9ca<BR>
2013-08-07 14:39:02,737 INFO&nbsp; [org.ovirt.engine.core.bll.FenceExecutor] (pool-3-thread-47) [56fa00a1] Executing &lt;Stop&gt; Power Management command, Proxy Host:blade6, Agent:ilo, Target Host:blade5, Management IP:ilo5.vi.pt, User:Administrator, Options:secure=true<BR>
2013-08-07 14:39:02,755 INFO&nbsp; [org.ovirt.engine.core.vdsbroker.vdsbroker.FenceVdsVDSCommand] (pool-3-thread-47) [56fa00a1] START, FenceVdsVDSCommand(HostName = blade6, HostId = 2530f498-6029-496a-ab42-924ca2e3eb7f, targetVdsId = 44d77dcb-b775-4aef-ae59-1dea8d5c691a, action = Stop, ip = ilo5.vi.pt, port = , type = ilo, user = Administrator, password = ******, options = 'secure=true'), log id: 6f3cc543<BR>
2013-08-07 14:39:03,388 INFO&nbsp; [org.ovirt.engine.core.vdsbroker.irsbroker.IrsBrokerCommand] (DefaultQuartzScheduler_Worker-1) [18d8826] hostFromVds::selectedVds - blade6, spmStatus Free, storage pool VI-DataCenter<BR>
2013-08-07 14:39:03,392 ERROR [org.ovirt.engine.core.vdsbroker.irsbroker.IrsBrokerCommand] (DefaultQuartzScheduler_Worker-1) [18d8826] SPM Init: could not find reported vds or not up - pool:VI-DataCenter vds_spm_id: 1<BR>
2013-08-07 14:39:03,411 INFO&nbsp; [org.ovirt.engine.core.vdsbroker.irsbroker.IrsBrokerCommand] (DefaultQuartzScheduler_Worker-1) [18d8826] SPM selection - vds seems as spm blade5<BR>
2013-08-07 14:39:03,430 INFO&nbsp; [org.ovirt.engine.core.vdsbroker.vdsbroker.SpmStopVDSCommand] (DefaultQuartzScheduler_Worker-1) [18d8826] START, SpmStopVDSCommand(HostName = blade5, HostId = 44d77dcb-b775-4aef-ae59-1dea8d5c691a, storagePoolId = 06951dba-556b-4323-9356-819c9160fe8e), log id: 5ae3209c<BR>
2013-08-07 14:39:03,431 INFO&nbsp; [org.ovirt.engine.core.vdsbroker.vdsbroker.SpmStopVDSCommand] (DefaultQuartzScheduler_Worker-1) [18d8826] SpmStopVDSCommand:: vds blade5 is in Reboot status - not performing spm stop, pool id 06951dba-556b-4323-9356-819c9160fe8e<BR>
<BR>
Best regards,<BR>
Ricardo Esteves.
</BODY>
</HTML>