[ovirt-users] Why Node was rebooting?

Jonathan Baecker jonbae77 at gmail.com
Sat Nov 25 20:22:49 UTC 2017


I do setup power management, but because the second node if off, it's 
working not correctly. I will install now a vm on a different server, 
just for using it as a proxy.
But you think this can be the reason?


Am 25.11.2017 um 20:36 schrieb Charles Kozler:
> Did you setup fencing?
>
> I've also seen this behavior with stressed CPU and NMI watch dog in 
> BIOS rebooting a server but that was on freebsd. Have not seen it on 
> Linux
>
> On Nov 25, 2017 2:07 PM, "Jonathan Baecker" <jonbae77 at gmail.com 
> <mailto:jonbae77 at gmail.com>> wrote:
>
>     Hello community,
>
>     yesterday evening one of our nodes was rebooted, but I have not
>     found out why. The engine only reports this:
>
>             24.11.2017 22:01:43 Storage Pool Manager runs on Host
>             onode-1 (Address: onode-1.worknet.lan).
>             24.11.2017 21:58:50 Failed to verify Host onode-1 power
>             management.
>             24.11.2017 21:58:50 Status of host onode-1 was set to Up.
>             24.11.2017 21:58:41 Successfully refreshed the
>             capabilities of host onode-1.
>             24.11.2017 21:58:37 VDSM onode-1 command
>             GetCapabilitiesVDS failed: Client close
>             24.11.2017 21:58:37 VDSM onode-1 command
>             HSMGetAllTasksStatusesVDS failed: Not SPM: ()
>             24.11.2017 21:58:22 Host onode-1 is rebooting.
>             24.11.2017 21:58:22 Kdump flow is not in progress on host
>             onode-1.
>             24.11.2017 21:57:51 Host onode-1 is non responsive.
>             24.11.2017 21:57:51 VM playout was set to the Unknown status.
>             24.11.2017 21:57:51 VM gogs was set to the Unknown status.
>             24.11.2017 21:57:51 VM Windows2008 was set to the Unknown
>             status.
>             [...]
>
>     There is no crash report, and no relevant errors in dmesg.
>
>     Does the engine send a reboot command to the node, when it gets no
>     responds? Is there any other way to found out why the node was
>     rebooting? The node hangs on a usv and all other servers was
>     running well...
>
>     In the time, when the reboot was happen, I had a bigger video
>     compression job in one of the VMs, so maybe the CPUs got a bit
>     stressed, but they are not over committed.
>
>
>     Regards
>
>     Jonathan
>
>
>     _______________________________________________
>     Users mailing list
>     Users at ovirt.org <mailto:Users at ovirt.org>
>     http://lists.ovirt.org/mailman/listinfo/users
>     <http://lists.ovirt.org/mailman/listinfo/users>
>

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.ovirt.org/pipermail/users/attachments/20171125/bd1e1915/attachment.html>


More information about the Users mailing list