On Sat, May 31, 2014 at 6:06 AM, Joop <jvdwege(a)xs4all.nl> wrote:
Bob Doolittle wrote:
>
> Joop,
>
> On 05/26/2014 02:43 AM, Joop wrote:
>>
>> Yesterday evening I have found the service responsible for the reboot
>> instead of the powerdown. If I do: service wdmd stop the server will reboot.
>> It seems the watchdog is hung up and eventually this will lead to a crash
>> and thus a reboot instead of the shutdown.
>>
>> Anyone knows how to debug this?
>
>
> Did you get anywhere with this?
> Pretty nasty. Is there a bug open?
>
> We're getting a timeout on an NFS mount during the powerdown (single-node
> hosted, after global maintenance enabled and engine powered off), and that
> makes the machine reboot and try to come back up again instead of powering
> off.
>
> So two issues:
> - What is the mount that is hanging (probably an oVirt issue)?
Don´t know what that problem is. I have a local nfs mount but don´t
experience that problem
I got to the console when mine went for a reboot. I see sanlock and
wdmd failing to shutdown properly which would explain why it doesn't
unmount properly.
> - Why does the system reboot instead of powering down as instructed (?)?
>
the reboot is caused by wdmd. Docs says that if the watchdogs aren´t
responding that a reset will follow. So our init 0 is overruled because of
the hanging watchdog. Why it is hanging I don´t know. Could be my chipset,
could be the version of wdmd-kernel. Only thing is I know it didn´t happen
always in the past but that is as much as I remember, sorry.
Joop
_______________________________________________
Users mailing list
Users(a)ovirt.org
http://lists.ovirt.org/mailman/listinfo/users