[ovirt-users] Can HA Agent control NFS Mount?

Joop jvdwege at xs4all.nl
Fri May 30 20:06:49 UTC 2014


Bob Doolittle wrote:
> Joop,
>
> On 05/26/2014 02:43 AM, Joop wrote:
>> Yesterday evening I have found the service responsible for the reboot 
>> instead of the powerdown. If I do: service wdmd stop the server will 
>> reboot. It seems the watchdog is hung up and eventually this will 
>> lead to a crash and thus a reboot instead of the shutdown.
>>
>> Anyone knows how to debug this?
>
> Did you get anywhere with this?
> Pretty nasty. Is there a bug open?
>
> We're getting a timeout on an NFS mount during the powerdown 
> (single-node hosted, after global maintenance enabled and engine 
> powered off), and that makes the machine reboot and try to come back 
> up again instead of powering off.
>
> So two issues:
> - What is the mount that is hanging (probably an oVirt issue)?
Don´t know what that problem is. I have a local nfs mount but don´t 
experience that problem

> - Why does the system reboot instead of powering down as instructed (?)?
>

the reboot is caused by wdmd. Docs says that if the watchdogs aren´t 
responding that a reset will follow. So our init 0 is overruled because 
of the hanging watchdog. Why it is hanging I don´t know. Could be my 
chipset, could be the version of wdmd-kernel. Only thing is I know it 
didn´t happen always in the past but that is as much as I remember, sorry.

Joop





More information about the Users mailing list