[ovirt-users] Can HA Agent control NFS Mount?

Andrew Lau andrew at andrewklau.com
Sat May 31 03:05:04 EDT 2014


On Sat, May 31, 2014 at 6:06 AM, Joop <jvdwege at xs4all.nl> wrote:
> Bob Doolittle wrote:
>>
>> Joop,
>>
>> On 05/26/2014 02:43 AM, Joop wrote:
>>>
>>> Yesterday evening I found the service responsible for the reboot
>>> instead of the powerdown. If I run "service wdmd stop", the server reboots.
>>> It seems the watchdog hangs, and eventually this leads to a crash
>>> and thus a reboot instead of the shutdown.
>>>
>>> Does anyone know how to debug this?
>>
>>
>> Did you get anywhere with this?
>> Pretty nasty. Is there a bug open?
>>
>> We're getting a timeout on an NFS mount during the powerdown (single-node
>> hosted, after global maintenance enabled and engine powered off), and that
>> makes the machine reboot and try to come back up again instead of powering
>> off.
>>
>> So two issues:
>> - What is the mount that is hanging (probably an oVirt issue)?
>
> I don't know what that problem is. I have a local NFS mount but don't
> experience that problem.

I got to the console when mine went for a reboot. I see sanlock and
wdmd failing to shut down properly, which would explain why the mount
doesn't unmount properly.
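
If sanlock and wdmd really are what fails to stop, one thing to try is
stopping the services that use sanlock before sanlock and wdmd themselves,
and only then powering off. Below is a rough sketch of that idea. The stop
order is an assumption and is untested against this setup. The names run,
stop_ha_stack, shutdown_host and DRY_RUN are made up for illustration. It
assumes the services are called ovirt-ha-agent, ovirt-ha-broker, vdsmd,
sanlock and wdmd on the host.

```shell
#!/bin/sh
# Sketch only: stop the hosted-engine HA services before the lock and
# watchdog daemons, then power off. The order is an assumption, not a
# confirmed fix from this thread.

# DRY_RUN defaults to 1, so commands are printed rather than executed.
run() {
    if [ "${DRY_RUN:-1}" = "1" ]; then
        echo "+ $*"
    else
        "$@"
    fi
}

stop_ha_stack() {
    # Assumed order: services using sanlock first, then sanlock, then wdmd.
    for svc in ovirt-ha-agent ovirt-ha-broker vdsmd sanlock wdmd; do
        run service "$svc" stop || return 1
    done
}

# Only power off if every stop step succeeded.
shutdown_host() {
    stop_ha_stack && run poweroff
}
```

Sourcing this and calling shutdown_host prints the commands it would run.
Setting DRY_RUN=0 (as root) actually runs them, and the first stop that
fails should show which service is holding things up.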

>
>
>> - Why does the system reboot instead of powering down as instructed (?)?
>>
>
> The reboot is caused by wdmd. The docs say that if the watchdogs aren't
> responding, a reset will follow. So our init 0 is overruled because of
> the hanging watchdog. Why it is hanging I don't know. It could be my chipset,
> or it could be the version of wdmd-kernel. The only thing I know is that it
> didn't always happen in the past, but that is as much as I remember, sorry.
>
>
> Joop

