[ovirt-users] rebooting hypervisors from time to time

Erekle Magradze erekle.magradze at recogizer.de
Fri Feb 23 17:43:21 UTC 2018


Hi,

Thanks a lot for having a look.

The HA VMs were migrated and the non-HA VMs were powered off. The syslogs 
did not say anything useful, and dmesg reported a graceful reboot.
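
To pin down when each reboot happened, one option is to look for the glusterfs client start-up marker (MSGID 100030, "Started running ...") in the mount log and compare those times with the syslog. Below is a minimal Python sketch, assuming each log entry sits on a single line in the format of the excerpt quoted further down. The name `find_restarts` and the abbreviated sample lines are only illustrative.

```python
import re
from datetime import datetime

# Matches the leading "[timestamp] LEVEL " part of an entry, e.g.
# [2018-02-22 15:53:16.245720] I [MSGID: 100030] [glusterfsd.c:2524:main] ...
ENTRY = re.compile(
    r"^\[(?P<ts>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}\.\d+)\] (?P<level>[A-Z]) "
)


def find_restarts(lines):
    """Return the timestamps of entries that carry the client start-up marker."""
    restarts = []
    for line in lines:
        m = ENTRY.match(line)
        if m and "MSGID: 100030" in line and "Started running" in line:
            restarts.append(datetime.strptime(m.group("ts"), "%Y-%m-%d %H:%M:%S.%f"))
    return restarts


# Two entries taken from the excerpt (the second one is abbreviated here).
sample = [
    "[2018-02-22 15:42:11.293608] I [MSGID: 109063] "
    "[dht-layout.c:716:dht_layout_normalize] 0-virtimages-dht: Found anomalies "
    "in (null) (gfid = 00000000-0000-0000-0000-000000000000). Holes=1 overlaps=0",
    "[2018-02-22 15:53:16.245720] I [MSGID: 100030] [glusterfsd.c:2524:main] "
    "0-/usr/sbin/glusterfs: Started running /usr/sbin/glusterfs version 3.12.5",
]
restarts = find_restarts(sample)
```

For a real run, the list of lines would come from the mount log file on the hypervisor instead of `sample`.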

What do the errors you see indicate? Maybe they contain a useful hint on 
how to proceed with the investigation.
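
To narrow down which entries matter, the logs could be filtered by severity and grouped by message ID. Below is a minimal Python sketch, assuming one entry per line in the `[timestamp] LEVEL [MSGID: n] ...` layout of the Gluster excerpt quoted further down. It also assumes that `E` marks errors, by analogy with the `I` and `W` entries visible there. The name `summarize` is only illustrative.

```python
import re
from collections import Counter

# "[timestamp] LEVEL [MSGID: n] rest"; the MSGID part is optional because
# some entries (e.g. the rpc-clnt ones) do not carry it.
ENTRY = re.compile(
    r"^\[(?P<ts>[^\]]+)\] (?P<level>[A-Z]) (?:\[MSGID: (?P<msgid>\d+)\] )?(?P<rest>.*)$"
)


def summarize(lines, levels=("E", "W")):
    """Count entries per (level, msgid) for the selected severity levels."""
    counts = Counter()
    for line in lines:
        m = ENTRY.match(line)
        if m and m.group("level") in levels:
            counts[(m.group("level"), m.group("msgid"))] += 1
    return counts


# Entries copied from the excerpt, each joined onto a single line.
sample = [
    "[2018-02-22 15:53:16.263712] W [MSGID: 101002] [options.c:995:xl_opt_validate] "
    "0-glusterfs: option 'address-family' is deprecated, preferred is "
    "'transport.address-family', continuing with correction",
    "[2018-02-22 15:53:16.273594] W [MSGID: 101174] [graph.c:363:_log_if_unknown_option] "
    "0-virtimages-readdir-ahead: option 'parallel-readdir' is not recognized",
    "[2018-02-22 15:53:16.276683] I [rpc-clnt.c:1986:rpc_clnt_reconfig] "
    "0-virtimages-client-0: changing port to 49152 (from 0)",
]
counts = summarize(sample)
for (level, msgid), n in sorted(counts.items()):
    print(level, msgid, n)
```

Entries without a MSGID are counted under `None`, so they still show up in the summary.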

Thanks in advance again

Cheers

Erekle


On 02/23/2018 06:15 PM, Mahdi Adnan wrote:
> Hi,
>
> The log doesn't indicate an HV reboot, and I see lots of errors in the logs.
> During the reboot, what happened to the VMs on the HV? Were they migrated 
> or paused? What about the system logs? Do they indicate a graceful 
> shutdown?
>
>
> -- 
>
> Respectfully
> Mahdi A. Mahdi
>
> ------------------------------------------------------------------------
> *From:* Erekle Magradze <erekle.magradze at recogizer.de>
> *Sent:* Friday, February 23, 2018 2:48 PM
> *To:* Mahdi Adnan; users at ovirt.org
> *Subject:* Re: [ovirt-users] rebooting hypervisors from time to time
>
> Thanks for the reply,
>
> I've attached all the logs from yesterday. The reboot happened during 
> the day, but this is not the first time, and this is not the only 
> hypervisor affected.
>
> Kind Regards
>
> Erekle
>
>
> On 02/23/2018 09:00 AM, Mahdi Adnan wrote:
>> Hi,
>>
>> Can you post the VDSM and Engine logs?
>>
>>
>> -- 
>>
>> Respectfully
>> Mahdi A. Mahdi
>>
>> ------------------------------------------------------------------------
>> *From:* users-bounces at ovirt.org on behalf of Erekle Magradze
>> <erekle.magradze at recogizer.de>
>> *Sent:* Thursday, February 22, 2018 11:48 PM
>> *To:* users at ovirt.org
>> *Subject:* Re: [ovirt-users] rebooting hypervisors from time to time
>> Dear all,
>>
>> It would be great if someone could share their experience with a
>> similar case, or give a hint on where to start the investigation.
>>
>> Thanks again
>>
>> Cheers
>>
>> Erekle
>>
>>
>> On 02/22/2018 05:05 PM, Erekle Magradze wrote:
>> > Hello there,
>> >
>> > I am facing the following problem: from time to time, one of the
>> > hypervisors (there are 3 of them) reboots. I am using
>> > ovirt-release42-4.2.1-1.el7.centos.noarch with Gluster as the storage
>> > backend (glusterfs-3.12.5-2.el7.x86_64).
>> >
>> > I suspect Gluster because of messages such as the ones below, from one
>> > of the volumes.
>> >
>> > Could you please help and suggest in which direction the
>> > investigation should go?
>> >
>> > Thanks in advance
>> >
>> > Cheers
>> >
>> > Erekle
>> >
>> >
>> > [2018-02-22 15:36:10.011687] and [2018-02-22 15:37:10.955013]
>> > [2018-02-22 15:41:10.198701] I [MSGID: 109063]
>> > [dht-layout.c:716:dht_layout_normalize] 0-virtimages-dht: Found
>> > anomalies in (null) (gfid = 00000000-0000-0000-0000-000000000000).
>> > Holes=1 overlaps=0
>> > [2018-02-22 15:41:10.198704] I [MSGID: 109063]
>> > [dht-layout.c:716:dht_layout_normalize] 0-virtimages-dht: Found
>> > anomalies in (null) (gfid = 00000000-0000-0000-0000-000000000000).
>> > Holes=1 overlaps=0
>> > [2018-02-22 15:42:11.293608] I [MSGID: 109063]
>> > [dht-layout.c:716:dht_layout_normalize] 0-virtimages-dht: Found
>> > anomalies in (null) (gfid = 00000000-0000-0000-0000-000000000000).
>> > Holes=1 overlaps=0
>> > [2018-02-22 15:53:16.245720] I [MSGID: 100030]
>> > [glusterfsd.c:2524:main] 0-/usr/sbin/glusterfs: Started running
>> > /usr/sbin/glusterfs version 3.12.5 (args: /usr/sbin/glusterfs
>> > --volfile-server=10.0.0.21 --volfile-server=10.0.0.22
>> > --volfile-server=10.0.0.23 --volfile-id=/virtimages
>> > /rhev/data-center/mnt/glusterSD/10.0.0.21:_virtimages)
>> > [2018-02-22 15:53:16.263712] W [MSGID: 101002]
>> > [options.c:995:xl_opt_validate] 0-glusterfs: option 'address-family'
>> > is deprecated, preferred is 'transport.address-family', continuing
>> > with correction
>> > [2018-02-22 15:53:16.269595] I [MSGID: 101190]
>> > [event-epoll.c:613:event_dispatch_epoll_worker] 0-epoll: Started
>> > thread with index 1
>> > [2018-02-22 15:53:16.273483] I [MSGID: 101190]
>> > [event-epoll.c:613:event_dispatch_epoll_worker] 0-epoll: Started
>> > thread with index 2
>> > [2018-02-22 15:53:16.273594] W [MSGID: 101174]
>> > [graph.c:363:_log_if_unknown_option] 0-virtimages-readdir-ahead:
>> > option 'parallel-readdir' is not recognized
>> > [2018-02-22 15:53:16.273703] I [MSGID: 114020] [client.c:2360:notify]
>> > 0-virtimages-client-0: parent translators are ready, attempting
>> > connect on transport
>> > [2018-02-22 15:53:16.276455] I [MSGID: 114020] [client.c:2360:notify]
>> > 0-virtimages-client-1: parent translators are ready, attempting
>> > connect on transport
>> > [2018-02-22 15:53:16.276683] I [rpc-clnt.c:1986:rpc_clnt_reconfig]
>> > 0-virtimages-client-0: changing port to 49152 (from 0)
>> > [2018-02-22 15:53:16.279191] I [MSGID: 114020] [client.c:2360:notify]
>> > 0-virtimages-client-2: parent translators are ready, attempting
>> > connect on transport
>> > [2018-02-22 15:53:16.282126] I [MSGID: 114057]
>> > [client-handshake.c:1478:select_server_supported_programs]
>> > 0-virtimages-client-0: Using Program GlusterFS 3.3, Num (1298437),
>> > Version (330)
>> > [2018-02-22 15:53:16.282573] I [MSGID: 114046]
>> > [client-handshake.c:1231:client_setvolume_cbk] 0-virtimages-client-0:
>> > Connected to virtimages-client-0, attached to remote volume
>> > '/mnt/virtimages/virtimgs'.
>> > [2018-02-22 15:53:16.282584] I [MSGID: 114047]
>> > [client-handshake.c:1242:client_setvolume_cbk] 0-virtimages-client-0:
>> > Server and Client lk-version numbers are not same, reopening the fds
>> > [2018-02-22 15:53:16.282665] I [MSGID: 108005]
>> > [afr-common.c:4929:__afr_handle_child_up_event]
>> > 0-virtimages-replicate-0: Subvolume 'virtimages-client-0' came back
>> > up; going online.
>> > [2018-02-22 15:53:16.282877] I [rpc-clnt.c:1986:rpc_clnt_reconfig]
>> > 0-virtimages-client-1: changing port to 49152 (from 0)
>> > [2018-02-22 15:53:16.282934] I [MSGID: 114035]
>> > [client-handshake.c:202:client_set_lk_version_cbk]
>> > 0-virtimages-client-0: Server lk version = 1
>> >
>> > _______________________________________________
>> > Users mailing list
>> > Users at ovirt.org
>> > http://lists.ovirt.org/mailman/listinfo/users
>>
>> -- 
>> Recogizer Group GmbH
>>
>> Dr.rer.nat. Erekle Magradze
>> Lead Big Data Engineering & DevOps
>> Rheinwerkallee 2, 53227 Bonn
>> Tel: +49 228 29974555
>>
>> E-Mail erekle.magradze at recogizer.de
>> recogizer.com
>>
>> -----------------------------------------------------------------
>>
>> Recogizer Group GmbH
>> Managing Directors: Oliver Habisch, Carsten Kreutze
>> Commercial Register: Amtsgericht Bonn HRB 20724
>> Registered office: Bonn; VAT ID No.: DE294195993
>> This e-mail contains confidential and/or legally protected 
>> information. If you are not the intended recipient or have received 
>> this e-mail in error, please notify the sender immediately and 
>> delete this e-mail. Unauthorized copying and unauthorized 
>> forwarding of this e-mail and the information it contains are not 
>> permitted.
>
