
Hi, after running systemctl status vdsm I am getting that it's running and this message at the end. Oct 16 14:26:52 hostname vdsmd[2392]: vdsm throttled WARN MOM not available. Oct 16 14:26:52 hostname vdsmd[2392]: vdsm throttled WARN MOM not available, KSM stats will be missing. Oct 16 14:26:57 hostname vdsmd[2392]: vdsm root WARN ping was deprecated in favor of ping2 and confirmConnectivity how critical it is? and how to solve that warning? I am using libvirt Cheers

Hi, it is just a warning, there is nothing you have to solve unless it does not resolve itself within a minute or so. If it happens only once or twice after vdsm or mom restart then you are fine. Best regards -- Martin Sivak SLA / oVirt On Mon, Oct 16, 2017 at 2:44 PM, Erekle Magradze <erekle.magradze@recogizer.de> wrote:
Hi,
after running
systemctl status vdsm I am getting that it's running and this message at the end.
Oct 16 14:26:52 hostname vdsmd[2392]: vdsm throttled WARN MOM not available. Oct 16 14:26:52 hostname vdsmd[2392]: vdsm throttled WARN MOM not available, KSM stats will be missing. Oct 16 14:26:57 hostname vdsmd[2392]: vdsm root WARN ping was deprecated in favor of ping2 and confirmConnectivity
how critical it is? and how to solve that warning?
I am using libvirt
Cheers
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users

Hi Martin, Thanks for the answer, unfortunately this warning message persists, does it mean that mom cannot communicate with libvirt? how critical is it? Best Erekle On 10/16/2017 03:03 PM, Martin Sivak wrote:
Hi,
it is just a warning, there is nothing you have to solve unless it does not resolve itself within a minute or so. If it happens only once or twice after vdsm or mom restart then you are fine.
Best regards
-- Martin Sivak SLA / oVirt
On Mon, Oct 16, 2017 at 2:44 PM, Erekle Magradze <erekle.magradze@recogizer.de> wrote:
Hi,
after running
systemctl status vdsm I am getting that it's running and this message at the end.
Oct 16 14:26:52 hostname vdsmd[2392]: vdsm throttled WARN MOM not available. Oct 16 14:26:52 hostname vdsmd[2392]: vdsm throttled WARN MOM not available, KSM stats will be missing. Oct 16 14:26:57 hostname vdsmd[2392]: vdsm root WARN ping was deprecated in favor of ping2 and confirmConnectivity
how critical it is? and how to solve that warning?
I am using libvirt
Cheers
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users

Hi, how do you start MOM? MOM is supposed to talk to vdsm, we do not talk to libvirt directly. The line you posted comes from vdsm and vdsm is telling you it can't talk to MOM. Which MOM service is enabled? Because there are two momd and mom-vdsm, the second one is the one that should be enabled. Best regards Martin Sivak On Mon, Oct 16, 2017 at 3:04 PM, Erekle Magradze <erekle.magradze@recogizer.de> wrote:
Hi Martin,
Thanks for the answer, unfortunately this warning message persists, does it mean that mom cannot communicate with libvirt? how critical is it?
Best
Erekle
On 10/16/2017 03:03 PM, Martin Sivak wrote:
Hi,
it is just a warning, there is nothing you have to solve unless it does not resolve itself within a minute or so. If it happens only once or twice after vdsm or mom restart then you are fine.
Best regards
-- Martin Sivak SLA / oVirt
On Mon, Oct 16, 2017 at 2:44 PM, Erekle Magradze <erekle.magradze@recogizer.de> wrote:
Hi,
after running
systemctl status vdsm I am getting that it's running and this message at the end.
Oct 16 14:26:52 hostname vdsmd[2392]: vdsm throttled WARN MOM not available. Oct 16 14:26:52 hostname vdsmd[2392]: vdsm throttled WARN MOM not available, KSM stats will be missing. Oct 16 14:26:57 hostname vdsmd[2392]: vdsm root WARN ping was deprecated in favor of ping2 and confirmConnectivity
how critical it is? and how to solve that warning?
I am using libvirt
Cheers
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users

This is a multi-part message in MIME format. --------------1614760AD3F5B452D5FFCFBD Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 8bit Hi, It's getting clear now, indeed momd service is disabled ● momd.service - Memory Overcommitment Manager Daemon Loaded: loaded (/usr/lib/systemd/system/momd.service; static; vendor preset: disabled) Active: inactive (dead) mom-vdsm is enable and running. ● mom-vdsm.service - MOM instance configured for VDSM purposes Loaded: loaded (/usr/lib/systemd/system/mom-vdsm.service; enabled; vendor preset: enabled) Active: active (running) since Mon 2017-10-16 15:14:35 CEST; 1min 3s ago Main PID: 27638 (python) CGroup: /system.slice/mom-vdsm.service └─27638 python /usr/sbin/momd -c /etc/vdsm/mom.conf The reason why I came up with digging in mom problems is the following problem *VDSM hostname command GetStatsVDSThanks failed: Connection reset by peer* that is causing fencing of the node where the failure is happening, what could be the reason of GetStatsVDS failure? Best Regards Erekle On 10/16/2017 03:11 PM, Martin Sivak wrote:
Hi,
how do you start MOM? MOM is supposed to talk to vdsm, we do not talk to libvirt directly. The line you posted comes from vdsm and vdsm is telling you it can't talk to MOM.
Which MOM service is enabled? Because there are two momd and mom-vdsm, the second one is the one that should be enabled.
Best regards
Martin Sivak
On Mon, Oct 16, 2017 at 3:04 PM, Erekle Magradze <erekle.magradze@recogizer.de> wrote:
Hi Martin,
Thanks for the answer, unfortunately this warning message persists, does it mean that mom cannot communicate with libvirt? how critical is it?
Best
Erekle
On 10/16/2017 03:03 PM, Martin Sivak wrote:
Hi,
it is just a warning, there is nothing you have to solve unless it does not resolve itself within a minute or so. If it happens only once or twice after vdsm or mom restart then you are fine.
Best regards
-- Martin Sivak SLA / oVirt
On Mon, Oct 16, 2017 at 2:44 PM, Erekle Magradze <erekle.magradze@recogizer.de> wrote:
Hi,
after running
systemctl status vdsm I am getting that it's running and this message at the end.
Oct 16 14:26:52 hostname vdsmd[2392]: vdsm throttled WARN MOM not available. Oct 16 14:26:52 hostname vdsmd[2392]: vdsm throttled WARN MOM not available, KSM stats will be missing. Oct 16 14:26:57 hostname vdsmd[2392]: vdsm root WARN ping was deprecated in favor of ping2 and confirmConnectivity
how critical it is? and how to solve that warning?
I am using libvirt
Cheers
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
--------------1614760AD3F5B452D5FFCFBD Content-Type: text/html; charset=utf-8 Content-Transfer-Encoding: 8bit <html> <head> <meta http-equiv="Content-Type" content="text/html; charset=utf-8"> </head> <body text="#000000" bgcolor="#FFFFFF"> <p>Hi,</p> <p>It's getting clear now, indeed momd service is disabled</p> <p>● momd.service - Memory Overcommitment Manager Daemon<br> Loaded: loaded (/usr/lib/systemd/system/momd.service; static; vendor preset: disabled)<br> Active: inactive (dead)</p> <p>mom-vdsm is enable and running.<br> </p> <p>● mom-vdsm.service - MOM instance configured for VDSM purposes<br> Loaded: loaded (/usr/lib/systemd/system/mom-vdsm.service; enabled; vendor preset: enabled)<br> Active: active (running) since Mon 2017-10-16 15:14:35 CEST; 1min 3s ago<br> Main PID: 27638 (python)<br> CGroup: /system.slice/mom-vdsm.service<br> └─27638 python /usr/sbin/momd -c /etc/vdsm/mom.conf<br> <br> The reason why I came up with digging in mom problems is the following problem<br> </p> <br> <b>VDSM hostname command GetStatsVDSThanks failed: Connection reset by peer</b><br> <br> that is causing fencing of the node where the failure is happening, what could be the reason of GetStatsVDS failure?<br> <br> Best Regards<br> Erekle<br> <br> <br> <div class="moz-cite-prefix">On 10/16/2017 03:11 PM, Martin Sivak wrote:<br> </div> <blockquote type="cite" cite="mid:CAF0zDV4VRC3zj4TYtfiHREFB7AxEN8Fj6fFwZN7MMk-mZQMHEA@mail.gmail.com"> <pre wrap="">Hi, how do you start MOM? MOM is supposed to talk to vdsm, we do not talk to libvirt directly. The line you posted comes from vdsm and vdsm is telling you it can't talk to MOM. Which MOM service is enabled? Because there are two momd and mom-vdsm, the second one is the one that should be enabled. Best regards Martin Sivak On Mon, Oct 16, 2017 at 3:04 PM, Erekle Magradze <a class="moz-txt-link-rfc2396E" href="mailto:erekle.magradze@recogizer.de"><erekle.magradze@recogizer.de></a> wrote: </pre> <blockquote type="cite"> <pre wrap="">Hi Martin, Thanks for the answer, unfortunately this warning message persists, does it mean that mom cannot communicate with libvirt? how critical is it? Best Erekle On 10/16/2017 03:03 PM, Martin Sivak wrote: </pre> <blockquote type="cite"> <pre wrap=""> Hi, it is just a warning, there is nothing you have to solve unless it does not resolve itself within a minute or so. If it happens only once or twice after vdsm or mom restart then you are fine. Best regards -- Martin Sivak SLA / oVirt On Mon, Oct 16, 2017 at 2:44 PM, Erekle Magradze <a class="moz-txt-link-rfc2396E" href="mailto:erekle.magradze@recogizer.de"><erekle.magradze@recogizer.de></a> wrote: </pre> <blockquote type="cite"> <pre wrap=""> Hi, after running systemctl status vdsm I am getting that it's running and this message at the end. Oct 16 14:26:52 hostname vdsmd[2392]: vdsm throttled WARN MOM not available. Oct 16 14:26:52 hostname vdsmd[2392]: vdsm throttled WARN MOM not available, KSM stats will be missing. Oct 16 14:26:57 hostname vdsmd[2392]: vdsm root WARN ping was deprecated in favor of ping2 and confirmConnectivity how critical it is? and how to solve that warning? I am using libvirt Cheers _______________________________________________ Users mailing list <a class="moz-txt-link-abbreviated" href="mailto:Users@ovirt.org">Users@ovirt.org</a> <a class="moz-txt-link-freetext" href="http://lists.ovirt.org/mailman/listinfo/users">http://lists.ovirt.org/mailman/listinfo/users</a> </pre> </blockquote> </blockquote> </blockquote> </blockquote> <br> </body> </html> --------------1614760AD3F5B452D5FFCFBD--

This is a multi-part message in MIME format. --------------19FBF4BCBBA63BEF231F366A Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 8bit It's was a typo in the failure message, that's what I was getting: *VDSM hostname command GetStatsVDS failed: Connection reset by peer* On 10/16/2017 03:21 PM, Erekle Magradze wrote:
Hi,
It's getting clear now, indeed momd service is disabled
● momd.service - Memory Overcommitment Manager Daemon Loaded: loaded (/usr/lib/systemd/system/momd.service; static; vendor preset: disabled) Active: inactive (dead)
mom-vdsm is enable and running.
● mom-vdsm.service - MOM instance configured for VDSM purposes Loaded: loaded (/usr/lib/systemd/system/mom-vdsm.service; enabled; vendor preset: enabled) Active: active (running) since Mon 2017-10-16 15:14:35 CEST; 1min 3s ago Main PID: 27638 (python) CGroup: /system.slice/mom-vdsm.service └─27638 python /usr/sbin/momd -c /etc/vdsm/mom.conf
The reason why I came up with digging in mom problems is the following problem
*VDSM hostname command GetStatsVDSThanks failed: Connection reset by peer*
that is causing fencing of the node where the failure is happening, what could be the reason of GetStatsVDS failure?
Best Regards Erekle
On 10/16/2017 03:11 PM, Martin Sivak wrote:
Hi,
how do you start MOM? MOM is supposed to talk to vdsm, we do not talk to libvirt directly. The line you posted comes from vdsm and vdsm is telling you it can't talk to MOM.
Which MOM service is enabled? Because there are two momd and mom-vdsm, the second one is the one that should be enabled.
Best regards
Martin Sivak
On Mon, Oct 16, 2017 at 3:04 PM, Erekle Magradze <erekle.magradze@recogizer.de> wrote:
Hi Martin,
Thanks for the answer, unfortunately this warning message persists, does it mean that mom cannot communicate with libvirt? how critical is it?
Best
Erekle
On 10/16/2017 03:03 PM, Martin Sivak wrote:
Hi,
it is just a warning, there is nothing you have to solve unless it does not resolve itself within a minute or so. If it happens only once or twice after vdsm or mom restart then you are fine.
Best regards
-- Martin Sivak SLA / oVirt
On Mon, Oct 16, 2017 at 2:44 PM, Erekle Magradze <erekle.magradze@recogizer.de> wrote:
Hi,
after running
systemctl status vdsm I am getting that it's running and this message at the end.
Oct 16 14:26:52 hostname vdsmd[2392]: vdsm throttled WARN MOM not available. Oct 16 14:26:52 hostname vdsmd[2392]: vdsm throttled WARN MOM not available, KSM stats will be missing. Oct 16 14:26:57 hostname vdsmd[2392]: vdsm root WARN ping was deprecated in favor of ping2 and confirmConnectivity
how critical it is? and how to solve that warning?
I am using libvirt
Cheers
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
-- Recogizer Group GmbH Dr.rer.nat. Erekle Magradze Lead Big Data Engineering & DevOps Rheinwerkallee 2, 53227 Bonn Tel: +49 228 29974555 E-Mail erekle.magradze@recogizer.de Web: www.recogizer.com Recogizer auf LinkedIn https://www.linkedin.com/company-beta/10039182/ Folgen Sie uns auf Twitter https://twitter.com/recogizer ----------------------------------------------------------------- Recogizer Group GmbH Geschäftsführer: Oliver Habisch, Carsten Kreutze Handelsregister: Amtsgericht Bonn HRB 20724 Sitz der Gesellschaft: Bonn; USt-ID-Nr.: DE294195993 Diese E-Mail enthält vertrauliche und/oder rechtlich geschützte Informationen. Wenn Sie nicht der richtige Adressat sind oder diese E-Mail irrtümlich erhalten haben, informieren Sie bitte sofort den Absender und löschen Sie diese Mail. Das unerlaubte Kopieren sowie die unbefugte Weitergabe dieser Mail und der darin enthaltenen Informationen ist nicht gestattet. --------------19FBF4BCBBA63BEF231F366A Content-Type: text/html; charset=utf-8 Content-Transfer-Encoding: 8bit <html> <head> <meta http-equiv="Content-Type" content="text/html; charset=utf-8"> </head> <body text="#000000" bgcolor="#FFFFFF"> <p>It's was a typo in the failure message, </p> <p>that's what I was getting:<br> </p> <p><b>VDSM hostname command GetStatsVDS failed: Connection reset by peer</b></p> <br> <div class="moz-cite-prefix">On 10/16/2017 03:21 PM, Erekle Magradze wrote:<br> </div> <blockquote type="cite" cite="mid:06f6522c-44ba-9b3f-384d-a4a56a59a6ff@recogizer.de"> <meta http-equiv="Content-Type" content="text/html; charset=utf-8"> <p>Hi,</p> <p>It's getting clear now, indeed momd service is disabled</p> <p>● momd.service - Memory Overcommitment Manager Daemon<br> Loaded: loaded (/usr/lib/systemd/system/momd.service; static; vendor preset: disabled)<br> Active: inactive (dead)</p> <p>mom-vdsm is enable and running.<br> </p> <p>● mom-vdsm.service - MOM instance configured for VDSM purposes<br> Loaded: loaded (/usr/lib/systemd/system/mom-vdsm.service; enabled; vendor preset: enabled)<br> Active: active (running) since Mon 2017-10-16 15:14:35 CEST; 1min 3s ago<br> Main PID: 27638 (python)<br> CGroup: /system.slice/mom-vdsm.service<br> └─27638 python /usr/sbin/momd -c /etc/vdsm/mom.conf<br> <br> The reason why I came up with digging in mom problems is the following problem<br> </p> <br> <b>VDSM hostname command GetStatsVDSThanks failed: Connection reset by peer</b><br> <br> that is causing fencing of the node where the failure is happening, what could be the reason of GetStatsVDS failure?<br> <br> Best Regards<br> Erekle<br> <br> <br> <div class="moz-cite-prefix">On 10/16/2017 03:11 PM, Martin Sivak wrote:<br> </div> <blockquote type="cite" cite="mid:CAF0zDV4VRC3zj4TYtfiHREFB7AxEN8Fj6fFwZN7MMk-mZQMHEA@mail.gmail.com"> <pre wrap="">Hi, how do you start MOM? MOM is supposed to talk to vdsm, we do not talk to libvirt directly. The line you posted comes from vdsm and vdsm is telling you it can't talk to MOM. Which MOM service is enabled? Because there are two momd and mom-vdsm, the second one is the one that should be enabled. Best regards Martin Sivak On Mon, Oct 16, 2017 at 3:04 PM, Erekle Magradze <a class="moz-txt-link-rfc2396E" href="mailto:erekle.magradze@recogizer.de" moz-do-not-send="true"><erekle.magradze@recogizer.de></a> wrote: </pre> <blockquote type="cite"> <pre wrap="">Hi Martin, Thanks for the answer, unfortunately this warning message persists, does it mean that mom cannot communicate with libvirt? how critical is it? Best Erekle On 10/16/2017 03:03 PM, Martin Sivak wrote: </pre> <blockquote type="cite"> <pre wrap="">Hi, it is just a warning, there is nothing you have to solve unless it does not resolve itself within a minute or so. If it happens only once or twice after vdsm or mom restart then you are fine. Best regards -- Martin Sivak SLA / oVirt On Mon, Oct 16, 2017 at 2:44 PM, Erekle Magradze <a class="moz-txt-link-rfc2396E" href="mailto:erekle.magradze@recogizer.de" moz-do-not-send="true"><erekle.magradze@recogizer.de></a> wrote: </pre> <blockquote type="cite"> <pre wrap="">Hi, after running systemctl status vdsm I am getting that it's running and this message at the end. Oct 16 14:26:52 hostname vdsmd[2392]: vdsm throttled WARN MOM not available. Oct 16 14:26:52 hostname vdsmd[2392]: vdsm throttled WARN MOM not available, KSM stats will be missing. Oct 16 14:26:57 hostname vdsmd[2392]: vdsm root WARN ping was deprecated in favor of ping2 and confirmConnectivity how critical it is? and how to solve that warning? I am using libvirt Cheers _______________________________________________ Users mailing list <a class="moz-txt-link-abbreviated" href="mailto:Users@ovirt.org" moz-do-not-send="true">Users@ovirt.org</a> <a class="moz-txt-link-freetext" href="http://lists.ovirt.org/mailman/listinfo/users" moz-do-not-send="true">http://lists.ovirt.org/mailman/listinfo/users</a> </pre> </blockquote> </blockquote> </blockquote> </blockquote> <br> <br> <fieldset class="mimeAttachmentHeader"></fieldset> <br> <pre wrap="">_______________________________________________ Users mailing list <a class="moz-txt-link-abbreviated" href="mailto:Users@ovirt.org">Users@ovirt.org</a> <a class="moz-txt-link-freetext" href="http://lists.ovirt.org/mailman/listinfo/users">http://lists.ovirt.org/mailman/listinfo/users</a> </pre> </blockquote> <br> <pre class="moz-signature" cols="72">-- Recogizer Group GmbH Dr.rer.nat. Erekle Magradze Lead Big Data Engineering & DevOps Rheinwerkallee 2, 53227 Bonn Tel: +49 228 29974555 E-Mail <a class="moz-txt-link-abbreviated" href="mailto:erekle.magradze@recogizer.de">erekle.magradze@recogizer.de</a> Web: <a class="moz-txt-link-abbreviated" href="http://www.recogizer.com">www.recogizer.com</a> Recogizer auf LinkedIn <a class="moz-txt-link-freetext" href="https://www.linkedin.com/company-beta/10039182/">https://www.linkedin.com/company-beta/10039182/</a> Folgen Sie uns auf Twitter <a class="moz-txt-link-freetext" href="https://twitter.com/recogizer">https://twitter.com/recogizer</a> ----------------------------------------------------------------- Recogizer Group GmbH Geschäftsführer: Oliver Habisch, Carsten Kreutze Handelsregister: Amtsgericht Bonn HRB 20724 Sitz der Gesellschaft: Bonn; USt-ID-Nr.: DE294195993 Diese E-Mail enthält vertrauliche und/oder rechtlich geschützte Informationen. Wenn Sie nicht der richtige Adressat sind oder diese E-Mail irrtümlich erhalten haben, informieren Sie bitte sofort den Absender und löschen Sie diese Mail. Das unerlaubte Kopieren sowie die unbefugte Weitergabe dieser Mail und der darin enthaltenen Informationen ist nicht gestattet.</pre> </body> </html> --------------19FBF4BCBBA63BEF231F366A--

This is a multi-part message in MIME format. --------------BD80C71AA18C6FFCFC3029E1 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit Hi, Can you please tell us what is the issue that you are actually facing? :) it would be easier to debug an issue and not an error message that can be cause by several things. Also, can you provide the engine and the vdsm logs? thank you, Dafna On 10/16/2017 02:30 PM, Erekle Magradze wrote:
It's was a typo in the failure message,
that's what I was getting:
*VDSM hostname command GetStatsVDS failed: Connection reset by peer*
On 10/16/2017 03:21 PM, Erekle Magradze wrote:
Hi,
It's getting clear now, indeed momd service is disabled
● momd.service - Memory Overcommitment Manager Daemon Loaded: loaded (/usr/lib/systemd/system/momd.service; static; vendor preset: disabled) Active: inactive (dead)
mom-vdsm is enable and running.
● mom-vdsm.service - MOM instance configured for VDSM purposes Loaded: loaded (/usr/lib/systemd/system/mom-vdsm.service; enabled; vendor preset: enabled) Active: active (running) since Mon 2017-10-16 15:14:35 CEST; 1min 3s ago Main PID: 27638 (python) CGroup: /system.slice/mom-vdsm.service └─27638 python /usr/sbin/momd -c /etc/vdsm/mom.conf
The reason why I came up with digging in mom problems is the following problem
*VDSM hostname command GetStatsVDSThanks failed: Connection reset by peer*
that is causing fencing of the node where the failure is happening, what could be the reason of GetStatsVDS failure?
Best Regards Erekle
On 10/16/2017 03:11 PM, Martin Sivak wrote:
Hi,
how do you start MOM? MOM is supposed to talk to vdsm, we do not talk to libvirt directly. The line you posted comes from vdsm and vdsm is telling you it can't talk to MOM.
Which MOM service is enabled? Because there are two momd and mom-vdsm, the second one is the one that should be enabled.
Best regards
Martin Sivak
On Mon, Oct 16, 2017 at 3:04 PM, Erekle Magradze <erekle.magradze@recogizer.de> wrote:
Hi Martin,
Thanks for the answer, unfortunately this warning message persists, does it mean that mom cannot communicate with libvirt? how critical is it?
Best
Erekle
On 10/16/2017 03:03 PM, Martin Sivak wrote:
Hi,
it is just a warning, there is nothing you have to solve unless it does not resolve itself within a minute or so. If it happens only once or twice after vdsm or mom restart then you are fine.
Best regards
-- Martin Sivak SLA / oVirt
On Mon, Oct 16, 2017 at 2:44 PM, Erekle Magradze <erekle.magradze@recogizer.de> wrote:
Hi,
after running
systemctl status vdsm I am getting that it's running and this message at the end.
Oct 16 14:26:52 hostname vdsmd[2392]: vdsm throttled WARN MOM not available. Oct 16 14:26:52 hostname vdsmd[2392]: vdsm throttled WARN MOM not available, KSM stats will be missing. Oct 16 14:26:57 hostname vdsmd[2392]: vdsm root WARN ping was deprecated in favor of ping2 and confirmConnectivity
how critical it is? and how to solve that warning?
I am using libvirt
Cheers
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
-- Recogizer Group GmbH
Dr.rer.nat. Erekle Magradze Lead Big Data Engineering & DevOps Rheinwerkallee 2, 53227 Bonn Tel: +49 228 29974555
E-Mail erekle.magradze@recogizer.de Web: www.recogizer.com
Recogizer auf LinkedIn https://www.linkedin.com/company-beta/10039182/ Folgen Sie uns auf Twitter https://twitter.com/recogizer
----------------------------------------------------------------- Recogizer Group GmbH Geschäftsführer: Oliver Habisch, Carsten Kreutze Handelsregister: Amtsgericht Bonn HRB 20724 Sitz der Gesellschaft: Bonn; USt-ID-Nr.: DE294195993
Diese E-Mail enthält vertrauliche und/oder rechtlich geschützte Informationen. Wenn Sie nicht der richtige Adressat sind oder diese E-Mail irrtümlich erhalten haben, informieren Sie bitte sofort den Absender und löschen Sie diese Mail. Das unerlaubte Kopieren sowie die unbefugte Weitergabe dieser Mail und der darin enthaltenen Informationen ist nicht gestattet.
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
--------------BD80C71AA18C6FFCFC3029E1 Content-Type: text/html; charset=utf-8 Content-Transfer-Encoding: 8bit <html> <head> <meta http-equiv="Content-Type" content="text/html; charset=utf-8"> </head> <body text="#000000" bgcolor="#FFFFFF"> <div class="moz-cite-prefix">Hi, <br> <br> Can you please tell us what is the issue that you are actually facing? :) it would be easier to debug an issue and not an error message that can be cause by several things. <br> <br> Also, can you provide the engine and the vdsm logs? <br> <br> thank you, <br> Dafna<br> <br> <br> On 10/16/2017 02:30 PM, Erekle Magradze wrote:<br> </div> <blockquote type="cite" cite="mid:42870370-5d54-85c4-d4b5-69c5498b44a1@recogizer.de"> <meta http-equiv="Content-Type" content="text/html; charset=utf-8"> <p>It's was a typo in the failure message, </p> <p>that's what I was getting:<br> </p> <p><b>VDSM hostname command GetStatsVDS failed: Connection reset by peer</b></p> <br> <div class="moz-cite-prefix">On 10/16/2017 03:21 PM, Erekle Magradze wrote:<br> </div> <blockquote type="cite" cite="mid:06f6522c-44ba-9b3f-384d-a4a56a59a6ff@recogizer.de"> <meta http-equiv="Content-Type" content="text/html; charset=utf-8"> <p>Hi,</p> <p>It's getting clear now, indeed momd service is disabled</p> <p>● momd.service - Memory Overcommitment Manager Daemon<br> Loaded: loaded (/usr/lib/systemd/system/momd.service; static; vendor preset: disabled)<br> Active: inactive (dead)</p> <p>mom-vdsm is enable and running.<br> </p> <p>● mom-vdsm.service - MOM instance configured for VDSM purposes<br> Loaded: loaded (/usr/lib/systemd/system/mom-vdsm.service; enabled; vendor preset: enabled)<br> Active: active (running) since Mon 2017-10-16 15:14:35 CEST; 1min 3s ago<br> Main PID: 27638 (python)<br> CGroup: /system.slice/mom-vdsm.service<br> └─27638 python /usr/sbin/momd -c /etc/vdsm/mom.conf<br> <br> The reason why I came up with digging in mom problems is the following problem<br> </p> <br> <b>VDSM hostname command GetStatsVDSThanks failed: Connection reset by peer</b><br> <br> that is causing fencing of the node where the failure is happening, what could be the reason of GetStatsVDS failure?<br> <br> Best Regards<br> Erekle<br> <br> <br> <div class="moz-cite-prefix">On 10/16/2017 03:11 PM, Martin Sivak wrote:<br> </div> <blockquote type="cite" cite="mid:CAF0zDV4VRC3zj4TYtfiHREFB7AxEN8Fj6fFwZN7MMk-mZQMHEA@mail.gmail.com"> <pre wrap="">Hi, how do you start MOM? MOM is supposed to talk to vdsm, we do not talk to libvirt directly. The line you posted comes from vdsm and vdsm is telling you it can't talk to MOM. Which MOM service is enabled? Because there are two momd and mom-vdsm, the second one is the one that should be enabled. Best regards Martin Sivak On Mon, Oct 16, 2017 at 3:04 PM, Erekle Magradze <a class="moz-txt-link-rfc2396E" href="mailto:erekle.magradze@recogizer.de" moz-do-not-send="true"><erekle.magradze@recogizer.de></a> wrote: </pre> <blockquote type="cite"> <pre wrap="">Hi Martin, Thanks for the answer, unfortunately this warning message persists, does it mean that mom cannot communicate with libvirt? how critical is it? Best Erekle On 10/16/2017 03:03 PM, Martin Sivak wrote: </pre> <blockquote type="cite"> <pre wrap="">Hi, it is just a warning, there is nothing you have to solve unless it does not resolve itself within a minute or so. If it happens only once or twice after vdsm or mom restart then you are fine. Best regards -- Martin Sivak SLA / oVirt On Mon, Oct 16, 2017 at 2:44 PM, Erekle Magradze <a class="moz-txt-link-rfc2396E" href="mailto:erekle.magradze@recogizer.de" moz-do-not-send="true"><erekle.magradze@recogizer.de></a> wrote: </pre> <blockquote type="cite"> <pre wrap="">Hi, after running systemctl status vdsm I am getting that it's running and this message at the end. Oct 16 14:26:52 hostname vdsmd[2392]: vdsm throttled WARN MOM not available. Oct 16 14:26:52 hostname vdsmd[2392]: vdsm throttled WARN MOM not available, KSM stats will be missing. Oct 16 14:26:57 hostname vdsmd[2392]: vdsm root WARN ping was deprecated in favor of ping2 and confirmConnectivity how critical it is? and how to solve that warning? I am using libvirt Cheers _______________________________________________ Users mailing list <a class="moz-txt-link-abbreviated" href="mailto:Users@ovirt.org" moz-do-not-send="true">Users@ovirt.org</a> <a class="moz-txt-link-freetext" href="http://lists.ovirt.org/mailman/listinfo/users" moz-do-not-send="true">http://lists.ovirt.org/mailman/listinfo/users</a> </pre> </blockquote> </blockquote> </blockquote> </blockquote> <br> <br> <fieldset class="mimeAttachmentHeader"></fieldset> <br> <pre wrap="">_______________________________________________ Users mailing list <a class="moz-txt-link-abbreviated" href="mailto:Users@ovirt.org" moz-do-not-send="true">Users@ovirt.org</a> <a class="moz-txt-link-freetext" href="http://lists.ovirt.org/mailman/listinfo/users" moz-do-not-send="true">http://lists.ovirt.org/mailman/listinfo/users</a> </pre> </blockquote> <br> <pre class="moz-signature" cols="72">-- Recogizer Group GmbH Dr.rer.nat. Erekle Magradze Lead Big Data Engineering & DevOps Rheinwerkallee 2, 53227 Bonn Tel: +49 228 29974555 E-Mail <a class="moz-txt-link-abbreviated" href="mailto:erekle.magradze@recogizer.de" moz-do-not-send="true">erekle.magradze@recogizer.de</a> Web: <a class="moz-txt-link-abbreviated" href="http://www.recogizer.com" moz-do-not-send="true">www.recogizer.com</a> Recogizer auf LinkedIn <a class="moz-txt-link-freetext" href="https://www.linkedin.com/company-beta/10039182/" moz-do-not-send="true">https://www.linkedin.com/company-beta/10039182/</a> Folgen Sie uns auf Twitter <a class="moz-txt-link-freetext" href="https://twitter.com/recogizer" moz-do-not-send="true">https://twitter.com/recogizer</a> ----------------------------------------------------------------- Recogizer Group GmbH Geschäftsführer: Oliver Habisch, Carsten Kreutze Handelsregister: Amtsgericht Bonn HRB 20724 Sitz der Gesellschaft: Bonn; USt-ID-Nr.: DE294195993 Diese E-Mail enthält vertrauliche und/oder rechtlich geschützte Informationen. Wenn Sie nicht der richtige Adressat sind oder diese E-Mail irrtümlich erhalten haben, informieren Sie bitte sofort den Absender und löschen Sie diese Mail. Das unerlaubte Kopieren sowie die unbefugte Weitergabe dieser Mail und der darin enthaltenen Informationen ist nicht gestattet.</pre> <br> <fieldset class="mimeAttachmentHeader"></fieldset> <br> <pre wrap="">_______________________________________________ Users mailing list <a class="moz-txt-link-abbreviated" href="mailto:Users@ovirt.org">Users@ovirt.org</a> <a class="moz-txt-link-freetext" href="http://lists.ovirt.org/mailman/listinfo/users">http://lists.ovirt.org/mailman/listinfo/users</a> </pre> </blockquote> <p><br> </p> </body> </html> --------------BD80C71AA18C6FFCFC3029E1--

Hi, The issue is the following, after installation of ovirt 4.1 on three nodes with glusterFS as a storage, oVirt engine reported the failed events, with the following message *VDSM hostname command GetStatsVDS failed: Connection reset by peer* after that oVirt was trying to fence the affected host and it was excluded from production, luckily I am not running any VMs on it yet. The**logs are attached, don't be surprised with the hostnames :) Thanks in advance Cheers Erekle On 10/16/2017 03:37 PM, Dafna Ron wrote:
Hi,
Can you please tell us what is the issue that you are actually facing? :) it would be easier to debug an issue and not an error message that can be cause by several things.
Also, can you provide the engine and the vdsm logs?
thank you, Dafna
On 10/16/2017 02:30 PM, Erekle Magradze wrote:
It's was a typo in the failure message,
that's what I was getting:
*VDSM hostname command GetStatsVDS failed: Connection reset by peer*
On 10/16/2017 03:21 PM, Erekle Magradze wrote:
Hi,
It's getting clear now, indeed momd service is disabled
● momd.service - Memory Overcommitment Manager Daemon Loaded: loaded (/usr/lib/systemd/system/momd.service; static; vendor preset: disabled) Active: inactive (dead)
mom-vdsm is enable and running.
● mom-vdsm.service - MOM instance configured for VDSM purposes Loaded: loaded (/usr/lib/systemd/system/mom-vdsm.service; enabled; vendor preset: enabled) Active: active (running) since Mon 2017-10-16 15:14:35 CEST; 1min 3s ago Main PID: 27638 (python) CGroup: /system.slice/mom-vdsm.service └─27638 python /usr/sbin/momd -c /etc/vdsm/mom.conf
The reason why I came up with digging in mom problems is the following problem
*VDSM hostname command GetStatsVDSThanks failed: Connection reset by peer*
that is causing fencing of the node where the failure is happening, what could be the reason of GetStatsVDS failure?
Best Regards Erekle
On 10/16/2017 03:11 PM, Martin Sivak wrote:
Hi,
how do you start MOM? MOM is supposed to talk to vdsm, we do not talk to libvirt directly. The line you posted comes from vdsm and vdsm is telling you it can't talk to MOM.
Which MOM service is enabled? Because there are two momd and mom-vdsm, the second one is the one that should be enabled.
Best regards
Martin Sivak
On Mon, Oct 16, 2017 at 3:04 PM, Erekle Magradze <erekle.magradze@recogizer.de> wrote:
Hi Martin,
Thanks for the answer, unfortunately this warning message persists, does it mean that mom cannot communicate with libvirt? how critical is it?
Best
Erekle
On 10/16/2017 03:03 PM, Martin Sivak wrote:
Hi,
it is just a warning, there is nothing you have to solve unless it does not resolve itself within a minute or so. If it happens only once or twice after vdsm or mom restart then you are fine.
Best regards
-- Martin Sivak SLA / oVirt
On Mon, Oct 16, 2017 at 2:44 PM, Erekle Magradze <erekle.magradze@recogizer.de> wrote: > Hi, > > after running > > systemctl status vdsm I am getting that it's running and this message at > the > end. > > Oct 16 14:26:52 hostname vdsmd[2392]: vdsm throttled WARN MOM not > available. > Oct 16 14:26:52 hostname vdsmd[2392]: vdsm throttled WARN MOM not > available, > KSM stats will be missing. > Oct 16 14:26:57 hostname vdsmd[2392]: vdsm root WARN ping was deprecated > in > favor of ping2 and confirmConnectivity > > how critical it is? and how to solve that warning? > > I am using libvirt > > Cheers > > _______________________________________________ > Users mailing list > Users@ovirt.org > http://lists.ovirt.org/mailman/listinfo/users
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
-- Recogizer Group GmbH
Dr.rer.nat. Erekle Magradze Lead Big Data Engineering & DevOps Rheinwerkallee 2, 53227 Bonn Tel: +49 228 29974555
E-Mailerekle.magradze@recogizer.de Web:www.recogizer.com
Recogizer auf LinkedInhttps://www.linkedin.com/company-beta/10039182/ Folgen Sie uns auf Twitterhttps://twitter.com/recogizer
----------------------------------------------------------------- Recogizer Group GmbH Geschäftsführer: Oliver Habisch, Carsten Kreutze Handelsregister: Amtsgericht Bonn HRB 20724 Sitz der Gesellschaft: Bonn; USt-ID-Nr.: DE294195993
Diese E-Mail enthält vertrauliche und/oder rechtlich geschützte Informationen. Wenn Sie nicht der richtige Adressat sind oder diese E-Mail irrtümlich erhalten haben, informieren Sie bitte sofort den Absender und löschen Sie diese Mail. Das unerlaubte Kopieren sowie die unbefugte Weitergabe dieser Mail und der darin enthaltenen Informationen ist nicht gestattet.
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
-- Recogizer Group GmbH Dr.rer.nat. Erekle Magradze Lead Big Data Engineering & DevOps Rheinwerkallee 2, 53227 Bonn Tel: +49 228 29974555 E-Mail erekle.magradze@recogizer.de Web: www.recogizer.com Recogizer auf LinkedIn https://www.linkedin.com/company-beta/10039182/ Folgen Sie uns auf Twitter https://twitter.com/recogizer ----------------------------------------------------------------- Recogizer Group GmbH Geschäftsführer: Oliver Habisch, Carsten Kreutze Handelsregister: Amtsgericht Bonn HRB 20724 Sitz der Gesellschaft: Bonn; USt-ID-Nr.: DE294195993 Diese E-Mail enthält vertrauliche und/oder rechtlich geschützte Informationen. Wenn Sie nicht der richtige Adressat sind oder diese E-Mail irrtümlich erhalten haben, informieren Sie bitte sofort den Absender und löschen Sie diese Mail. Das unerlaubte Kopieren sowie die unbefugte Weitergabe dieser Mail und der darin enthaltenen Informationen ist nicht gestattet.

Erekle, In the logs you provided I see: IOError: [Errno 5] _handleRequests._checkForMail - Could not read mailbox: /rhev/data-center/6d52512e-1c02-4509-880a-bf57cbad4bdf/mastersd/dom_md/inbox and StorageDomainMasterError: Error validating master storage domain: ('MD read error',) which seems to be cause for vdsm being killed by sanlock which caused connection reset by peer. After vdsm restart storage looks good. @Nir can you take a look? Thanks, Piotr On Mon, Oct 16, 2017 at 3:59 PM, Erekle Magradze <erekle.magradze@recogizer.de> wrote:
Hi,
The issue is the following, after installation of ovirt 4.1 on three nodes with glusterFS as a storage, oVirt engine reported the failed events, with the following message
VDSM hostname command GetStatsVDS failed: Connection reset by peer
after that oVirt was trying to fence the affected host and it was excluded from production, luckily I am not running any VMs on it yet.
The logs are attached, don't be surprised with the hostnames :)
Thanks in advance
Cheers
Erekle
On 10/16/2017 03:37 PM, Dafna Ron wrote:
Hi,
Can you please tell us what is the issue that you are actually facing? :) it would be easier to debug an issue and not an error message that can be cause by several things.
Also, can you provide the engine and the vdsm logs?
thank you, Dafna
On 10/16/2017 02:30 PM, Erekle Magradze wrote:
It's was a typo in the failure message,
that's what I was getting:
VDSM hostname command GetStatsVDS failed: Connection reset by peer
On 10/16/2017 03:21 PM, Erekle Magradze wrote:
Hi,
It's getting clear now, indeed momd service is disabled
● momd.service - Memory Overcommitment Manager Daemon Loaded: loaded (/usr/lib/systemd/system/momd.service; static; vendor preset: disabled) Active: inactive (dead)
mom-vdsm is enable and running.
● mom-vdsm.service - MOM instance configured for VDSM purposes Loaded: loaded (/usr/lib/systemd/system/mom-vdsm.service; enabled; vendor preset: enabled) Active: active (running) since Mon 2017-10-16 15:14:35 CEST; 1min 3s ago Main PID: 27638 (python) CGroup: /system.slice/mom-vdsm.service └─27638 python /usr/sbin/momd -c /etc/vdsm/mom.conf
The reason why I came up with digging in mom problems is the following problem
VDSM hostname command GetStatsVDSThanks failed: Connection reset by peer
that is causing fencing of the node where the failure is happening, what could be the reason of GetStatsVDS failure?
Best Regards Erekle
On 10/16/2017 03:11 PM, Martin Sivak wrote:
Hi,
how do you start MOM? MOM is supposed to talk to vdsm, we do not talk to libvirt directly. The line you posted comes from vdsm and vdsm is telling you it can't talk to MOM.
Which MOM service is enabled? Because there are two momd and mom-vdsm, the second one is the one that should be enabled.
Best regards
Martin Sivak
On Mon, Oct 16, 2017 at 3:04 PM, Erekle Magradze <erekle.magradze@recogizer.de> wrote:
Hi Martin,
Thanks for the answer, unfortunately this warning message persists, does it mean that mom cannot communicate with libvirt? how critical is it?
Best
Erekle
On 10/16/2017 03:03 PM, Martin Sivak wrote:
Hi,
it is just a warning, there is nothing you have to solve unless it does not resolve itself within a minute or so. If it happens only once or twice after vdsm or mom restart then you are fine.
Best regards
-- Martin Sivak SLA / oVirt
On Mon, Oct 16, 2017 at 2:44 PM, Erekle Magradze <erekle.magradze@recogizer.de> wrote:
Hi,
after running
systemctl status vdsm I am getting that it's running and this message at the end.
Oct 16 14:26:52 hostname vdsmd[2392]: vdsm throttled WARN MOM not available. Oct 16 14:26:52 hostname vdsmd[2392]: vdsm throttled WARN MOM not available, KSM stats will be missing. Oct 16 14:26:57 hostname vdsmd[2392]: vdsm root WARN ping was deprecated in favor of ping2 and confirmConnectivity
how critical it is? and how to solve that warning?
I am using libvirt
Cheers
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
-- Recogizer Group GmbH
Dr.rer.nat. Erekle Magradze Lead Big Data Engineering & DevOps Rheinwerkallee 2, 53227 Bonn Tel: +49 228 29974555
E-Mail erekle.magradze@recogizer.de Web: www.recogizer.com
Recogizer auf LinkedIn https://www.linkedin.com/company-beta/10039182/ Folgen Sie uns auf Twitter https://twitter.com/recogizer
----------------------------------------------------------------- Recogizer Group GmbH Geschäftsführer: Oliver Habisch, Carsten Kreutze Handelsregister: Amtsgericht Bonn HRB 20724 Sitz der Gesellschaft: Bonn; USt-ID-Nr.: DE294195993
Diese E-Mail enthält vertrauliche und/oder rechtlich geschützte Informationen. Wenn Sie nicht der richtige Adressat sind oder diese E-Mail irrtümlich erhalten haben, informieren Sie bitte sofort den Absender und löschen Sie diese Mail. Das unerlaubte Kopieren sowie die unbefugte Weitergabe dieser Mail und der darin enthaltenen Informationen ist nicht gestattet.
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
-- Recogizer Group GmbH
Dr.rer.nat. Erekle Magradze Lead Big Data Engineering & DevOps Rheinwerkallee 2, 53227 Bonn Tel: +49 228 29974555
E-Mail erekle.magradze@recogizer.de Web: www.recogizer.com
Recogizer auf LinkedIn https://www.linkedin.com/company-beta/10039182/ Folgen Sie uns auf Twitter https://twitter.com/recogizer
----------------------------------------------------------------- Recogizer Group GmbH Geschäftsführer: Oliver Habisch, Carsten Kreutze Handelsregister: Amtsgericht Bonn HRB 20724 Sitz der Gesellschaft: Bonn; USt-ID-Nr.: DE294195993
Diese E-Mail enthält vertrauliche und/oder rechtlich geschützte Informationen. Wenn Sie nicht der richtige Adressat sind oder diese E-Mail irrtümlich erhalten haben, informieren Sie bitte sofort den Absender und löschen Sie diese Mail. Das unerlaubte Kopieren sowie die unbefugte Weitergabe dieser Mail und der darin enthaltenen Informationen ist nicht gestattet.
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users

Hi Piotr, Several times I've restarted vdsm daemon on certain nods, that could be the reason. The failure, I've mentioned, has happened yesterday from 15:00 to 17:00 Cheers Erekle On 10/16/2017 04:13 PM, Piotr Kliczewski wrote:
Erekle,
In the logs you provided I see:
IOError: [Errno 5] _handleRequests._checkForMail - Could not read mailbox: /rhev/data-center/6d52512e-1c02-4509-880a-bf57cbad4bdf/mastersd/dom_md/inbox
and
StorageDomainMasterError: Error validating master storage domain: ('MD read error',)
which seems to be cause for vdsm being killed by sanlock which caused connection reset by peer.
After vdsm restart storage looks good.
@Nir can you take a look?
Thanks, Piotr
On Mon, Oct 16, 2017 at 3:59 PM, Erekle Magradze <erekle.magradze@recogizer.de> wrote:
Hi,
The issue is the following, after installation of ovirt 4.1 on three nodes with glusterFS as a storage, oVirt engine reported the failed events, with the following message
VDSM hostname command GetStatsVDS failed: Connection reset by peer
after that oVirt was trying to fence the affected host and it was excluded from production, luckily I am not running any VMs on it yet.
The logs are attached, don't be surprised with the hostnames :)
Thanks in advance
Cheers
Erekle
On 10/16/2017 03:37 PM, Dafna Ron wrote:
Hi,
Can you please tell us what is the issue that you are actually facing? :) it would be easier to debug an issue and not an error message that can be cause by several things.
Also, can you provide the engine and the vdsm logs?
thank you, Dafna
On 10/16/2017 02:30 PM, Erekle Magradze wrote:
It's was a typo in the failure message,
that's what I was getting:
VDSM hostname command GetStatsVDS failed: Connection reset by peer
On 10/16/2017 03:21 PM, Erekle Magradze wrote:
Hi,
It's getting clear now, indeed momd service is disabled
● momd.service - Memory Overcommitment Manager Daemon Loaded: loaded (/usr/lib/systemd/system/momd.service; static; vendor preset: disabled) Active: inactive (dead)
mom-vdsm is enable and running.
● mom-vdsm.service - MOM instance configured for VDSM purposes Loaded: loaded (/usr/lib/systemd/system/mom-vdsm.service; enabled; vendor preset: enabled) Active: active (running) since Mon 2017-10-16 15:14:35 CEST; 1min 3s ago Main PID: 27638 (python) CGroup: /system.slice/mom-vdsm.service └─27638 python /usr/sbin/momd -c /etc/vdsm/mom.conf
The reason why I came up with digging in mom problems is the following problem
VDSM hostname command GetStatsVDSThanks failed: Connection reset by peer
that is causing fencing of the node where the failure is happening, what could be the reason of GetStatsVDS failure?
Best Regards Erekle
On 10/16/2017 03:11 PM, Martin Sivak wrote:
Hi,
how do you start MOM? MOM is supposed to talk to vdsm, we do not talk to libvirt directly. The line you posted comes from vdsm and vdsm is telling you it can't talk to MOM.
Which MOM service is enabled? Because there are two momd and mom-vdsm, the second one is the one that should be enabled.
Best regards
Martin Sivak
On Mon, Oct 16, 2017 at 3:04 PM, Erekle Magradze <erekle.magradze@recogizer.de> wrote:
Hi Martin,
Thanks for the answer, unfortunately this warning message persists, does it mean that mom cannot communicate with libvirt? how critical is it?
Best
Erekle
On 10/16/2017 03:03 PM, Martin Sivak wrote:
Hi,
it is just a warning, there is nothing you have to solve unless it does not resolve itself within a minute or so. If it happens only once or twice after vdsm or mom restart then you are fine.
Best regards
-- Martin Sivak SLA / oVirt
On Mon, Oct 16, 2017 at 2:44 PM, Erekle Magradze <erekle.magradze@recogizer.de> wrote:
Hi,
after running
systemctl status vdsm I am getting that it's running and this message at the end.
Oct 16 14:26:52 hostname vdsmd[2392]: vdsm throttled WARN MOM not available. Oct 16 14:26:52 hostname vdsmd[2392]: vdsm throttled WARN MOM not available, KSM stats will be missing. Oct 16 14:26:57 hostname vdsmd[2392]: vdsm root WARN ping was deprecated in favor of ping2 and confirmConnectivity
how critical it is? and how to solve that warning?
I am using libvirt
Cheers
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
-- Recogizer Group GmbH
Dr.rer.nat. Erekle Magradze Lead Big Data Engineering & DevOps Rheinwerkallee 2, 53227 Bonn Tel: +49 228 29974555
E-Mail erekle.magradze@recogizer.de Web: www.recogizer.com
Recogizer auf LinkedIn https://www.linkedin.com/company-beta/10039182/ Folgen Sie uns auf Twitter https://twitter.com/recogizer
----------------------------------------------------------------- Recogizer Group GmbH Geschäftsführer: Oliver Habisch, Carsten Kreutze Handelsregister: Amtsgericht Bonn HRB 20724 Sitz der Gesellschaft: Bonn; USt-ID-Nr.: DE294195993
Diese E-Mail enthält vertrauliche und/oder rechtlich geschützte Informationen. Wenn Sie nicht der richtige Adressat sind oder diese E-Mail irrtümlich erhalten haben, informieren Sie bitte sofort den Absender und löschen Sie diese Mail. Das unerlaubte Kopieren sowie die unbefugte Weitergabe dieser Mail und der darin enthaltenen Informationen ist nicht gestattet.
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
-- Recogizer Group GmbH
Dr.rer.nat. Erekle Magradze Lead Big Data Engineering & DevOps Rheinwerkallee 2, 53227 Bonn Tel: +49 228 29974555
E-Mail erekle.magradze@recogizer.de Web: www.recogizer.com
Recogizer auf LinkedIn https://www.linkedin.com/company-beta/10039182/ Folgen Sie uns auf Twitter https://twitter.com/recogizer
----------------------------------------------------------------- Recogizer Group GmbH Geschäftsführer: Oliver Habisch, Carsten Kreutze Handelsregister: Amtsgericht Bonn HRB 20724 Sitz der Gesellschaft: Bonn; USt-ID-Nr.: DE294195993
Diese E-Mail enthält vertrauliche und/oder rechtlich geschützte Informationen. Wenn Sie nicht der richtige Adressat sind oder diese E-Mail irrtümlich erhalten haben, informieren Sie bitte sofort den Absender und löschen Sie diese Mail. Das unerlaubte Kopieren sowie die unbefugte Weitergabe dieser Mail und der darin enthaltenen Informationen ist nicht gestattet.
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
-- Recogizer Group GmbH Dr.rer.nat. Erekle Magradze Lead Big Data Engineering & DevOps Rheinwerkallee 2, 53227 Bonn Tel: +49 228 29974555 E-Mail erekle.magradze@recogizer.de Web: www.recogizer.com Recogizer auf LinkedIn https://www.linkedin.com/company-beta/10039182/ Folgen Sie uns auf Twitter https://twitter.com/recogizer ----------------------------------------------------------------- Recogizer Group GmbH Geschäftsführer: Oliver Habisch, Carsten Kreutze Handelsregister: Amtsgericht Bonn HRB 20724 Sitz der Gesellschaft: Bonn; USt-ID-Nr.: DE294195993 Diese E-Mail enthält vertrauliche und/oder rechtlich geschützte Informationen. Wenn Sie nicht der richtige Adressat sind oder diese E-Mail irrtümlich erhalten haben, informieren Sie bitte sofort den Absender und löschen Sie diese Mail. Das unerlaubte Kopieren sowie die unbefugte Weitergabe dieser Mail und der darin enthaltenen Informationen ist nicht gestattet.

Erekle, For the time period you mentioned I do not see anything wrong on vdsm side except of a restart at 2017-10-15 16:28:50,993+0200. It looks like manual restart. The engine log starts at 2017-10-16 03:49:04,092+02 so not able to say whether there was anything else except of heartbeat issue caused by the restart. The restart was the cause of "connection reset by peer" on mom side. Thanks, Piotr On Mon, Oct 16, 2017 at 4:21 PM, Erekle Magradze <erekle.magradze@recogizer.de> wrote:
Hi Piotr,
Several times I've restarted vdsm daemon on certain nods, that could be the reason.
The failure, I've mentioned, has happened yesterday from 15:00 to 17:00
Cheers
Erekle
On 10/16/2017 04:13 PM, Piotr Kliczewski wrote:
Erekle,
In the logs you provided I see:
IOError: [Errno 5] _handleRequests._checkForMail - Could not read mailbox: /rhev/data-center/6d52512e-1c02-4509-880a-bf57cbad4bdf/mastersd/dom_md/inbox
and
StorageDomainMasterError: Error validating master storage domain: ('MD read error',)
which seems to be cause for vdsm being killed by sanlock which caused connection reset by peer.
After vdsm restart storage looks good.
@Nir can you take a look?
Thanks, Piotr
On Mon, Oct 16, 2017 at 3:59 PM, Erekle Magradze <erekle.magradze@recogizer.de> wrote:
Hi,
The issue is the following, after installation of ovirt 4.1 on three nodes with glusterFS as a storage, oVirt engine reported the failed events, with the following message
VDSM hostname command GetStatsVDS failed: Connection reset by peer
after that oVirt was trying to fence the affected host and it was excluded from production, luckily I am not running any VMs on it yet.
The logs are attached, don't be surprised with the hostnames :)
Thanks in advance
Cheers
Erekle
On 10/16/2017 03:37 PM, Dafna Ron wrote:
Hi,
Can you please tell us what is the issue that you are actually facing? :) it would be easier to debug an issue and not an error message that can be cause by several things.
Also, can you provide the engine and the vdsm logs?
thank you, Dafna
On 10/16/2017 02:30 PM, Erekle Magradze wrote:
It's was a typo in the failure message,
that's what I was getting:
VDSM hostname command GetStatsVDS failed: Connection reset by peer
On 10/16/2017 03:21 PM, Erekle Magradze wrote:
Hi,
It's getting clear now, indeed momd service is disabled
● momd.service - Memory Overcommitment Manager Daemon Loaded: loaded (/usr/lib/systemd/system/momd.service; static; vendor preset: disabled) Active: inactive (dead)
mom-vdsm is enable and running.
● mom-vdsm.service - MOM instance configured for VDSM purposes Loaded: loaded (/usr/lib/systemd/system/mom-vdsm.service; enabled; vendor preset: enabled) Active: active (running) since Mon 2017-10-16 15:14:35 CEST; 1min 3s ago Main PID: 27638 (python) CGroup: /system.slice/mom-vdsm.service └─27638 python /usr/sbin/momd -c /etc/vdsm/mom.conf
The reason why I came up with digging in mom problems is the following problem
VDSM hostname command GetStatsVDSThanks failed: Connection reset by peer
that is causing fencing of the node where the failure is happening, what could be the reason of GetStatsVDS failure?
Best Regards Erekle
On 10/16/2017 03:11 PM, Martin Sivak wrote:
Hi,
how do you start MOM? MOM is supposed to talk to vdsm, we do not talk to libvirt directly. The line you posted comes from vdsm and vdsm is telling you it can't talk to MOM.
Which MOM service is enabled? Because there are two momd and mom-vdsm, the second one is the one that should be enabled.
Best regards
Martin Sivak
On Mon, Oct 16, 2017 at 3:04 PM, Erekle Magradze <erekle.magradze@recogizer.de> wrote:
Hi Martin,
Thanks for the answer, unfortunately this warning message persists, does it mean that mom cannot communicate with libvirt? how critical is it?
Best
Erekle
On 10/16/2017 03:03 PM, Martin Sivak wrote:
Hi,
it is just a warning, there is nothing you have to solve unless it does not resolve itself within a minute or so. If it happens only once or twice after vdsm or mom restart then you are fine.
Best regards
-- Martin Sivak SLA / oVirt
On Mon, Oct 16, 2017 at 2:44 PM, Erekle Magradze <erekle.magradze@recogizer.de> wrote:
Hi,
after running
systemctl status vdsm I am getting that it's running and this message at the end.
Oct 16 14:26:52 hostname vdsmd[2392]: vdsm throttled WARN MOM not available. Oct 16 14:26:52 hostname vdsmd[2392]: vdsm throttled WARN MOM not available, KSM stats will be missing. Oct 16 14:26:57 hostname vdsmd[2392]: vdsm root WARN ping was deprecated in favor of ping2 and confirmConnectivity
how critical it is? and how to solve that warning?
I am using libvirt
Cheers
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
-- Recogizer Group GmbH
Dr.rer.nat. Erekle Magradze Lead Big Data Engineering & DevOps Rheinwerkallee 2, 53227 Bonn Tel: +49 228 29974555
E-Mail erekle.magradze@recogizer.de Web: www.recogizer.com
Recogizer auf LinkedIn https://www.linkedin.com/company-beta/10039182/ Folgen Sie uns auf Twitter https://twitter.com/recogizer
----------------------------------------------------------------- Recogizer Group GmbH Geschäftsführer: Oliver Habisch, Carsten Kreutze Handelsregister: Amtsgericht Bonn HRB 20724 Sitz der Gesellschaft: Bonn; USt-ID-Nr.: DE294195993
Diese E-Mail enthält vertrauliche und/oder rechtlich geschützte Informationen. Wenn Sie nicht der richtige Adressat sind oder diese E-Mail irrtümlich erhalten haben, informieren Sie bitte sofort den Absender und löschen Sie diese Mail. Das unerlaubte Kopieren sowie die unbefugte Weitergabe dieser Mail und der darin enthaltenen Informationen ist nicht gestattet.
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
-- Recogizer Group GmbH
Dr.rer.nat. Erekle Magradze Lead Big Data Engineering & DevOps Rheinwerkallee 2, 53227 Bonn Tel: +49 228 29974555
E-Mail erekle.magradze@recogizer.de Web: www.recogizer.com
Recogizer auf LinkedIn https://www.linkedin.com/company-beta/10039182/ Folgen Sie uns auf Twitter https://twitter.com/recogizer
----------------------------------------------------------------- Recogizer Group GmbH Geschäftsführer: Oliver Habisch, Carsten Kreutze Handelsregister: Amtsgericht Bonn HRB 20724 Sitz der Gesellschaft: Bonn; USt-ID-Nr.: DE294195993
Diese E-Mail enthält vertrauliche und/oder rechtlich geschützte Informationen. Wenn Sie nicht der richtige Adressat sind oder diese E-Mail irrtümlich erhalten haben, informieren Sie bitte sofort den Absender und löschen Sie diese Mail. Das unerlaubte Kopieren sowie die unbefugte Weitergabe dieser Mail und der darin enthaltenen Informationen ist nicht gestattet.
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
-- Recogizer Group GmbH
Dr.rer.nat. Erekle Magradze Lead Big Data Engineering & DevOps Rheinwerkallee 2, 53227 Bonn Tel: +49 228 29974555
E-Mail erekle.magradze@recogizer.de Web: www.recogizer.com Recogizer auf LinkedIn https://www.linkedin.com/company-beta/10039182/ Folgen Sie uns auf Twitter https://twitter.com/recogizer ----------------------------------------------------------------- Recogizer Group GmbH Geschäftsführer: Oliver Habisch, Carsten Kreutze Handelsregister: Amtsgericht Bonn HRB 20724 Sitz der Gesellschaft: Bonn; USt-ID-Nr.: DE294195993 Diese E-Mail enthält vertrauliche und/oder rechtlich geschützte Informationen. Wenn Sie nicht der richtige Adressat sind oder diese E-Mail irrtümlich erhalten haben, informieren Sie bitte sofort den Absender und löschen Sie diese Mail. Das unerlaubte Kopieren sowie die unbefugte Weitergabe dieser Mail und der darin enthaltenen Informationen ist nicht gestattet.

That's the problem, at that time nobody has restarted the server. Is there any scenario when the hypervisor is restarted by engine? Cheers Erekle On 10/16/2017 04:45 PM, Piotr Kliczewski wrote:
Erekle,
For the time period you mentioned I do not see anything wrong on vdsm side except of a restart at 2017-10-15 16:28:50,993+0200. It looks like manual restart. The engine log starts at 2017-10-16 03:49:04,092+02 so not able to say whether there was anything else except of heartbeat issue caused by the restart.
The restart was the cause of "connection reset by peer" on mom side.
Thanks, Piotr
On Mon, Oct 16, 2017 at 4:21 PM, Erekle Magradze <erekle.magradze@recogizer.de> wrote:
Hi Piotr,
Several times I've restarted vdsm daemon on certain nods, that could be the reason.
The failure, I've mentioned, has happened yesterday from 15:00 to 17:00
Cheers
Erekle
On 10/16/2017 04:13 PM, Piotr Kliczewski wrote:
Erekle,
In the logs you provided I see:
IOError: [Errno 5] _handleRequests._checkForMail - Could not read mailbox: /rhev/data-center/6d52512e-1c02-4509-880a-bf57cbad4bdf/mastersd/dom_md/inbox
and
StorageDomainMasterError: Error validating master storage domain: ('MD read error',)
which seems to be cause for vdsm being killed by sanlock which caused connection reset by peer.
After vdsm restart storage looks good.
@Nir can you take a look?
Thanks, Piotr
On Mon, Oct 16, 2017 at 3:59 PM, Erekle Magradze <erekle.magradze@recogizer.de> wrote:
Hi,
The issue is the following, after installation of ovirt 4.1 on three nodes with glusterFS as a storage, oVirt engine reported the failed events, with the following message
VDSM hostname command GetStatsVDS failed: Connection reset by peer
after that oVirt was trying to fence the affected host and it was excluded from production, luckily I am not running any VMs on it yet.
The logs are attached, don't be surprised with the hostnames :)
Thanks in advance
Cheers
Erekle
On 10/16/2017 03:37 PM, Dafna Ron wrote:
Hi,
Can you please tell us what is the issue that you are actually facing? :) it would be easier to debug an issue and not an error message that can be cause by several things.
Also, can you provide the engine and the vdsm logs?
thank you, Dafna
On 10/16/2017 02:30 PM, Erekle Magradze wrote:
It's was a typo in the failure message,
that's what I was getting:
VDSM hostname command GetStatsVDS failed: Connection reset by peer
On 10/16/2017 03:21 PM, Erekle Magradze wrote:
Hi,
It's getting clear now, indeed momd service is disabled
● momd.service - Memory Overcommitment Manager Daemon Loaded: loaded (/usr/lib/systemd/system/momd.service; static; vendor preset: disabled) Active: inactive (dead)
mom-vdsm is enable and running.
● mom-vdsm.service - MOM instance configured for VDSM purposes Loaded: loaded (/usr/lib/systemd/system/mom-vdsm.service; enabled; vendor preset: enabled) Active: active (running) since Mon 2017-10-16 15:14:35 CEST; 1min 3s ago Main PID: 27638 (python) CGroup: /system.slice/mom-vdsm.service └─27638 python /usr/sbin/momd -c /etc/vdsm/mom.conf
The reason why I came up with digging in mom problems is the following problem
VDSM hostname command GetStatsVDSThanks failed: Connection reset by peer
that is causing fencing of the node where the failure is happening, what could be the reason of GetStatsVDS failure?
Best Regards Erekle
On 10/16/2017 03:11 PM, Martin Sivak wrote:
Hi,
how do you start MOM? MOM is supposed to talk to vdsm, we do not talk to libvirt directly. The line you posted comes from vdsm and vdsm is telling you it can't talk to MOM.
Which MOM service is enabled? Because there are two momd and mom-vdsm, the second one is the one that should be enabled.
Best regards
Martin Sivak
On Mon, Oct 16, 2017 at 3:04 PM, Erekle Magradze <erekle.magradze@recogizer.de> wrote:
Hi Martin,
Thanks for the answer, unfortunately this warning message persists, does it mean that mom cannot communicate with libvirt? how critical is it?
Best
Erekle
On 10/16/2017 03:03 PM, Martin Sivak wrote:
Hi,
it is just a warning, there is nothing you have to solve unless it does not resolve itself within a minute or so. If it happens only once or twice after vdsm or mom restart then you are fine.
Best regards
-- Martin Sivak SLA / oVirt
On Mon, Oct 16, 2017 at 2:44 PM, Erekle Magradze <erekle.magradze@recogizer.de> wrote:
Hi,
after running
systemctl status vdsm I am getting that it's running and this message at the end.
Oct 16 14:26:52 hostname vdsmd[2392]: vdsm throttled WARN MOM not available. Oct 16 14:26:52 hostname vdsmd[2392]: vdsm throttled WARN MOM not available, KSM stats will be missing. Oct 16 14:26:57 hostname vdsmd[2392]: vdsm root WARN ping was deprecated in favor of ping2 and confirmConnectivity
how critical it is? and how to solve that warning?
I am using libvirt
Cheers
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
-- Recogizer Group GmbH
Dr.rer.nat. Erekle Magradze Lead Big Data Engineering & DevOps Rheinwerkallee 2, 53227 Bonn Tel: +49 228 29974555
E-Mail erekle.magradze@recogizer.de Web: www.recogizer.com
Recogizer auf LinkedIn https://www.linkedin.com/company-beta/10039182/ Folgen Sie uns auf Twitter https://twitter.com/recogizer
----------------------------------------------------------------- Recogizer Group GmbH Geschäftsführer: Oliver Habisch, Carsten Kreutze Handelsregister: Amtsgericht Bonn HRB 20724 Sitz der Gesellschaft: Bonn; USt-ID-Nr.: DE294195993
Diese E-Mail enthält vertrauliche und/oder rechtlich geschützte Informationen. Wenn Sie nicht der richtige Adressat sind oder diese E-Mail irrtümlich erhalten haben, informieren Sie bitte sofort den Absender und löschen Sie diese Mail. Das unerlaubte Kopieren sowie die unbefugte Weitergabe dieser Mail und der darin enthaltenen Informationen ist nicht gestattet.
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
-- Recogizer Group GmbH
Dr.rer.nat. Erekle Magradze Lead Big Data Engineering & DevOps Rheinwerkallee 2, 53227 Bonn Tel: +49 228 29974555
E-Mail erekle.magradze@recogizer.de Web: www.recogizer.com
Recogizer auf LinkedIn https://www.linkedin.com/company-beta/10039182/ Folgen Sie uns auf Twitter https://twitter.com/recogizer
----------------------------------------------------------------- Recogizer Group GmbH Geschäftsführer: Oliver Habisch, Carsten Kreutze Handelsregister: Amtsgericht Bonn HRB 20724 Sitz der Gesellschaft: Bonn; USt-ID-Nr.: DE294195993
Diese E-Mail enthält vertrauliche und/oder rechtlich geschützte Informationen. Wenn Sie nicht der richtige Adressat sind oder diese E-Mail irrtümlich erhalten haben, informieren Sie bitte sofort den Absender und löschen Sie diese Mail. Das unerlaubte Kopieren sowie die unbefugte Weitergabe dieser Mail und der darin enthaltenen Informationen ist nicht gestattet.
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
-- Recogizer Group GmbH
Dr.rer.nat. Erekle Magradze Lead Big Data Engineering & DevOps Rheinwerkallee 2, 53227 Bonn Tel: +49 228 29974555
E-Mail erekle.magradze@recogizer.de Web: www.recogizer.com Recogizer auf LinkedIn https://www.linkedin.com/company-beta/10039182/ Folgen Sie uns auf Twitter https://twitter.com/recogizer ----------------------------------------------------------------- Recogizer Group GmbH Geschäftsführer: Oliver Habisch, Carsten Kreutze Handelsregister: Amtsgericht Bonn HRB 20724 Sitz der Gesellschaft: Bonn; USt-ID-Nr.: DE294195993 Diese E-Mail enthält vertrauliche und/oder rechtlich geschützte Informationen. Wenn Sie nicht der richtige Adressat sind oder diese E-Mail irrtümlich erhalten haben, informieren Sie bitte sofort den Absender und löschen Sie diese Mail. Das unerlaubte Kopieren sowie die unbefugte Weitergabe dieser Mail und der darin enthaltenen Informationen ist nicht gestattet.
-- Recogizer Group GmbH Dr.rer.nat. Erekle Magradze Lead Big Data Engineering & DevOps Rheinwerkallee 2, 53227 Bonn Tel: +49 228 29974555 E-Mail erekle.magradze@recogizer.de Web: www.recogizer.com Recogizer auf LinkedIn https://www.linkedin.com/company-beta/10039182/ Folgen Sie uns auf Twitter https://twitter.com/recogizer ----------------------------------------------------------------- Recogizer Group GmbH Geschäftsführer: Oliver Habisch, Carsten Kreutze Handelsregister: Amtsgericht Bonn HRB 20724 Sitz der Gesellschaft: Bonn; USt-ID-Nr.: DE294195993 Diese E-Mail enthält vertrauliche und/oder rechtlich geschützte Informationen. Wenn Sie nicht der richtige Adressat sind oder diese E-Mail irrtümlich erhalten haben, informieren Sie bitte sofort den Absender und löschen Sie diese Mail. Das unerlaubte Kopieren sowie die unbefugte Weitergabe dieser Mail und der darin enthaltenen Informationen ist nicht gestattet.

On Mon, Oct 16, 2017 at 4:51 PM, Erekle Magradze <erekle.magradze@recogizer.de> wrote:
That's the problem, at that time nobody has restarted the server.
Please provide engine log from this time so we could see whether it was trigger by it.
Is there any scenario when the hypervisor is restarted by engine?
Cheers
Erekle
On 10/16/2017 04:45 PM, Piotr Kliczewski wrote:
Erekle,
For the time period you mentioned I do not see anything wrong on vdsm side except of a restart at 2017-10-15 16:28:50,993+0200. It looks like manual restart. The engine log starts at 2017-10-16 03:49:04,092+02 so not able to say whether there was anything else except of heartbeat issue caused by the restart.
The restart was the cause of "connection reset by peer" on mom side.
Thanks, Piotr
On Mon, Oct 16, 2017 at 4:21 PM, Erekle Magradze <erekle.magradze@recogizer.de> wrote:
Hi Piotr,
Several times I've restarted vdsm daemon on certain nods, that could be the reason.
The failure, I've mentioned, has happened yesterday from 15:00 to 17:00
Cheers
Erekle
On 10/16/2017 04:13 PM, Piotr Kliczewski wrote:
Erekle,
In the logs you provided I see:
IOError: [Errno 5] _handleRequests._checkForMail - Could not read mailbox:
/rhev/data-center/6d52512e-1c02-4509-880a-bf57cbad4bdf/mastersd/dom_md/inbox
and
StorageDomainMasterError: Error validating master storage domain: ('MD read error',)
which seems to be cause for vdsm being killed by sanlock which caused connection reset by peer.
After vdsm restart storage looks good.
@Nir can you take a look?
Thanks, Piotr
On Mon, Oct 16, 2017 at 3:59 PM, Erekle Magradze <erekle.magradze@recogizer.de> wrote:
Hi,
The issue is the following, after installation of ovirt 4.1 on three nodes with glusterFS as a storage, oVirt engine reported the failed events, with the following message
VDSM hostname command GetStatsVDS failed: Connection reset by peer
after that oVirt was trying to fence the affected host and it was excluded from production, luckily I am not running any VMs on it yet.
The logs are attached, don't be surprised with the hostnames :)
Thanks in advance
Cheers
Erekle
On 10/16/2017 03:37 PM, Dafna Ron wrote:
Hi,
Can you please tell us what is the issue that you are actually facing? :) it would be easier to debug an issue and not an error message that can be cause by several things.
Also, can you provide the engine and the vdsm logs?
thank you, Dafna
On 10/16/2017 02:30 PM, Erekle Magradze wrote:
It's was a typo in the failure message,
that's what I was getting:
VDSM hostname command GetStatsVDS failed: Connection reset by peer
On 10/16/2017 03:21 PM, Erekle Magradze wrote:
Hi,
It's getting clear now, indeed momd service is disabled
● momd.service - Memory Overcommitment Manager Daemon Loaded: loaded (/usr/lib/systemd/system/momd.service; static; vendor preset: disabled) Active: inactive (dead)
mom-vdsm is enable and running.
● mom-vdsm.service - MOM instance configured for VDSM purposes Loaded: loaded (/usr/lib/systemd/system/mom-vdsm.service; enabled; vendor preset: enabled) Active: active (running) since Mon 2017-10-16 15:14:35 CEST; 1min 3s ago Main PID: 27638 (python) CGroup: /system.slice/mom-vdsm.service └─27638 python /usr/sbin/momd -c /etc/vdsm/mom.conf
The reason why I came up with digging in mom problems is the following problem
VDSM hostname command GetStatsVDSThanks failed: Connection reset by peer
that is causing fencing of the node where the failure is happening, what could be the reason of GetStatsVDS failure?
Best Regards Erekle
On 10/16/2017 03:11 PM, Martin Sivak wrote:
Hi,
how do you start MOM? MOM is supposed to talk to vdsm, we do not talk to libvirt directly. The line you posted comes from vdsm and vdsm is telling you it can't talk to MOM.
Which MOM service is enabled? Because there are two momd and mom-vdsm, the second one is the one that should be enabled.
Best regards
Martin Sivak
On Mon, Oct 16, 2017 at 3:04 PM, Erekle Magradze <erekle.magradze@recogizer.de> wrote:
Hi Martin,
Thanks for the answer, unfortunately this warning message persists, does it mean that mom cannot communicate with libvirt? how critical is it?
Best
Erekle
On 10/16/2017 03:03 PM, Martin Sivak wrote:
Hi,
it is just a warning, there is nothing you have to solve unless it does not resolve itself within a minute or so. If it happens only once or twice after vdsm or mom restart then you are fine.
Best regards
-- Martin Sivak SLA / oVirt
On Mon, Oct 16, 2017 at 2:44 PM, Erekle Magradze <erekle.magradze@recogizer.de> wrote:
Hi,
after running
systemctl status vdsm I am getting that it's running and this message at the end.
Oct 16 14:26:52 hostname vdsmd[2392]: vdsm throttled WARN MOM not available. Oct 16 14:26:52 hostname vdsmd[2392]: vdsm throttled WARN MOM not available, KSM stats will be missing. Oct 16 14:26:57 hostname vdsmd[2392]: vdsm root WARN ping was deprecated in favor of ping2 and confirmConnectivity
how critical it is? and how to solve that warning?
I am using libvirt
Cheers
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
-- Recogizer Group GmbH
Dr.rer.nat. Erekle Magradze Lead Big Data Engineering & DevOps Rheinwerkallee 2, 53227 Bonn Tel: +49 228 29974555
E-Mail erekle.magradze@recogizer.de Web: www.recogizer.com
Recogizer auf LinkedIn https://www.linkedin.com/company-beta/10039182/ Folgen Sie uns auf Twitter https://twitter.com/recogizer
----------------------------------------------------------------- Recogizer Group GmbH Geschäftsführer: Oliver Habisch, Carsten Kreutze Handelsregister: Amtsgericht Bonn HRB 20724 Sitz der Gesellschaft: Bonn; USt-ID-Nr.: DE294195993
Diese E-Mail enthält vertrauliche und/oder rechtlich geschützte Informationen. Wenn Sie nicht der richtige Adressat sind oder diese E-Mail irrtümlich erhalten haben, informieren Sie bitte sofort den Absender und löschen Sie diese Mail. Das unerlaubte Kopieren sowie die unbefugte Weitergabe dieser Mail und der darin enthaltenen Informationen ist nicht gestattet.
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
-- Recogizer Group GmbH
Dr.rer.nat. Erekle Magradze Lead Big Data Engineering & DevOps Rheinwerkallee 2, 53227 Bonn Tel: +49 228 29974555
E-Mail erekle.magradze@recogizer.de Web: www.recogizer.com
Recogizer auf LinkedIn https://www.linkedin.com/company-beta/10039182/ Folgen Sie uns auf Twitter https://twitter.com/recogizer
----------------------------------------------------------------- Recogizer Group GmbH Geschäftsführer: Oliver Habisch, Carsten Kreutze Handelsregister: Amtsgericht Bonn HRB 20724 Sitz der Gesellschaft: Bonn; USt-ID-Nr.: DE294195993
Diese E-Mail enthält vertrauliche und/oder rechtlich geschützte Informationen. Wenn Sie nicht der richtige Adressat sind oder diese E-Mail irrtümlich erhalten haben, informieren Sie bitte sofort den Absender und löschen Sie diese Mail. Das unerlaubte Kopieren sowie die unbefugte Weitergabe dieser Mail und der darin enthaltenen Informationen ist nicht gestattet.
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
-- Recogizer Group GmbH
Dr.rer.nat. Erekle Magradze Lead Big Data Engineering & DevOps Rheinwerkallee 2, 53227 Bonn Tel: +49 228 29974555
E-Mail erekle.magradze@recogizer.de Web: www.recogizer.com Recogizer auf LinkedIn https://www.linkedin.com/company-beta/10039182/ Folgen Sie uns auf Twitter https://twitter.com/recogizer ----------------------------------------------------------------- Recogizer Group GmbH Geschäftsführer: Oliver Habisch, Carsten Kreutze Handelsregister: Amtsgericht Bonn HRB 20724 Sitz der Gesellschaft: Bonn; USt-ID-Nr.: DE294195993 Diese E-Mail enthält vertrauliche und/oder rechtlich geschützte Informationen. Wenn Sie nicht der richtige Adressat sind oder diese E-Mail irrtümlich erhalten haben, informieren Sie bitte sofort den Absender und löschen Sie diese Mail. Das unerlaubte Kopieren sowie die unbefugte Weitergabe dieser Mail und der darin enthaltenen Informationen ist nicht gestattet.
-- Recogizer Group GmbH
Dr.rer.nat. Erekle Magradze Lead Big Data Engineering & DevOps Rheinwerkallee 2, 53227 Bonn Tel: +49 228 29974555
E-Mail erekle.magradze@recogizer.de Web: www.recogizer.com Recogizer auf LinkedIn https://www.linkedin.com/company-beta/10039182/ Folgen Sie uns auf Twitter https://twitter.com/recogizer ----------------------------------------------------------------- Recogizer Group GmbH Geschäftsführer: Oliver Habisch, Carsten Kreutze Handelsregister: Amtsgericht Bonn HRB 20724 Sitz der Gesellschaft: Bonn; USt-ID-Nr.: DE294195993 Diese E-Mail enthält vertrauliche und/oder rechtlich geschützte Informationen. Wenn Sie nicht der richtige Adressat sind oder diese E-Mail irrtümlich erhalten haben, informieren Sie bitte sofort den Absender und löschen Sie diese Mail. Das unerlaubte Kopieren sowie die unbefugte Weitergabe dieser Mail und der darin enthaltenen Informationen ist nicht gestattet.
participants (4)
-
Dafna Ron
-
Erekle Magradze
-
Martin Sivak
-
Piotr Kliczewski