<div dir="ltr">Hi ,<div><br></div><div>  Can you please check the following. Following could be one of the reason why HE vm restarts every minute.</div><div><br></div><div><span id="gmail-docs-internal-guid-2d16663b-4f2a-4254-bd1a-13721a47c932"><p dir="ltr" style="line-height:1.38;margin-top:0pt;margin-bottom:0pt"><span style="font-size:11pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;vertical-align:baseline;white-space:pre-wrap">Check the error or engine health state. If it’s to do with Liveliness check, then this is mostly an issue connecting to engine.</span></p><p dir="ltr" style="line-height:1.38;margin-top:0pt;margin-bottom:0pt"><span style="font-size:11pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;vertical-align:baseline;white-space:pre-wrap"> - Check if engine FQDN is reachable from all hosts</span></p><p dir="ltr" style="line-height:1.38;margin-top:0pt;margin-bottom:0pt"><span style="font-size:11pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;vertical-align:baseline;white-space:pre-wrap">-  curl -v http://&lt;engine-fqdn&gt;/ovirt-engine/services/health - does this return ok?</span></p><p dir="ltr" style="line-height:1.38;margin-top:0pt;margin-bottom:0pt"><span style="font-size:11pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;vertical-align:baseline;white-space:pre-wrap"> - Access the HE console and check if ovirt-engine is running. </span></p><p dir="ltr" style="line-height:1.38;margin-top:0pt;margin-bottom:0pt"><span style="font-size:11pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;vertical-align:baseline;white-space:pre-wrap">- Check /var/log/ovirt-engine/server.log or /var/log/ovirt-engine/engine.log if there are errors starting ovirt-engine</span></p><p dir="ltr" style="line-height:1.38;margin-top:0pt;margin-bottom:0pt"><span style="font-size:11pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;vertical-align:baseline;white-space:pre-wrap"><br></span></p><p style="line-height:1.38;margin-top:0pt;margin-bottom:0pt"><span style="font-size:11pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;vertical-align:baseline;white-space:pre-wrap">Thanks</span></p><p style="line-height:1.38;margin-top:0pt;margin-bottom:0pt"><span style="font-size:11pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;vertical-align:baseline;white-space:pre-wrap">kasturi</span></p><div><span style="font-size:11pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;vertical-align:baseline;white-space:pre-wrap"><br></span></div></span></div></div><div class="gmail_extra"><br><div class="gmail_quote">On Fri, Jul 14, 2017 at 10:28 PM, Sven Achtelik <span dir="ltr">&lt;<a href="mailto:Sven.Achtelik@eps.aero" target="_blank">Sven.Achtelik@eps.aero</a>&gt;</span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div lang="DE" link="#0563C1" vlink="#954F72"><div class="m_-7803682938837682244WordSection1"><p class="MsoNormal">Hi All, <u></u><u></u></p><p class="MsoNormal"><u></u> <u></u></p><p class="MsoNormal"><span lang="EN-US">after running solid for several month my ovirt-engine started rebooting on several hosts. I’ve looked into the hostend-engine –vm-status and it sees that the engine is up on one host but not reachable. At the same time I can access the gui and everything is working fine. After some time the engine is shutting down and all hosts are trying to start the engine until one is the winner, at least it looks like this. Any clues where to look at and find the issue with the liveliness check ? <u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US"><u></u> <u></u></span></p><p class="MsoNormal"><span lang="EN-US">------------------------------<wbr>------------------------------<wbr>------------------------------<wbr>--------------<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US"><u></u> <u></u></span></p><p class="MsoNormal"><span lang="EN-US">--== Host 1 status ==--<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US"><u></u> <u></u></span></p><p class="MsoNormal"><span lang="EN-US">conf_on_shared_storage             : True<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">Status up-to-date                  : True<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">Hostname                      <wbr>     : ovirt-node01<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">Host ID                            : 1<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">Engine status                      : {&quot;reason&quot;: &quot;vm not running on this host&quot;, &quot;health&quot;: &quot;bad&quot;, &quot;vm&quot;: &quot;down&quot;, &quot;detail&quot;: &quot;unknown&quot;}<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">Score                         <wbr>     : 3400<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">stopped                       <wbr>     : False<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">Local maintenance                  : False<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">crc32                         <wbr>     : 3eb33843<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">local_conf_timestamp          <wbr>     : 17128<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">Host timestamp                     : 17113<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">Extra metadata (valid at timestamp):<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">        metadata_parse_version=1<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">        metadata_feature_version=1<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">        timestamp=17113 (Fri Jul 14 11:50:23 2017)<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">        host-id=1<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">        score=3400<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">        vm_conf_refresh_time=17128 (Fri Jul 14 11:50:38 2017)<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">        conf_on_shared_storage=True<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">        maintenance=False<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">        state=EngineDown<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">        stopped=False<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US"><u></u> <u></u></span></p><p class="MsoNormal"><span lang="EN-US"><u></u> <u></u></span></p><p class="MsoNormal"><span lang="EN-US">--== Host 2 status ==--<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US"><u></u> <u></u></span></p><p class="MsoNormal"><span lang="EN-US">conf_on_shared_storage        <wbr>     : True<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">Status up-to-date                  : True<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">Hostname                      <wbr>     : ovirt-node02.mgmt.lan<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">Host ID                            : 2<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">Engine status                      : {&quot;reason&quot;: &quot;failed liveliness check&quot;, &quot;health&quot;: &quot;bad&quot;, &quot;vm&quot;: &quot;up&quot;, &quot;detail&quot;: &quot;up&quot;}<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">Score                         <wbr>     : 3400<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">stopped                       <wbr>     : False<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">Local maintenance                  : False<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">crc32                         <wbr>     : 2a8c86cc<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">local_conf_timestamp          <wbr>     : 523182<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">Host timestamp                     : 523167<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">Extra metadata (valid at timestamp):<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">        metadata_parse_version=1<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">        metadata_feature_version=1<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">        timestamp=523167 (Fri Jul 14 11:50:25 2017)<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">        host-id=2<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">        score=3400<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">        vm_conf_refresh_time=523182 (Fri Jul 14 11:50:40 2017)<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">        conf_on_shared_storage=True<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">        maintenance=False<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">        state=EngineStarting<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">        stopped=False<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US"><u></u> <u></u></span></p><p class="MsoNormal"><span lang="EN-US"><u></u> <u></u></span></p><p class="MsoNormal"><span lang="EN-US">--== Host 3 status ==--<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US"><u></u> <u></u></span></p><p class="MsoNormal"><span lang="EN-US">conf_on_shared_storage        <wbr>     : True<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">Status up-to-date                  : True<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">Hostname                      <wbr>     : ovirt-node03.mgmt.lan<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">Host ID                            : 3<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">Engine status                      : {&quot;reason&quot;: &quot;vm not running on this host&quot;, &quot;health&quot;: &quot;bad&quot;, &quot;vm&quot;: &quot;down&quot;, &quot;detail&quot;: &quot;unknown&quot;}<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">Score                              : 3400<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">stopped                       <wbr>     : False<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">Local maintenance                  : False<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">crc32                         <wbr>     : f8490d79<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">local_conf_timestamp          <wbr>     : 527698<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">Host timestamp                     : 527683<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">Extra metadata (valid at timestamp):<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">        metadata_parse_version=1<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">        metadata_feature_version=1<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">        timestamp=527683 (Fri Jul 14 11:50:33 2017)<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">        host-id=3<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">        score=3400<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">        vm_conf_refresh_time=527698 (Fri Jul 14 11:50:47 2017)<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">        conf_on_shared_storage=True<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">        maintenance=False<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">        state=EngineDown<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">        stopped=False<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US"><u></u> <u></u></span></p><p class="MsoNormal"><span lang="EN-US">------------------------------<wbr>------------------------------<wbr>------------------------------<wbr>----<u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">Thank you, <u></u><u></u></span></p><p class="MsoNormal"><span lang="EN-US">Sven <u></u><u></u></span></p></div></div><br>______________________________<wbr>_________________<br>
Users mailing list<br>
<a href="mailto:Users@ovirt.org">Users@ovirt.org</a><br>
<a href="http://lists.ovirt.org/mailman/listinfo/users" rel="noreferrer" target="_blank">http://lists.ovirt.org/<wbr>mailman/listinfo/users</a><br>
<br></blockquote></div><br></div>