<div dir="ltr"><div>thanks a lot for your help!<br><br></div></div><div class="gmail_extra"><br><div class="gmail_quote">2016-12-13 12:07 GMT-03:00 Yedidyah Bar David <span dir="ltr"><<a href="mailto:didi@redhat.com" target="_blank">didi@redhat.com</a>></span>:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><span class="">On Tue, Dec 13, 2016 at 4:58 PM, Juan Pablo <<a href="mailto:pablo.localhost@gmail.com">pablo.localhost@gmail.com</a>> wrote:<br>
> Thanks for pointing me in the right direction. I see this line a couple of<br>
> minutes before the VM restart:<br>
> ":states::128::ovirt_hosted_<wbr>engine_ha.agent.hosted_engine.<wbr>HostedEngine::(score)<br>
> Penalizing score by 1600 due to gateway status"<br>
> so it looks like this is causing:<br>
> states::413::ovirt_hosted_<wbr>engine_ha.agent.hosted_engine.<wbr>HostedEngine::(consume)<br>
> Host virt01-int.xxxx.xxxxxx (id 1) score is significantly better than local<br>
> score, shutting down VM on this host<br>
> Is this a network-related issue? The hosted engine and the hosts are on the same<br>
> VLAN. Should a gateway check trigger a hosted engine shutdown?<br>
<br>
</span>Seems so.<br>
<br>
Pinging the gateway is an important test, because a failure might<br>
mean a split-brain.<br>
When you are asked for a 'gateway address', it is actually used only for that test.<br>
It does not need to be your real gateway, but it does need to be something very<br>
reliable that always replies.<br>
<br>
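To see how often this happens, you could scan the agent log for the penalty message quoted above.<br>
This is only a rough sketch: the regular expression is based on the single line pasted in this thread,<br>
and find_penalties is a made-up helper name, not part of ovirt-hosted-engine-ha.<br>
<pre>
```python
import re

# Pattern derived from the one log line quoted in this thread, e.g.
# "...HostedEngine::(score) Penalizing score by 1600 due to gateway status"
# Other penalty messages may be worded differently (assumption).
PENALTY_RE = re.compile(r"Penalizing score by (\d+) due to (.+?)\s*$")


def find_penalties(lines):
    """Return a list of (line_number, points, reason) for each penalty line."""
    hits = []
    for number, line in enumerate(lines, start=1):
        match = PENALTY_RE.search(line)
        if match:
            hits.append((number, int(match.group(1)), match.group(2)))
    return hits
```
</pre>
To run it on a host, pass an open file as the argument, for example open("/var/log/ovirt-hosted-engine-ha/agent.log").<br>
<br>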
Best,<br>
<div class="HOEnZb"><div class="h5"><br>
><br>
><br>
> thanks!<br>
> JP<br>
><br>
> 2016-12-13 11:37 GMT-03:00 Yedidyah Bar David <<a href="mailto:didi@redhat.com">didi@redhat.com</a>>:<br>
>><br>
>> On Tue, Dec 13, 2016 at 4:34 PM, Juan Pablo <<a href="mailto:pablo.localhost@gmail.com">pablo.localhost@gmail.com</a>><br>
>> wrote:<br>
>> > Hi guys,<br>
>> > I have oVirt 4.0.5 with 3 hosts and 1 storage setup, using iSCSI for<br>
>> > data<br>
>> > and NFS for hosted engine storage.<br>
>> > The storage network is on a private VLAN.<br>
>> > Sometimes I see "ETL service stopped" / "ETL service started" in the events<br>
>> > log,<br>
>> > side by side with a hosted engine stop/start.<br>
>> > Also, sometimes I get kicked out of the admin portal for no apparent reason.<br>
>> > I had another issue which was related to<br>
>> > <a href="https://bugzilla.redhat.com/show_bug.cgi?id=1349829" rel="noreferrer" target="_blank">https://bugzilla.redhat.com/<wbr>show_bug.cgi?id=1349829</a> but it looks like it's<br>
>> > harmless, so maybe I'm not seeing the real problem.<br>
>> ><br>
>> > can you please guide me on finding the issue here?<br>
>><br>
>> You should start by checking: /var/log/ovirt-hosted-engine-<wbr>ha/agent.log.<br>
>><br>
>> Best,<br>
>><br>
>> ><br>
>> > best regards,<br>
>> > JP<br>
>> ><br>
>> > ______________________________<wbr>_________________<br>
>> > Users mailing list<br>
>> > <a href="mailto:Users@ovirt.org">Users@ovirt.org</a><br>
>> > <a href="http://lists.phx.ovirt.org/mailman/listinfo/users" rel="noreferrer" target="_blank">http://lists.phx.ovirt.org/<wbr>mailman/listinfo/users</a><br>
>> ><br>
>><br>
>><br>
>><br>
>> --<br>
>> Didi<br>
><br>
><br>
<br>
<br>
<br>
</div></div><span class="HOEnZb"><font color="#888888">--<br>
Didi<br>
</font></span></blockquote></div><br></div>