<div dir="ltr"><div class="gmail_quote"><div dir="ltr">Thanks Martin.<div><br></div><div>As you suggested I updated hosted-engine.conf with correct host_id values and restarted ovirt-ha-agent services on both hosts and now I run into the problem with status "unknown-stale-data" :(<br>And second host still doesn't looks as capable to run HE.</div><div><br></div><div>Should I stop HE VM, bring down ovirt-ha-agents and reinitialize-lockspace and start ovirt-ha-agents again?</div><div><br></div><div>Regards,</div><div>Artem<br><br><br></div></div><div class="HOEnZb"><div class="h5"><div class="gmail_extra"><br><div class="gmail_quote">On Mon, Feb 19, 2018 at 6:45 PM, Martin Sivak <span dir="ltr"><<a href="mailto:msivak@redhat.com" target="_blank">msivak@redhat.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">Hi Artem,<br>
<br>
just a restart of ovirt-ha-agent services should be enough.<br>
<br>
Best regards<br>
<br>
Martin Sivak<br>
<br>
On Mon, Feb 19, 2018 at 4:40 PM, Artem Tambovskiy<br>
<div class="m_-4283445358333830619HOEnZb"><div class="m_-4283445358333830619h5"><<a href="mailto:artem.tambovskiy@gmail.com" target="_blank">artem.tambovskiy@gmail.com</a>> wrote:<br>
> Ok, understood.<br>
> Once I set correct host_id on both hosts how to take changes in force? With<br>
> minimal downtime? Or i need reboot both hosts anyway?<br>
><br>
> Regards,<br>
> Artem<br>
><br>
> 19 февр. 2018 г. 18:18 пользователь "Simone Tiraboschi"<br>
> <<a href="mailto:stirabos@redhat.com" target="_blank">stirabos@redhat.com</a>> написал:<br>
><br>
>><br>
>><br>
>> On Mon, Feb 19, 2018 at 4:12 PM, Artem Tambovskiy<br>
>> <<a href="mailto:artem.tambovskiy@gmail.com" target="_blank">artem.tambovskiy@gmail.com</a>> wrote:<br>
>>><br>
>>><br>
>>> Thanks a lot, Simone!<br>
>>><br>
>>> This is clearly shows a problem:<br>
>>><br>
>>> [root@ov-eng ovirt-engine]# sudo -u postgres psql -d engine -c 'select<br>
>>> vds_name, vds_spm_id from vds'<br>
>>> vds_name | vds_spm_id<br>
>>> -----------------+------------<br>
>>> ovirt1.local | 2<br>
>>> ovirt2.local | 1<br>
>>> (2 rows)<br>
>>><br>
>>> While hosted-engine.conf on ovirt1.local have host_id=1, and ovirt2.local<br>
>>> host_id=2. So totally opposite values.<br>
>>> So how to get this fixed in the simple way? Update the engine DB?<br>
>><br>
>><br>
>> I'd suggest to manually fix /etc/ovirt-hosted-engine/hoste<wbr>d-engine.conf on<br>
>> both the hosts<br>
>><br>
>>><br>
>>><br>
>>> Regards,<br>
>>> Artem<br>
>>><br>
>>> On Mon, Feb 19, 2018 at 5:37 PM, Simone Tiraboschi <<a href="mailto:stirabos@redhat.com" target="_blank">stirabos@redhat.com</a>><br>
>>> wrote:<br>
>>>><br>
>>>><br>
>>>><br>
>>>> On Mon, Feb 19, 2018 at 12:13 PM, Artem Tambovskiy<br>
>>>> <<a href="mailto:artem.tambovskiy@gmail.com" target="_blank">artem.tambovskiy@gmail.com</a>> wrote:<br>
>>>>><br>
>>>>> Hello,<br>
>>>>><br>
>>>>> Last weekend my cluster suffered form a massive power outage due to<br>
>>>>> human mistake.<br>
>>>>> I'm using SHE setup with Gluster, I managed to bring the cluster up<br>
>>>>> quickly, but once again I have a problem with duplicated host_id<br>
>>>>> (<a href="https://bugzilla.redhat.com/show_bug.cgi?id=1543988" rel="noreferrer" target="_blank">https://bugzilla.redhat.com/s<wbr>how_bug.cgi?id=1543988</a>) on second host and due<br>
>>>>> to this second host is not capable to run HE.<br>
>>>>><br>
>>>>> I manually updated file hosted_engine.conf with correct host_id and<br>
>>>>> restarted agent & broker - no effect. Than I rebooted the host itself -<br>
>>>>> still no changes. How to fix this issue?<br>
>>>><br>
>>>><br>
>>>> I'd suggest to run this command on the engine VM:<br>
>>>> sudo -u postgres scl enable rh-postgresql95 -- psql -d engine -c<br>
>>>> 'select vds_name, vds_spm_id from vds'<br>
>>>> (just sudo -u postgres psql -d engine -c 'select vds_name, vds_spm_id<br>
>>>> from vds' if still on 4.1) and check<br>
>>>> /etc/ovirt-hosted-engine/hoste<wbr>d-engine.conf on all the involved host.<br>
>>>> Maybe you can also have a leftover configuration file on undeployed<br>
>>>> host.<br>
>>>><br>
>>>> When you find a conflict you should manually bring down sanlock<br>
>>>> In doubt a reboot of both the hosts will solve for sure.<br>
>>>><br>
>>>><br>
>>>>><br>
>>>>><br>
>>>>> Regards,<br>
>>>>> Artem<br>
>>>>><br>
>>>>> ______________________________<wbr>_________________<br>
>>>>> Users mailing list<br>
>>>>> <a href="mailto:Users@ovirt.org" target="_blank">Users@ovirt.org</a><br>
>>>>> <a href="http://lists.ovirt.org/mailman/listinfo/users" rel="noreferrer" target="_blank">http://lists.ovirt.org/mailman<wbr>/listinfo/users</a><br>
>>>>><br>
>>>><br>
>>><br>
>>><br>
>>><br>
>>> ______________________________<wbr>_________________<br>
>>> Users mailing list<br>
>>> <a href="mailto:Users@ovirt.org" target="_blank">Users@ovirt.org</a><br>
>>> <a href="http://lists.ovirt.org/mailman/listinfo/users" rel="noreferrer" target="_blank">http://lists.ovirt.org/mailman<wbr>/listinfo/users</a><br>
>>><br>
>><br>
><br>
> ______________________________<wbr>_________________<br>
> Users mailing list<br>
> <a href="mailto:Users@ovirt.org" target="_blank">Users@ovirt.org</a><br>
> <a href="http://lists.ovirt.org/mailman/listinfo/users" rel="noreferrer" target="_blank">http://lists.ovirt.org/mailman<wbr>/listinfo/users</a><br>
><br>
</div></div></blockquote></div><br></div>
</div></div></div><br></div>