New subject: Fwd: why host is not capable to run HE?

19 Feb 2018

      Thanks Martin.

As you suggested I updated hosted-engine.conf with correct host_id values
and restarted ovirt-ha-agent services on both hosts and now I run into the
problem with  status "unknown-stale-data" :(
And second host still doesn't looks as capable to run HE.

Should I stop HE VM, bring down ovirt-ha-agents and reinitialize-lockspace
and start ovirt-ha-agents again?

Regards,
Artem

On Mon, Feb 19, 2018 at 6:45 PM, Martin Sivak <msivak@redhat.com> wrote:
...
Hi Artem,
just a restart of ovirt-ha-agent services should be enough.
Best regards
Martin Sivak
On Mon, Feb 19, 2018 at 4:40 PM, Artem Tambovskiy
<artem.tambovskiy@gmail.com> wrote:
...
Ok, understood.
Once I set correct host_id on both hosts how to take changes in force?
With
minimal downtime? Or i need reboot both hosts anyway?
Regards,
Artem
19 февр. 2018 г. 18:18 пользователь "Simone Tiraboschi"
<stirabos@redhat.com> написал:
...
On Mon, Feb 19, 2018 at 4:12 PM, Artem Tambovskiy
<artem.tambovskiy@gmail.com> wrote:
...
Thanks a lot, Simone!
This is clearly shows a problem:
[root@ov-eng ovirt-engine]# sudo -u postgres psql -d engine -c 'select
vds_name, vds_spm_id from vds'
    vds_name     | vds_spm_id
-----------------+------------
 ovirt1.local |          2
 ovirt2.local |          1
(2 rows)
While hosted-engine.conf on ovirt1.local have host_id=1, and
ovirt2.local
...
...
host_id=2. So totally opposite values.
So how to get this fixed in the simple way? Update the engine DB?
I'd suggest to manually fix /etc/ovirt-hosted-engine/hosted-engine.conf
on
both the hosts
...
Regards,
Artem
On Mon, Feb 19, 2018 at 5:37 PM, Simone Tiraboschi <
stirabos@redhat.com>
...
wrote:
...
On Mon, Feb 19, 2018 at 12:13 PM, Artem Tambovskiy
<artem.tambovskiy@gmail.com> wrote:
...
Hello,
Last weekend my cluster suffered form a massive power outage due to
human mistake.
I'm using SHE setup with Gluster, I managed to bring the cluster up
quickly, but once again I have a problem with duplicated host_id
(https://bugzilla.redhat.com/show_bug.cgi?id=1543988) on second
host and due
...
...
to this second host is not capable to run HE.
I manually updated file hosted_engine.conf with correct host_id and
restarted agent & broker - no effect. Than I rebooted the host
itself -
still no changes. How to fix this issue?
I'd suggest to run this command on the engine VM:
sudo -u postgres scl enable rh-postgresql95 --  psql -d engine -c
'select vds_name, vds_spm_id from vds'
(just  sudo -u postgres psql -d engine -c 'select vds_name, vds_spm_id
from vds'  if still on 4.1) and check
/etc/ovirt-hosted-engine/hosted-engine.conf on all the involved host.
Maybe you can also have a leftover configuration file on undeployed
host.
When you find a conflict you should manually bring down sanlock
In doubt a reboot of both the hosts will solve for sure.
...
Regards,
Artem
_______________________________________________
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users
_______________________________________________
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users
_______________________________________________
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users

Fwd: Fwd: why host is not capable to run HE?

Artem Tambovskiy

Artem Tambovskiy

tags

participants (1)