New subject: 3node HCI fails when HostedEngineLocal is trying to add additional Gluster members

5 Dec 2019

Most probably one of the pre-start (ExecStartPre) tasks of vdsm or supervdsm is doing it. Each service has several of them, so you can run them manually until you find the one responsible.
Just try the following:
systemctl stop vdsmd supervdsmd
systemctl start supervdsmd
(check whether the certs are back under /etc/pki/vdsm)
systemctl start vdsmd
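
The steps above could be wrapped in a small script so the certificate check is not forgotten between the two starts. This is only a sketch: the function names are made up for illustration, and the assumption that the certificates and keys are *.pem files somewhere below /etc/pki/vdsm is mine, so adjust the pattern if your host looks different.

```shell
#!/bin/sh
# Directory named in this thread; override with PKI_DIR=... if needed.
PKI_DIR="${PKI_DIR:-/etc/pki/vdsm}"

# Print the number of *.pem files below the given directory.
# (The *.pem naming is an assumption, not something confirmed in the thread.)
count_pem_files() {
    find "$1" -type f -name '*.pem' 2>/dev/null | wc -l | tr -d ' '
}

# Hypothetical helper: same sequence as the manual steps, with the
# certificate check placed between starting supervdsmd and vdsmd.
restart_vdsm_with_cert_check() {
    systemctl stop vdsmd supervdsmd
    systemctl start supervdsmd
    n=$(count_pem_files "$PKI_DIR")
    echo "found $n certificate/key files under $PKI_DIR"
    systemctl start vdsmd
}
```

Run restart_vdsm_with_cert_check as root. If it reports 0 files after supervdsmd has started, supervdsmd is probably not the one creating them and the next candidate is vdsmd itself.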

Keep in mind that the chain of events (at least for me) is:
1. VG activation
2. VDO activation
3. Gluster brick is mounted (I use a systemd service because of the dependencies between VDO, the Gluster brick and glusterd)
4. Glusterd and libvirt are started
5. Sanlock is started
6. Supervdsm
7. Vdsm
If this is a host that will host HostedEngine VM:
8. Ovirt-ha-broker
9. Ovirt-ha-agent
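
To see where this chain breaks on a host, something like the following could walk the services in the same order and report the first one that is not running. It is a rough sketch, not a tested tool: the function names are invented, the unit names are the usual ones on an oVirt node but may differ on your system, and VG activation and the brick mount are left out because their unit names are site-specific.

```shell
#!/bin/sh
# Units in the start order listed above (names assumed, adjust as needed).
# Drop the two ovirt-ha-* units on hosts that never run the HostedEngine VM.
UNITS="vdo glusterd libvirtd sanlock supervdsmd vdsmd ovirt-ha-broker ovirt-ha-agent"

# Thin wrapper around systemd so the check is easy to replace.
unit_is_active() {
    systemctl is-active --quiet "$1"
}

# Print the first unit in the chain that is not active and return 1;
# print nothing and return 0 if the whole chain is up.
first_inactive_unit() {
    for unit in $UNITS; do
        if ! unit_is_active "$unit"; then
            echo "$unit"
            return 1
        fi
    done
    return 0
}
```

Calling first_inactive_unit after a reboot shows how far the boot got. Everything after the unit it prints depends on that one, so start looking there.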

After cleanup, did you reboot?

Best Regards,
Strahil Nikolov

On Dec 4, 2019 17:14, thomas@hoberg.net wrote:
...
After spending another couple of hours trying to track down the problem, I have found that the "lost connection" seems to be due to KVM shutting down, because it cannot find the certificates for the Spice and VNC connections in /etc/pki/vdsm/*, where 'ovirt-hosted-engine-cleanup' deleted them.
So now I wonder: Who is supposed to (re-)generate them afterwards?
Assuming that it was a much earlier step, I proceeded to completely undo the deployment, get rid of the Gluster setup etc. and start from the very beginning, only to find that this didn't change a thing: It was still missing those certificates...
...while something or someone *did* generate them when I tried a distinct and new set of nodes for counter-testing.
That setup failed with an Ansible error (reported separately), but I have now grown afraid of using 'ovirt-hosted-engine-cleanup' when I don't know how to get the ciphers/keys for /etc/pki/vdsm/{spice|vnc} regenerated...
Can anyone shed some light into this darkness?
