On Mon, Mar 1, 2021, 15:20 <souvaliotimaria@mail.com> wrote:
Hello again,

I am back with a brief description of the situation I am in, and questions about the recovery.

oVirt environment: 4.3.5.2 Hyperconverged
GlusterFS: Replica 2 + Arbiter 1
GlusterFS volumes: data, engine, vmstore

The current situation is the following:

- The Cluster is in Global Maintenance.

- The volume engine is up with comment (in the Web GUI) : Up, unsynched entries, needs healing.

- The VM HostedEngine is paused due to a storage I/O error (Web GUI) while the output of virsh list --all command shows that the HostedEngine is running.

I tried to issue the gluster heal command (gluster volume heal engine) but nothing changed.

I have the following questions:

1. Should I restart the glusterd service? Where from? Is it enough if the glusterd is restarted on one host or should it be restarted on the other two as well?
It sounds as a gluster split brain. I would start from there. Can you check status by listing split brain entries?

2. Should the node that was NonResponsive and came back, be rebooted or not? It seems alright now and in good health.

3. Should the HostedEngine be restored with engine-backup or is it not necessary?

4. Could the loss of the DNS server for the oVirt hosts lead to an unresponsive host?
The nsswitch file on the ovirt hosts and engine, has the DNS defined as:
hosts:      files dns myhostname
If you have opted for dns liveliness checks it could be.

5. How can we recover/rectify the situation above?
I would start checking for gluster split brains and ensure that all hosts have connectivity in the storage domain net (ping, jumbo frames if enabled). 99% of my similar issues have been caused from gluster split.

The fact that the engine is shown as paused and that you can still access web ui makes me think you have a split brain issue 

Thanks for your help,
Maria Souvalioti
_______________________________________________
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-leave@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: https://www.ovirt.org/community/about/community-guidelines/
List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/GO6S6GXRJWYZN5NZ5IFTNQ6SGNEB75WQ/