I'm working on setting up my environment prior to production, and have run into an issue.

I got most things configured, but due to a limitation on one of my switches, I decided to change the management vlan that the hosts communicate on. Over the course of changing that vlan, I wound up resetting my router to default settings.

I have the router operational again, and I also have 1 of my switches operational.
Now, I'm trying to bring the oVirt cluster back online.
This is oVirt 4.5 running on RHEL 8.3.

The old vlan is 1, and the new vlan is 10.

Currently, hosts 2 & 3 are accessible over the new vlan, and can ping each other.
I'm able to ssh to both hosts, and when I run "gluster peer status", I see that they are connected to each other.

However, host 1 is not accessible from anything. I can't ping it, and it cannot get out.

As part of my troubleshooting, I've done the following:
From the host console, I ran `nmcli connection delete` to delete the old vlan (VLAN 1).
I moved the /etc/sysconfig/network-scripts/interface.1 file to interface.10, and edited the file accordingly to make sure the vlan and device settings are set to 10 instead of 1, and I rebooted the host.

The engine seems to be running, but I don't understand why.
From each of the hosts that are working (host 2 and host 3), I ran "hosted-engine --check-liveliness" and both hosts indicate that the engine is NOT running.

Yet the engine loads in a web browser, and I'm able to log into /ovirt-engine/webadmin/.
The engine thinks that all 3 hosts is nonresponsive. See screenshot below:

Screenshot from 2021-04-07 17-33-48.png

What I'm really looking for help with is to get the first host back online.
Once it is healthy and gluster is healthy, I feel confident I can get the engine operational again.

What else should I look for on this host? 


Sent with ProtonMail Secure Email.