On Wed, Aug 25, 2021 at 4:34 PM Gianluca Cecchi <gianluca.cecchi@gmail.com> wrote:

Same problem with 4.4.8 async 1.

I'm deploying/restoring from novirt2; the other host (still on 4.3.10) is novirt1.

The deployment gets as far as:

[ INFO  ] TASK [ovirt.ovirt.hosted_engine_setup : Check actual cluster location]
[ INFO  ] skipping: [localhost]
[ INFO  ] TASK [ovirt.ovirt.hosted_engine_setup : Enable GlusterFS at cluster level]
[ INFO  ] skipping: [localhost]
[ INFO  ] TASK [ovirt.ovirt.hosted_engine_setup : Set VLAN ID at datacenter level]
[ INFO  ] skipping: [localhost]
[ INFO  ] TASK [ovirt.ovirt.hosted_engine_setup : Get active list of active firewalld zones]
[ INFO  ] changed: [localhost]
[ INFO  ] TASK [ovirt.ovirt.hosted_engine_setup : Configure libvirt firewalld zone]
[ INFO  ] changed: [localhost]
[ INFO  ] TASK [ovirt.ovirt.hosted_engine_setup : Add host]
[ INFO  ] changed: [localhost]
[ INFO  ] TASK [ovirt.ovirt.hosted_engine_setup : Include after_add_host tasks files]
[ INFO  ] You can now connect to https://novirt2.localdomain.local:6900/ovirt-engine/ and check the status of this host and eventually remediate it, please continue only when the host is listed as 'up'
[ INFO  ] TASK [ovirt.ovirt.hosted_engine_setup : include_tasks]
[ INFO  ] ok: [localhost]
[ INFO  ] TASK [ovirt.ovirt.hosted_engine_setup : Create temporary lock file]
[ INFO  ] changed: [localhost]
[ INFO  ] TASK [ovirt.ovirt.hosted_engine_setup : Pause execution until /tmp/ansible.5f702qq5_he_setup_lock is removed, delete it once ready to proceed]

At that point I can connect to the local engine's web admin UI, where novirt1 shows as up while novirt2 is non responsive.

Every 3 seconds I see these 3 lines in engine.log:

2021-08-27 11:05:54,065+02 INFO  [org.ovirt.vdsm.jsonrpc.client.reactors.ReactorClient] (SSL Stomp Reactor) [] Connecting to novirt2.localdomain.local/172.19.0.232
2021-08-27 11:05:54,067+02 ERROR [org.ovirt.engine.core.vdsbroker.monitoring.HostMonitoring] (EE-ManagedScheduledExecutorService-engineScheduledThreadPool-Thread-60) [] Unable to RefreshCapabilities: ConnectException: Connection refused
2021-08-27 11:05:54,068+02 ERROR [org.ovirt.engine.core.vdsbroker.vdsbroker.GetCapabilitiesAsyncVDSCommand] (EE-ManagedScheduledExecutorService-engineScheduledThreadPool-Thread-60) [] Command 'GetCapabilitiesAsyncVDSCommand(HostName = novirt2.localdomain.local, VdsIdAndVdsVDSCommandParametersBase:{hostId='ca9ff6f7-5a7c-4168-9632-998c52f76cfa', vds='Host[novirt2.localdomain.local,ca9ff6f7-5a7c-4168-9632-998c52f76cfa]'})' execution failed: java.net.ConnectException: Connection refused
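
"Connection refused" usually means the target answered but nothing is listening on that port, as opposed to a timeout, where packets are silently dropped. A minimal sketch of a probe that tells the two apart, to be run from the engine VM, is below. The port 54321 is an assumption (the usual VDSM port; the log above does not show the port), and probe_tcp is a hypothetical helper name.

```python
import socket


def probe_tcp(host, port, timeout=3.0):
    """Try one TCP connect and classify the result.

    Returns "open", "refused", "timeout", or "error: <details>".
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return "open"
    except ConnectionRefusedError:
        # Host reachable, but no listener on the port (or an active reject).
        return "refused"
    except socket.timeout:
        # No answer at all: often a firewall drop or a routing problem.
        return "timeout"
    except OSError as exc:
        # Name resolution failures, unreachable network, and so on.
        return f"error: {exc}"


if __name__ == "__main__":
    # Assumption: 54321 is the VDSM port; adjust if your setup differs.
    print(probe_tcp("novirt2.localdomain.local", 54321))
```

If this reports "refused", the next thing to look at would be whether the service is actually running and listening on novirt2; "timeout" would point more toward firewalling.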


Can anyone tell me what I can check?
Did you test the SHE upgrade from 4.3.10 to 4.4.8 in your test flows?

BTW: I'm not using DNS, only entries in /etc/hosts.
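
Since name resolution here relies on /etc/hosts alone, one thing worth ruling out is a name that resolves differently (or not at all) on the engine VM than on the hosts. A small sketch of such a check follows. It assumes the system resolver consults /etc/hosts, which is the usual configuration. The expected address is the one shown in engine.log above, and find_mismatches is a hypothetical helper name.

```python
import socket


def find_mismatches(expected):
    """Return {name: resolved_address_or_None} for every name that does
    not resolve to the expected IPv4 address."""
    mismatches = {}
    for name, want in expected.items():
        try:
            got = socket.gethostbyname(name)
        except socket.gaierror:
            got = None  # the name does not resolve at all
        if got != want:
            mismatches[name] = got
    return mismatches


if __name__ == "__main__":
    # Address taken from the engine.log lines above; add novirt1 and the
    # engine FQDN with their own addresses as needed.
    expected = {"novirt2.localdomain.local": "172.19.0.232"}
    print(find_mismatches(expected) or "all names resolve as expected")
```

Running it on both hosts and on the engine VM should give the same result everywhere.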

Thanks,
Gianluca