Dear all,
I think I have finally understood the issue, even though I don't know how to fix it.

When trying to install a new HE from a backup, I get the error:
 "The host has been set in non_operational status, please check engine logs, more info can be found in the engine logs, fix accordingly and re-deploy."

Note that it is the host, not the hosted engine. This is clearer in another log:
Host <removed_for_privacy> is set to Non-Operational, it is missing the following networks: 'iscsi_net,sgsi_iscsi,sgsi_priv,sgsi_vpn'

The fact is that those networks are present on the host:
# ip addr
<CUT>
26: sgsi_priv: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc noqueue state UP group default qlen 1000
    link/ether 90:e2:ba:63:2e:bc brd ff:ff:ff:ff:ff:ff
    inet6 fe80::92e2:baff:fe63:2ebc/64 scope link
28: sgsi_vpn: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc noqueue state UP group default qlen 1000
    link/ether 90:e2:ba:63:2e:bc brd ff:ff:ff:ff:ff:ff
    inet6 fe80::92e2:baff:fe63:2ebc/64 scope link
       valid_lft forever preferred_lft forever

The other two are configured in oVirt but cannot be configured on the bare-metal system; indeed, if I run "ip addr" on a production host I don't see those networks at all. I am puzzled. The problem is definitely this one: can anyone suggest how to proceed?
Why is it complaining about sgsi_priv and sgsi_vpn, which are not missing at all?
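
To make the comparison above repeatable, here is a minimal sketch that pulls the network names out of the "missing the following networks" message and checks whether a kernel device of the same name exists. The function names missing_networks and check_devices are made up for illustration, and /sys/class/net is assumed to be where the kernel lists its network devices. This only shows whether a device with that name exists on the host; whether the engine considers the logical network attached is a separate question that this does not answer.

```shell
# Print the network names from a "missing the following networks" message,
# one per line. Reads the log line(s) on stdin.
missing_networks() {
    sed -n "s/.*missing the following networks: '\([^']*\)'.*/\1/p" | tr ',' '\n'
}

# For each name read from stdin, report whether a device of that name exists.
# $1 is the directory to look in (assumption: defaults to /sys/class/net).
check_devices() {
    netdir="${1:-/sys/class/net}"
    while IFS= read -r net; do
        [ -n "$net" ] || continue
        if [ -e "$netdir/$net" ]; then
            echo "$net present"
        else
            echo "$net absent"
        fi
    done
}

# Example (the log file name is a placeholder, not a real path):
#   grep 'missing the following networks' some-engine.log | missing_networks | check_devices
```

On the host above I would expect sgsi_priv and sgsi_vpn to be reported as present and the other two as absent, which matches the "ip addr" output.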

Andrea


On 15/07/2020 08:33, Yedidyah Bar David wrote:
On Tue, Jul 14, 2020 at 6:04 PM Andrea Chierici
<andrea.chierici@cnaf.infn.it> wrote:
Hi,
thank you for your help.


I think this is not a critical failure, and is not what failed the restore.



Recently I tried the 4.3.11 beta and 4.4.1, and the error is now different:

[ INFO  ] Upgrading CA
[ ERROR ] Failed to execute stage 'Misc configuration': (2, 'No such file or directory')
[ INFO  ] DNF Performing DNF transaction rollback

This is part of 'engine-setup' output, which 'hosted-engine' runs inside the engine VM. If you can access the engine VM, you can try finding more information in /var/log/ovirt-engine/setup/* there. Otherwise, the hosted-engine deploy script might have managed to get a copy to /var/log/ovirt-hosted-engine-setup/engine-logs*. Please check/share these. Thanks.


Unfortunately the installation procedure deletes the VM when it exits, hence I can't log in.
Are you sure? Did you check with 'ps', searching qemu processes?

If it's still up, but still using a local IP address, you can find it
by searching the hosted-engine logs for 'local_vm_ip' and login there
from the host.
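
The two checks just described could be sketched as follows. The function names are hypothetical, and the default directory is the hosted-engine setup log location on the host mentioned earlier in this thread. The exact format of the lines containing 'local_vm_ip' is not assumed here, so the sketch simply prints every matching line.

```shell
# List running qemu processes, if any.
# The [q] in the pattern keeps grep from matching its own command line.
list_qemu() {
    ps -ef | grep '[q]emu'
}

# Print every log line mentioning local_vm_ip under a directory.
# $1 defaults to the hosted-engine setup log directory on the host.
find_local_vm_ip() {
    grep -rh 'local_vm_ip' "${1:-/var/log/ovirt-hosted-engine-setup}"
}

# Example:
#   list_qemu || echo "no qemu process found"
#   find_local_vm_ip
```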

Here are the ERROR messages I found in the logs copied to the host:

engine.log:2020-07-08 15:05:04,178+02 ERROR [org.ovirt.engine.core.bll.pm.FenceProxyLocator] (EE-ManagedScheduledExecutorService-engineScheduledThreadPool-Thread-89) [45a7e7f3] Can not run fence action on host '<erased_for_privacy>', no suitable proxy host was found.
That's ok.

server.log:2020-07-08 15:09:23,081+02 ERROR [org.jboss.resteasy.resteasy_jaxrs.i18n] (default task-1) RESTEASY002010: Failed to execute: javax.ws.rs.WebApplicationException: HTTP 404 Not Found
server.log:2020-07-08 15:14:19,804+02 ERROR [org.jboss.resteasy.resteasy_jaxrs.i18n] (default task-1) RESTEASY002010: Failed to execute: javax.ws.rs.WebApplicationException: HTTP 404 Not Found
This probably indicates a problem, but I agree it's not very helpful.

grep: setup: Is a directory
Right - so please search inside it.

Also please check the hosted-engine deploy logs themselves.
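
The "Is a directory" message means grep was given a directory without being told to descend into it. A minimal sketch of a recursive search, with search_errors as a made-up helper name:

```shell
# Recursively list lines containing ERROR under a directory,
# each prefixed with its file name and line number.
search_errors() {
    grep -rn 'ERROR' "$1"
}

# Example, using the log locations mentioned in this thread:
#   search_errors /var/log/ovirt-hosted-engine-setup    # on the host
#   search_errors /var/log/ovirt-engine/setup           # inside the engine VM
```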

Not very helpful.




I simply can't figure out what file is missing.
If, as a test, I try to install the HE without restoring the backup, the installation goes smoothly to the end, but at that point I can't restore the backup, as far as I can understand.

Another option is to do the restore manually. To find relevant information, search the net for "enginevm_before_engine_setup".


Later I will give it a try.
Good luck and best regards,

-- 
Andrea Chierici - INFN-CNAF 			
Viale Berti Pichat 6/2, 40127 BOLOGNA
Office Tel: +39 051 2095463	
SkypeID ataruz
--