On Tue, Oct 11, 2022 at 4:54 AM andrea.crisanti--- via Users
<users(a)ovirt.org> wrote:
Hi,
I am trying to install oVirt 4.5 on a 4-host cluster running CentOS Stream 8, but the
engine does not start and the whole process fails.
Here is my procedure:
dnf install centos-release-ovirt45
dnf module reset virt
dnf module enable virt:rhel
dnf install ovirt-engine-appliance
dnf install ovirt-hosted-engine-setup
The latest version of ansible [ansible-core 2.13] uses python3.9, and the installation
fails because some python3.9 modules are missing
[python39-netaddr, python39-jmespath] and cannot be installed [they conflict with
python3-jmespath]. So I downgraded ansible to ansible-core 2.12:
dnf downgrade ansible-core
+1
Now
hosted-engine --deploy --4
proceeds further but stops because it cannot start the engine:
[ INFO ] TASK [ovirt.ovirt.hosted_engine_setup : Wait for the host to be up]
[ INFO ] TASK [ovirt.ovirt.hosted_engine_setup : Notify the user about a failure]
[ ERROR ] fatal: [localhost]: FAILED! => {"changed": false, "msg":
"Host is not up, please check logs, perhaps also on the engine machine"}
Please note the error message - "Host is not up", not "engine is not
up".
This means that the deploy process did set up and start an engine, then
tried to add the host (the one you deploy on) to the engine, and this
failed somewhere.
I looked into the log file
/var/log/ovirt-hosted-engine-setup/ovirt-hosted-engine-setup-ansible-bootstrap_local_vm-20221007132728-yp7cd1.log
and I found the following error:
2022-10-07 13:28:30,881+0200 ERROR ansible failed {
"ansible_host": "localhost",
"ansible_playbook":
"/usr/share/ovirt-hosted-engine-setup/he_ansible/trigger_role.yml",
"ansible_result": {
"_ansible_no_log": false,
"changed": false,
"cmd": [
"virsh",
"net-undefine",
"default"
],
"delta": "0:00:00.039258",
"end": "2022-10-07 13:28:30.710401",
"invocation": {
"module_args": {
"_raw_params": "virsh net-undefine default",
"_uses_shell": false,
"argv": null,
"chdir": null,
"creates": null,
"executable": null,
"removes": null,
"stdin": null,
"stdin_add_newline": true,
"strip_empty_ends": true,
"warn": false
}
},
"msg": "non-zero return code",
"rc": 1,
"start": "2022-10-07 13:28:30.671143",
"stderr": "error: failed to get network 'default'\nerror:
Network not found: no network with matching name 'default'",
"stderr_lines": [
"error: failed to get network 'default'",
"error: Network not found: no network with matching name
'default'"
],
"stdout": "",
"stdout_lines": []
},
"ansible_task": "Update libvirt default network configuration,
undefine",
This isn't your issue. If you check the code, you will see that this task
has "ignore_errors: true", and a failure there is actually almost always
expected.
"ansible_type": "task",
"status": "FAILED",
"task_duration": 0
}
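Since some failures in this log are expected (tasks with "ignore_errors: true"), it can help to list only the names of the failed tasks and then check each one against the role. A minimal sketch, assuming every failure record starts with a line containing "ERROR ansible failed" and carries an "ansible_task" key on a single line, as in the excerpt above; the function name list_failed_tasks is made up for illustration:

```shell
# Print the task name of every "ansible failed" record in a setup log.
# Assumption: the record layout matches the excerpt above, and the
# "ansible_task" value is on one line in the real (unwrapped) log file.
list_failed_tasks() {
    awk '
        /ERROR ansible failed/ { in_failure = 1; next }
        in_failure && /"ansible_task":/ {
            sub(/^.*"ansible_task": *"/, "")   # drop the key and opening quote
            sub(/",?[ \t]*$/, "")              # drop the closing quote and comma
            print
            in_failure = 0
        }
    ' "$1"
}
```

Calling it with the path of one of the files under /var/log/ovirt-hosted-engine-setup/ prints one task name per failure. A task listed here is not necessarily the cause of the abort; it still has to be checked for "ignore_errors" in the role.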
Needless to say, firewalld and libvirtd are both up,
and virsh net-list gives:
 Name          State    Autostart   Persistent
------------------------------------------------
 ;vdsmdummy;   active   no          no
 default       active   no          yes
I googled around without success.
Has anyone had similar problems?
Please check/share /var/log/ovirt-engine on the engine machine, to try
to understand why adding the host failed.
The engine machine, at this point, is still local to your host, having
a private IP address - you can find that one by searching
/var/log/ovirt-hosted-engine-setup/* for 'local_vm_ip'. I usually use
this:
# ssh -o StrictHostKeyChecking=No -o UserKnownHostsFile=/dev/null -l root \
    $(grep -A20 local_vm_ip \
        $(ls -t /var/log/ovirt-hosted-engine-setup/ovirt-hosted-engine-setup-ansible-bootstrap_local_vm* | head -1) \
      | sed -n 's/^ "stdout": "\([^"]*\)",/\1/p' \
      | sed 's/\\n.*//')
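The one-liner above can also be split into two small functions, which makes it easier to see what each stage does and to test the extraction on a copy of the log. This is only a sketch: the function names are made up, the leading-whitespace match (" *") and the final "head -1" are my assumptions about the log layout, not part of the original command.

```shell
# Print the path of the newest bootstrap_local_vm log
# (path pattern taken from the command above).
latest_bootstrap_log() {
    ls -t /var/log/ovirt-hosted-engine-setup/ovirt-hosted-engine-setup-ansible-bootstrap_local_vm* | head -1
}

# Print the engine VM's private IP address found in the given log file.
# Assumption: the address is the first part of a "stdout" value that
# appears within 20 lines after a line mentioning local_vm_ip.
local_vm_ip_from_log() {
    grep -A20 local_vm_ip "$1" \
        | sed -n 's/^ *"stdout": "\([^"]*\)",.*/\1/p' \
        | sed 's/\\n.*//' \
        | head -1
}
```

Usage would then be along the lines of:

    # ssh -o StrictHostKeyChecking=No -o UserKnownHostsFile=/dev/null -l root "$(local_vm_ip_from_log "$(latest_bootstrap_log)")"

Once logged in, engine.log under /var/log/ovirt-engine is usually the first file to look at.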
Best regards,
--
Didi