HostedEngine install failure (oVirt 4.4 & oVirt Node OS)

Hello folks, Hoping I can trace this down here but kind of "out of the box" error going on here. Steps: - Install oVirt Node OS - Manual steps using ovirt-hosted-engine-setup Might be a step I glanced over so I'm alright with a finger point and RTFM statement. ;-) Process fails out: [ INFO ] TASK [ovirt.hosted_engine_setup : Obtain SSO token using username/password credentials] [ INFO ] ok: [localhost] [ INFO ] TASK [ovirt.hosted_engine_setup : Wait for the host to be up] [ ERROR ] fatal: [localhost]: FAILED! => {"attempts": 120, "changed": false, "ovirt_hosts": [{"address": "mtl-hv-14.teve.inc", "affinity _labels": [], "auto_numa_status": "unknown", "certificate": {"organization": "teve.inc", "subject": "O=teve.inc,CN=mtl-hv-14.teve.inc"}, "cluster": {"href": "/ovirt-engine/api/clusters/ba6daa62-b1a5-11ea-a207-00163e79d98c", "id": "ba6daa62-b1a5-11ea-a207-00163e79d98c"}, " comment": "", "cpu": {"speed": 0.0, "topology": {}}, "device_passthrough": {"enabled": false}, "devices": [], "external_network_provider _configurations": [], "external_status": "ok", "hardware_information": {"supported_rng_sources": []}, "hooks": [], "href": "/ovirt-engin e/api/hosts/e1399963-f520-4bdc-8ef0-832dc3d99ece", "id": "e1399963-f520-4bdc-8ef0-832dc3d99ece", "katello_errata": [], "kdump_status": " unknown", "ksm": {"enabled": false}, "max_scheduling_memory": 0, "memory": 0, "name": "mtl-hv-14.teve.inc", "network_attachments": [], " nics": [], "numa_nodes": [], "numa_supported": false, "os": {"custom_kernel_cmdline": ""}, "permissions": [], "port": 54321, "power_management": {"automatic_pm_enabled": true, "enabled": false, "kdump_detection": true, "pm_proxies": []}, "protocol": "stomp", "se_linux": $}, "spm": {"priority": 5, "status": "none"}, "ssh": {"fingerprint": "SHA256:rfVGiGz8dQU7Hr5irbd8N+xBkj94qWThArTokcSqGV8", "port": 22}, $statistics": [], "status": "install_failed", "storage_connection_extensions": [], "summary": {"total": 0}, "tags": [], "transparent_hug$_pages": {"enabled": false}, "type": "rhel", "unmanaged_networks": [], "update_available": false, "vgpu_placement": "consolidated"}]} ... [ ERROR ] fatal: [localhost]: FAILED! => {"changed": false, "msg": "The system may not be provisioned according to the playbook results$ please check the logs for the issue, fix accordingly or re-deploy from scratch.\n"} [ ERROR ] Failed to execute stage 'Closing up': Failed executing ansible-playbook I've attached the ovirt-hosted-engine-setup log. *Thank you,* *Ian Easter*

On Mon, Jun 22, 2020 at 9:21 AM Ian Easter <ieaster@telvue.com> wrote:
Hello folks,
Hoping I can trace this down here but kind of "out of the box" error going on here.
Steps: - Install oVirt Node OS - Manual steps using ovirt-hosted-engine-setup
Might be a step I glanced over so I'm alright with a finger point and RTFM statement. ;-)
Process fails out: [ INFO ] TASK [ovirt.hosted_engine_setup : Obtain SSO token using username/password credentials] [ INFO ] ok: [localhost] [ INFO ] TASK [ovirt.hosted_engine_setup : Wait for the host to be up]
This ^^^^ is the task that failed. The deploy process asks the engine to add the host, then polls the engine waiting until the host appears as Up. For you, it timed out. Please check/share all of the directory /var/log/ovirt-hosted-engine-setup, to try to find why. If engine-logs-* inside it is empty, you might try to get the engine logs from the engine VM itself - you can find its IP address by searching the logs for "local_vm_ip" and ssh to it from the host. Best regards,
[ ERROR ] fatal: [localhost]: FAILED! => {"attempts": 120, "changed": false, "ovirt_hosts": [{"address": "mtl-hv-14.teve.inc", "affinity _labels": [], "auto_numa_status": "unknown", "certificate": {"organization": "teve.inc", "subject": "O=teve.inc,CN=mtl-hv-14.teve.inc"}, "cluster": {"href": "/ovirt-engine/api/clusters/ba6daa62-b1a5-11ea-a207-00163e79d98c", "id": "ba6daa62-b1a5-11ea-a207-00163e79d98c"}, " comment": "", "cpu": {"speed": 0.0, "topology": {}}, "device_passthrough": {"enabled": false}, "devices": [], "external_network_provider _configurations": [], "external_status": "ok", "hardware_information": {"supported_rng_sources": []}, "hooks": [], "href": "/ovirt-engin e/api/hosts/e1399963-f520-4bdc-8ef0-832dc3d99ece", "id": "e1399963-f520-4bdc-8ef0-832dc3d99ece", "katello_errata": [], "kdump_status": " unknown", "ksm": {"enabled": false}, "max_scheduling_memory": 0, "memory": 0, "name": "mtl-hv-14.teve.inc", "network_attachments": [], " nics": [], "numa_nodes": [], "numa_supported": false, "os": {"custom_kernel_cmdline": ""}, "permissions": [], "port": 54321, "power_management": {"automatic_pm_enabled": true, "enabled": false, "kdump_detection": true, "pm_proxies": []}, "protocol": "stomp", "se_linux": $}, "spm": {"priority": 5, "status": "none"}, "ssh": {"fingerprint": "SHA256:rfVGiGz8dQU7Hr5irbd8N+xBkj94qWThArTokcSqGV8", "port": 22}, $statistics": [], "status": "install_failed", "storage_connection_extensions": [], "summary": {"total": 0}, "tags": [], "transparent_hug$_pages": {"enabled": false}, "type": "rhel", "unmanaged_networks": [], "update_available": false, "vgpu_placement": "consolidated"}]} ... [ ERROR ] fatal: [localhost]: FAILED! => {"changed": false, "msg": "The system may not be provisioned according to the playbook results$ please check the logs for the issue, fix accordingly or re-deploy from scratch.\n"} [ ERROR ] Failed to execute stage 'Closing up': Failed executing ansible-playbook
I've attached the ovirt-hosted-engine-setup log.
Thank you, Ian Easter
_______________________________________________ Users mailing list -- users@ovirt.org To unsubscribe send an email to users-leave@ovirt.org Privacy Statement: https://www.ovirt.org/privacy-policy.html oVirt Code of Conduct: https://www.ovirt.org/community/about/community-guidelines/ List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/QXT3G3THYA6MYQ...
-- Didi

I appreciate the follow up! I had determined that the /tmp mount did not have suitable space. The 1G allocation didn't seem suitable for the HostedEngine deployment. I have hit a new fail point, though, that I need to investigate to understand a little more.

These may have just been edge cases but the second issue seemed to simply time out while on the "Wait for the host to be up" task. Jobs ran inside of HostedEngine and the retries expired before it was completed. I needed to edit line 147, `retries: 120`, to a higher number so the process would complete and return the expected check result. (For reference, I made it 999 - just arbitrary) File requiring edit: /usr/share/ansible/roles/ovirt.hosted_engine_setup/tasks/bootstrap_local_vm/05_add_host.yml I did not run into the space limit on `/tmp` mount again so this _may have_ been from previous attempt just clogging that space up. *Thank you,* *Ian Easter* On Tue, Jun 23, 2020 at 10:06 AM <ieaster@telvue.com> wrote:
I appreciate the follow up!
I had determined that the /tmp mount did not have suitable space. The 1G allocation didn't seem suitable for the HostedEngine deployment.
I have hit a new fail point, though, that I need to investigate to understand a little more. _______________________________________________ Users mailing list -- users@ovirt.org To unsubscribe send an email to users-leave@ovirt.org Privacy Statement: https://www.ovirt.org/privacy-policy.html oVirt Code of Conduct: https://www.ovirt.org/community/about/community-guidelines/ List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/5KP45ZQVFZ4SWB...
participants (3)
-
Ian Easter
-
ieaster@telvue.com
-
Yedidyah Bar David