Hosted-engine fail and host reboot

I tried with a 1 baremetal host, 3 baremetals or virtual hosts and I still have the same problem for installing the hosted-engine. Hyperconverge installs well. I have tried with multiple version of ovirt ISO file 4.4.1 and 4.4.4 and 4.4.6. when I run hosted-engine --deploy or with the cockpit, it creates a temporary VM in the 192.168.222.x subnet and I am able to connect in ssh on this temporary ip. When the script displays "TASK [ovirt.ovirt.hoted_engine_setup]: Wait for the host to be up" the server reboots and I have nothing left. problem when creation of the ovirtmgmt? here all my log https://drive.google.com/drive/folders/1kFFSlIqbjVwSN8t88aQZZR45DYEHPUvt?usp... 021-05-26 10:15:09,135-0400 INFO otopi.ovirt_hosted_engine_setup.ansible_utils ansible_utils._process_output:111 TASK [ovirt.ovirt.hosted_engine_setup : Always revoke the SSO token] 2021-05-26 10:15:10,439-0400 DEBUG otopi.ovirt_hosted_engine_setup.ansible_utils ansible_utils._process_output:105 {'msg': "You must specify either 'url' or 'hostname'.", 'invocation': {'module_args': {'state': 'absent', 'ovirt_auth': {'changed': False, 'ansible_facts': {'ovirt_auth': {'token': 'Mz2onwB7qWX2x8HnJVgetQIQ9U4eVziRt8TEabfoizI2B98d0PDp-yxTU92a9lbun2vcr_i5yOXRsJKJKhqkVw', 'url': 'https://oe.telecom.lan/ovirt-engine/api', 'ca_file': None, 'insecure': True, 'timeout': 0, 'compress': True, 'kerberos': False, 'headers': None}}, 'failed': False, 'attempts': 1}, 'timeout': 0, 'compress': True, 'kerberos': False, 'url': None, 'hostname': None, 'username': None, 'password': None, 'ca_file': None, 'insecure': None, 'headers': None, 'token': None}}, '_ansible_no_log': False, 'changed': False} 2021-05-26 10:15:10,540-0400 DEBUG otopi.ovirt_hosted_engine_setup.ansible_utils ansible_utils._process_output:105 ignored: [localhost]: FAILED! => {"changed": false, "msg": "You must specify either 'url' or 'hostname'."} 2021-05-26 10:15:11,643-0400 INFO otopi.ovirt_hosted_engine_setup.ansible_utils ansible_utils._process_output:111 TASK [ovirt.ovirt.hosted_engine_setup : include_tasks] 2021-05-26 10:15:12,647-0400 INFO otopi.ovirt_hosted_engine_setup.ansible_utils ansible_utils._process_output:111 ok: [localhost] 2021-05-26 10:15:13,851-0400 INFO otopi.ovirt_hosted_engine_setup.ansible_utils ansible_utils._process_output:111 TASK [ovirt.ovirt.hosted_engine_setup : Obtain SSO token using username/password credentials] 2021-05-26 10:15:15,261-0400 INFO otopi.ovirt_hosted_engine_setup.ansible_utils ansible_utils._process_output:111 ok: [localhost] 2021-05-26 10:15:17,275-0400 INFO otopi.ovirt_hosted_engine_setup.ansible_utils ansible_utils._process_output:111 TASK [ovirt.ovirt.hosted_engine_setup : Wait for the host to be up] 2021-05-26 10:22:54,758-0400 ERROR otopi.context context._executeMethod:154 Failed to execute stage 'Closing up': SIG1 2021-05-26 10:22:54,762-0400 DEBUG otopi.context context.dumpEnvironment:765 ENVIRONMENT DUMP - BEGIN 2021-05-26 10:22:54,763-0400 DEBUG otopi.context context.dumpEnvironment:775 ENV BASE/error=bool:'True' 2021-05-26 10:22:54,763-0400 DEBUG otopi.context context.dumpEnvironment:775 ENV BASE/exceptionInfo=list:'[(<class 'RuntimeError'>, RuntimeError('SIG1',), <traceback object at 0x7fa8fc1ca1c8>)]' thank you

On Wed, May 26, 2021 at 7:13 PM Dominique D <dominique.deschenes@gcgenicom.com> wrote:
I tried with a 1 baremetal host, 3 baremetals or virtual hosts and I still have the same problem for installing the hosted-engine. Hyperconverge installs well.
I have tried with multiple version of ovirt ISO file 4.4.1 and 4.4.4 and 4.4.6.
when I run hosted-engine --deploy or with the cockpit, it creates a temporary VM in the 192.168.222.x subnet and I am able to connect in ssh on this temporary ip. When the script displays "TASK [ovirt.ovirt.hoted_engine_setup]: Wait for the host to be up" the server reboots and I have nothing left.
Sounds like it should have been fixed with [1]. Are you sure this happens also with 4.4.6? [1] https://github.com/oVirt/ovirt-ansible-collection/pull/218 Best regards, -- Didi

it seems to be this problem I tried to install it again with version 4.4.6-2021051809 and I get this message. [ INFO ] TASK [ovirt.ovirt.hosted_engine_setup : Open a port on firewalld] [ ERROR ] fatal: [localhost]: FAILED! => {"changed": false, "msg": "ERROR: Exception caught: org.fedoraproject.FirewallD1.Exception: ALREADY_ENABLED: '6900:tcp' already in 'public' Non-permanent operation"}

Maybe you can remove 6900/tcp from firewalld and try again ? Best Regards,Strahil Nikolov On Thu, May 27, 2021 at 19:43, Dominique D<dominique.deschenes@gcgenicom.com> wrote: it seems to be this problem I tried to install it again with version 4.4.6-2021051809 and I get this message. [ INFO ] TASK [ovirt.ovirt.hosted_engine_setup : Open a port on firewalld] [ ERROR ] fatal: [localhost]: FAILED! => {"changed": false, "msg": "ERROR: Exception caught: org.fedoraproject.FirewallD1.Exception: ALREADY_ENABLED: '6900:tcp' already in 'public' Non-permanent operation"} _______________________________________________ Users mailing list -- users@ovirt.org To unsubscribe send an email to users-leave@ovirt.org Privacy Statement: https://www.ovirt.org/privacy-policy.html oVirt Code of Conduct: https://www.ovirt.org/community/about/community-guidelines/ List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/DASEBWHNO2RT2Q...

Hi Strahil, I did that and it worked. firewall-cmd --zone=public --remove-port=6900/tcp firewall-cmd --runtime-to-permanent hosted-engine --deploy Thank you Dominique ----- Message reçu ----- De: Strahil Nikolov (hunter86_bg@yahoo.com) Date: 28/05/21 14:10 À: Dominique D (dominique.deschenes@gcgenicom.com), users@ovirt.org Objet: Re: [ovirt-users] Re: Hosted-engine fail and host reboot Maybe you can remove 6900/tcp from firewalld and try again ? Best Regards, Strahil Nikolov On Thu, May 27, 2021 at 19:43, Dominique D <dominique.deschenes@gcgenicom.com> wrote: it seems to be this problem I tried to install it again with version 4.4.6-2021051809 and I get this message. [ INFO ] TASK [ovirt.ovirt.hosted_engine_setup : Open a port on firewalld] [ ERROR ] fatal: [localhost]: FAILED! => {"changed": false, "msg": "ERROR: Exception caught: org.fedoraproject.FirewallD1.Exception: ALREADY_ENABLED: '6900:tcp' already in 'public' Non-permanent operation"} _______________________________________________ Users mailing list -- users@ovirt.org To unsubscribe send an email to users-leave@ovirt.org Privacy Statement: https://www.ovirt.org/privacy-policy.html oVirt Code of Conduct: https://www.ovirt.org/community/about/community-guidelines/ List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/DASEBWHNO2RT2Q...

On Sat, May 29, 2021 at 7:03 PM Dominique Deschênes <dominique.deschenes@gcgenicom.com> wrote:
Hi Strahil,
I did that and it worked.
Thanks for the report. This looks identical to a similar case from a few weeks ago [1]. Any chance you can try checking what/who did this change to your firewall conf prior to deployment? It sounds like a new change somewhere. Thanks and best regards, [1] https://lists.ovirt.org/archives/list/users@ovirt.org/thread/5SEB6PJCFTLXKOI...
firewall-cmd --zone=public --remove-port=6900/tcp firewall-cmd --runtime-to-permanent hosted-engine --deploy
Thank you
Dominique
----- Message reçu ----- ________________________________ De: Strahil Nikolov (hunter86_bg@yahoo.com) Date: 28/05/21 14:10 À: Dominique D (dominique.deschenes@gcgenicom.com), users@ovirt.org Objet: Re: [ovirt-users] Re: Hosted-engine fail and host reboot
Maybe you can remove 6900/tcp from firewalld and try again ?
Best Regards, Strahil Nikolov
On Thu, May 27, 2021 at 19:43, Dominique D <dominique.deschenes@gcgenicom.com> wrote: it seems to be this problem
I tried to install it again with version 4.4.6-2021051809 and I get this message.
[ INFO ] TASK [ovirt.ovirt.hosted_engine_setup : Open a port on firewalld] [ ERROR ] fatal: [localhost]: FAILED! => {"changed": false, "msg": "ERROR: Exception caught: org.fedoraproject.FirewallD1.Exception: ALREADY_ENABLED: '6900:tcp' already in 'public' Non-permanent operation"}
_______________________________________________ Users mailing list -- users@ovirt.org To unsubscribe send an email to users-leave@ovirt.org Privacy Statement: https://www.ovirt.org/privacy-policy.html oVirt Code of Conduct: https://www.ovirt.org/community/about/community-guidelines/ List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/DASEBWHNO2RT2Q...
_______________________________________________ Users mailing list -- users@ovirt.org To unsubscribe send an email to users-leave@ovirt.org Privacy Statement: https://www.ovirt.org/privacy-policy.html oVirt Code of Conduct: https://www.ovirt.org/community/about/community-guidelines/ List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/MLJTBXKTTTI2KE...
-- Didi

Hi, I tried to deploy hosted-engine without deploying gluster (Hyperconverged) and I did not need to remove 6900/tcp. Dominique Deschênes Ingénieur chargé de projets, Responsable TI 816, boulevard Guimond, Longueuil J4G 1T5 450 670-8383 x105 450 670-2259 ----- Message reçu ----- De: Yedidyah Bar David (didi@redhat.com) Date: 31/05/21 02:36 À: Dominique Deschênes (dominique.deschenes@gcgenicom.com) Cc: Strahil Nikolov (hunter86_bg@yahoo.com), users (users@ovirt.org) Objet: Re: [ovirt-users] Re: Hosted-engine fail and host reboot On Sat, May 29, 2021 at 7:03 PM Dominique Deschênes <dominique.deschenes@gcgenicom.com> wrote:
Hi Strahil,
I did that and it worked.
Thanks for the report. This looks identical to a similar case from a few weeks ago [1]. Any chance you can try checking what/who did this change to your firewall conf prior to deployment? It sounds like a new change somewhere. Thanks and best regards, [1] https://lists.ovirt.org/archives/list/users@ovirt.org/thread/5SEB6PJCFTLXKOI...
firewall-cmd --zone=public --remove-port=6900/tcp firewall-cmd --runtime-to-permanent hosted-engine --deploy
Thank you
Dominique
----- Message reçu ----- ________________________________ De: Strahil Nikolov (hunter86_bg@yahoo.com) Date: 28/05/21 14:10 À: Dominique D (dominique.deschenes@gcgenicom.com), users@ovirt.org Objet: Re: [ovirt-users] Re: Hosted-engine fail and host reboot
Maybe you can remove 6900/tcp from firewalld and try again ?
Best Regards, Strahil Nikolov
On Thu, May 27, 2021 at 19:43, Dominique D <dominique.deschenes@gcgenicom.com> wrote: it seems to be this problem
I tried to install it again with version 4.4.6-2021051809 and I get this message.
[ INFO ] TASK [ovirt.ovirt.hosted_engine_setup : Open a port on firewalld] [ ERROR ] fatal: [localhost]: FAILED! => {"changed": false, "msg": "ERROR: Exception caught: org.fedoraproject.FirewallD1.Exception: ALREADY_ENABLED: '6900:tcp' already in 'public' Non-permanent operation"}
_______________________________________________ Users mailing list -- users@ovirt.org To unsubscribe send an email to users-leave@ovirt.org Privacy Statement: https://www.ovirt.org/privacy-policy.html oVirt Code of Conduct: https://www.ovirt.org/community/about/community-guidelines/ List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/DASEBWHNO2RT2Q...
_______________________________________________ Users mailing list -- users@ovirt.org To unsubscribe send an email to users-leave@ovirt.org Privacy Statement: https://www.ovirt.org/privacy-policy.html oVirt Code of Conduct: https://www.ovirt.org/community/about/community-guidelines/ List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/MLJTBXKTTTI2KE...
-- Didi

The linked report was from me. The port that was failing for me was originally 5900. I deleted the rule from firewalld but then ran the installation again and it errored on 6900. I had to remove both. Gluster was configured during the HCI method *From:* Dominique Deschênes <dominique.deschenes@gcgenicom.com> *Sent:* Monday, May 31, 2021 4:08 PM *To:* Yedidyah Bar David <didi@redhat.com> *Cc:* Strahil Nikolov <hunter86_bg@yahoo.com>; users <users@ovirt.org> *Subject:* [ovirt-users] Re: Hosted-engine fail and host reboot Hi, I tried to deploy hosted-engine without deploying gluster (Hyperconverged) and I did not need to remove 6900/tcp. *Dominique Deschênes* Ingénieur chargé de projets, Responsable TI 816, boulevard Guimond, Longueuil J4G 1T5 450 670-8383 x105 450 670-2259 <http://www.gcgenicom.com/> <http://www.linkedin.com/company/groupe-conseil-g-nicom> <http://twitter.com/gcgenicom> ----- Message reçu ----- ------------------------------ De: Yedidyah Bar David (didi@redhat.com) Date: 31/05/21 02:36 À: Dominique Deschênes (dominique.deschenes@gcgenicom.com) Cc: Strahil Nikolov (hunter86_bg@yahoo.com), users (users@ovirt.org) Objet: *Re: [ovirt-users] Re: Hosted-engine fail and host reboot* On Sat, May 29, 2021 at 7:03 PM Dominique Deschênes <dominique.deschenes@gcgenicom.com> wrote:
Hi Strahil,
I did that and it worked.
Thanks for the report. This looks identical to a similar case from a few weeks ago [1]. Any chance you can try checking what/who did this change to your firewall conf prior to deployment? It sounds like a new change somewhere. Thanks and best regards, [1] https://lists.ovirt.org/archives/list/users@ovirt.org/thread/5SEB6PJCFTLXKOI...
firewall-cmd --zone=public --remove-port=6900/tcp firewall-cmd --runtime-to-permanent hosted-engine --deploy
Thank you
Dominique
----- Message reçu ----- ________________________________ De: Strahil Nikolov (hunter86_bg@yahoo.com) Date: 28/05/21 14:10 À: Dominique D (dominique.deschenes@gcgenicom.com), users@ovirt.org Objet: Re: [ovirt-users] Re: Hosted-engine fail and host reboot
Maybe you can remove 6900/tcp from firewalld and try again ?
Best Regards, Strahil Nikolov
On Thu, May 27, 2021 at 19:43, Dominique D <dominique.deschenes@gcgenicom.com> wrote: it seems to be this problem
I tried to install it again with version 4.4.6-2021051809 and I get this
message.
[ INFO ] TASK [ovirt.ovirt.hosted_engine_setup : Open a port on firewalld] [ ERROR ] fatal: [localhost]: FAILED! => {"changed": false, "msg":
"ERROR: Exception caught: org.fedoraproject.FirewallD1.Exception: ALREADY_ENABLED: '6900:tcp' already in 'public' Non-permanent operation"}
_______________________________________________ Users mailing list -- users@ovirt.org To unsubscribe send an email to users-leave@ovirt.org Privacy Statement: https://www.ovirt.org/privacy-policy.html oVirt Code of Conduct:
https://www.ovirt.org/community/about/community-guidelines/
List Archives:
https://lists.ovirt.org/archives/list/users@ovirt.org/message/DASEBWHNO2RT2Q...
_______________________________________________ Users mailing list -- users@ovirt.org To unsubscribe send an email to users-leave@ovirt.org Privacy Statement: https://www.ovirt.org/privacy-policy.html oVirt Code of Conduct:
https://www.ovirt.org/community/about/community-guidelines/
List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/MLJTBXKTTTI2KE...
-- Didi = -- *Notice to Recipient*: https://www.fixflyer.com/disclaimer <https://www.fixflyer.com/disclaimer>

On Mon, May 31, 2021 at 11:15 PM Charles Kozler <charles@fixflyer.com> wrote:
The linked report was from me. The port that was failing for me was originally 5900. I deleted the rule from firewalld but then ran the installation again and it errored on 6900. I had to remove both. Gluster was configured during the HCI method
Thanks for the clarification.
*From:* Dominique Deschênes <dominique.deschenes@gcgenicom.com> *Sent:* Monday, May 31, 2021 4:08 PM *To:* Yedidyah Bar David <didi@redhat.com> *Cc:* Strahil Nikolov <hunter86_bg@yahoo.com>; users <users@ovirt.org> *Subject:* [ovirt-users] Re: Hosted-engine fail and host reboot
Hi,
I tried to deploy hosted-engine without deploying gluster (Hyperconverged) and I did not need to remove 6900/tcp.
No? But you wrote below that you removed it and then it worked. I might be missing something.
*Dominique Deschênes* Ingénieur chargé de projets, Responsable TI 816, boulevard Guimond, Longueuil J4G 1T5 450 670-8383 x105 450 670-2259
<http://www.gcgenicom.com/> <http://www.linkedin.com/company/groupe-conseil-g-nicom> <http://twitter.com/gcgenicom>
----- Message reçu ----- ------------------------------
De: Yedidyah Bar David (didi@redhat.com) Date: 31/05/21 02:36 À: Dominique Deschênes (dominique.deschenes@gcgenicom.com) Cc: Strahil Nikolov (hunter86_bg@yahoo.com), users (users@ovirt.org) Objet: *Re: [ovirt-users] Re: Hosted-engine fail and host reboot*
On Sat, May 29, 2021 at 7:03 PM Dominique Deschênes <dominique.deschenes@gcgenicom.com> wrote:
Hi Strahil,
I did that and it worked.
Thanks for the report.
This looks identical to a similar case from a few weeks ago [1].
Any chance you can try checking what/who did this change to your firewall conf prior to deployment?
It sounds like a new change somewhere.
Adding Ritesh. Any idea? This is the second report we got about port 6900 already being configured in firewalld, thus failing the deployment. Best regards,
Thanks and best regards,
[1] https://lists.ovirt.org/archives/list/users@ovirt.org/thread/5SEB6PJCFTLXKOI...
firewall-cmd --zone=public --remove-port=6900/tcp firewall-cmd --runtime-to-permanent hosted-engine --deploy
Thank you
Dominique
----- Message reçu ----- ________________________________ De: Strahil Nikolov (hunter86_bg@yahoo.com) Date: 28/05/21 14:10 À: Dominique D (dominique.deschenes@gcgenicom.com), users@ovirt.org Objet: Re: [ovirt-users] Re: Hosted-engine fail and host reboot
Maybe you can remove 6900/tcp from firewalld and try again ?
Best Regards, Strahil Nikolov
On Thu, May 27, 2021 at 19:43, Dominique D <dominique.deschenes@gcgenicom.com> wrote: it seems to be this problem
I tried to install it again with version 4.4.6-2021051809 and I get this
message.
[ INFO ] TASK [ovirt.ovirt.hosted_engine_setup : Open a port on
firewalld]
[ ERROR ] fatal: [localhost]: FAILED! => {"changed": false, "msg": "ERROR: Exception caught: org.fedoraproject.FirewallD1.Exception: ALREADY_ENABLED: '6900:tcp' already in 'public' Non-permanent operation"}
_______________________________________________ Users mailing list -- users@ovirt.org To unsubscribe send an email to users-leave@ovirt.org Privacy Statement: https://www.ovirt.org/privacy-policy.html oVirt Code of Conduct: https://www.ovirt.org/community/about/community-guidelines/ List Archives:
https://lists.ovirt.org/archives/list/users@ovirt.org/message/DASEBWHNO2RT2Q...
_______________________________________________ Users mailing list -- users@ovirt.org To unsubscribe send an email to users-leave@ovirt.org Privacy Statement: https://www.ovirt.org/privacy-policy.html oVirt Code of Conduct:
https://www.ovirt.org/community/about/community-guidelines/
List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/MLJTBXKTTTI2KE...
-- Didi
=
*Notice to Recipient*: https://www.fixflyer.com/disclaimer
-- Didi

In https://github.com/gluster/gluster-ansible-infra there is an example with :vars: # Firewall setup gluster_infra_fw_ports: - 5900-6923/tcp Maybe that's causing the problem ? Best Regards,Strahil Nikolov

Yes, I think so Dominique Deschênes ----- Message reçu ----- De: Strahil Nikolov via Users (users@ovirt.org) Date: 02/06/21 13:02 À: Dominique Deschênes (dominique.deschenes@gcgenicom.com), Yedidyah Bar David (didi@redhat.com) Cc: users (users@ovirt.org) Objet: [ovirt-users] Re: Hosted-engine fail and host reboot In https://github.com/gluster/gluster-ansible-infra there is an example with : vars: # Firewall setup gluster_infra_fw_ports: - 5900-6923/tcp Maybe that's causing the problem ? Best Regards, Strahil Nikolov _______________________________________________ Users mailing list -- users@ovirt.org To unsubscribe send an email to users-leave@ovirt.org Privacy Statement: https://www.ovirt.org/privacy-policy.html oVirt Code of Conduct: https://www.ovirt.org/community/about/community-guidelines/ List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/BOEHHMVR3OBA6G...

If it is, then I think it's either a bug in the firewalld module or in firewalld itself. Best Regards,Strahil Nikolov On Fri, Jun 4, 2021 at 21:16, Dominique Deschênes<dominique.deschenes@gcgenicom.com> wrote: _______________________________________________ Users mailing list -- users@ovirt.org To unsubscribe send an email to users-leave@ovirt.org Privacy Statement: https://www.ovirt.org/privacy-policy.html oVirt Code of Conduct: https://www.ovirt.org/community/about/community-guidelines/ List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/PTE7IJNQECUG72...

Adding Parth, Parth do you have anything to say here. On Fri, Jun 4, 2021 at 11:59 PM Strahil Nikolov via Users <users@ovirt.org> wrote:
If it is, then I think it's either a bug in the firewalld module or in firewalld itself.
Best Regards, Strahil Nikolov
On Fri, Jun 4, 2021 at 21:16, Dominique Deschênes <dominique.deschenes@gcgenicom.com> wrote: _______________________________________________ Users mailing list -- users@ovirt.org To unsubscribe send an email to users-leave@ovirt.org Privacy Statement: https://www.ovirt.org/privacy-policy.html oVirt Code of Conduct: https://www.ovirt.org/community/about/community-guidelines/ List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/PTE7IJNQECUG72...
_______________________________________________ Users mailing list -- users@ovirt.org To unsubscribe send an email to users-leave@ovirt.org Privacy Statement: https://www.ovirt.org/privacy-policy.html oVirt Code of Conduct: https://www.ovirt.org/community/about/community-guidelines/ List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/SELTQYP77I73KC...
participants (6)
-
Charles Kozler
-
Dominique D
-
Dominique Deschênes
-
Ritesh Chikatwar
-
Strahil Nikolov
-
Yedidyah Bar David