fresh ovirt node 4.4.6 fail on firewalld both host and engine deployment
by Charles Kozler
Hello -
Deployed fresh ovirt node 4.4.6 and the only thing I did to the system was
configure the NIC with nmtui
During the gluster install the install errored out with
gluster-deployment-1620832547044.log:failed: [n2] (item=5900/tcp) =>
{"ansible_loop_var": "item", "changed": false, "item": "5900/tcp", "msg":
"ERROR: Exception caught: org.fedoraproject.FirewallD1.Exception:
ALREADY_ENABLED: '5900:tcp' already in 'public' Permanent and
Non-Permanent(immediate) operation"}
The fix here was easy - I just deleted the port it was complaining about
with firewall-cmd and restarted the installation and it was all fine
During the hosted engine deployment when the VM is being deployed it dies
here
[ INFO ] TASK [ovirt.ovirt.hosted_engine_setup : Open a port on firewalld]
[ ERROR ] fatal: [localhost]: FAILED! => {"changed": false, "msg": "ERROR:
Exception caught: org.fedoraproject.FirewallD1.Exception: ALREADY_ENABLED:
'6900:tcp' already in 'public' Non-permanent operation"}
Now the issue here is that I do not have access to the engine VM as it is
in a bit of a transient state since when it fails the current image that is
open is discarded when the ansible playbook is kicked off again
I cannot find any BZ on this and google is turning up nothing. I don't
think firewalld failing due to the firewall rule already existing should be
a reason to exit the installation
The interesting part is that this only fails on certain ports. i.e when I
reran the gluster wizard after 5900 failed, the other ports are presumably
still added to the firewall, and the installation completes
Suggestions?
--
*Notice to Recipient*: https://www.fixflyer.com/disclaimer
<https://www.fixflyer.com/disclaimer>
3 years, 9 months
I am installing ovirt engine 4.3.10
by ken@everheartpartners.com
I am getting this error message when I install it on CentOS 7.9 when running the hosted engine setup.
[ INFO ] TASK [ovirt.hosted_engine_setup : Validate selected bridge interface if management bridge does not exists]
[ ERROR ] fatal: [localhost]: FAILED! => {"changed": false, "msg": "The selected network interface is not valid"}
I have two interfaces
enp6s0
enp11s0
Enp11s0 is the public network
enp6s0 is the storage network to the netapp.
Any idea how to resolve this?
3 years, 10 months
Fedora CoreOS
by lejeczek
Hi guys.
From what I gather there is no oVirt for Fedora CoreOS but
I should ask here at the source - is it there oVirt for that
OS and if there is not as of now, are the any plans or
discussion to make that reality?
many thanks, L.
3 years, 10 months
Data Centers status Not Operational
by nexpron@gmail.com
Hi everyone,
I found 3 servers (HV1, HV2, HV3) that contains VMs at another servers room. These servers are hosts based on KVM.
I logged into oVirt Engine Web Administration. Data Centers tab show me only one entry
Name: RCV
Storage: Shared
Status: Not Operational
Compability Version: 3.4
Description: [Empty]
Webbrowser oVirt Engine Web Administrator show me
oVirt Engine Version: 3.4.0-1.el6
One VM on HV3 has stopped, Hosts tab show Non Responsive Status for every host (HV1-3); every VM in Virtual Machines tab show Unknown status.
What do I'll do to change status in Data Center? How to start debug the reason? Last administrator leave documentation in vestigial form :)
DataCenter: RCV
-->Cluster: RCV_Cluster
---->Host: HV1 node
---->Host: Hv2 node
---->Host: HV3 engine
Best regards,
nexpron
3 years, 10 months
Hosted-engine fail and host reboot
by Dominique D
I tried with a 1 baremetal host, 3 baremetals or virtual hosts and I still have the same problem for installing the hosted-engine. Hyperconverge installs well.
I have tried with multiple version of ovirt ISO file 4.4.1 and 4.4.4 and 4.4.6.
when I run hosted-engine --deploy or with the cockpit, it creates a temporary VM in the 192.168.222.x subnet and I am able to connect in ssh on this temporary ip. When the script displays "TASK [ovirt.ovirt.hoted_engine_setup]: Wait for the host to be up" the server reboots and I have nothing left.
problem when creation of the ovirtmgmt?
here all my log https://drive.google.com/drive/folders/1kFFSlIqbjVwSN8t88aQZZR45DYEHPUvt?...
021-05-26 10:15:09,135-0400 INFO otopi.ovirt_hosted_engine_setup.ansible_utils ansible_utils._process_output:111 TASK [ovirt.ovirt.hosted_engine_setup : Always revoke the SSO token]
2021-05-26 10:15:10,439-0400 DEBUG otopi.ovirt_hosted_engine_setup.ansible_utils ansible_utils._process_output:105 {'msg': "You must specify either 'url' or 'hostname'.", 'invocation': {'module_args': {'state': 'absent', 'ovirt_auth': {'changed': False, 'ansible_facts': {'ovirt_auth': {'token': 'Mz2onwB7qWX2x8HnJVgetQIQ9U4eVziRt8TEabfoizI2B98d0PDp-yxTU92a9lbun2vcr_i5yOXRsJKJKhqkVw', 'url': 'https://oe.telecom.lan/ovirt-engine/api', 'ca_file': None, 'insecure': True, 'timeout': 0, 'compress': True, 'kerberos': False, 'headers': None}}, 'failed': False, 'attempts': 1}, 'timeout': 0, 'compress': True, 'kerberos': False, 'url': None, 'hostname': None, 'username': None, 'password': None, 'ca_file': None, 'insecure': None, 'headers': None, 'token': None}}, '_ansible_no_log': False, 'changed': False}
2021-05-26 10:15:10,540-0400 DEBUG otopi.ovirt_hosted_engine_setup.ansible_utils ansible_utils._process_output:105 ignored: [localhost]: FAILED! => {"changed": false, "msg": "You must specify either 'url' or 'hostname'."}
2021-05-26 10:15:11,643-0400 INFO otopi.ovirt_hosted_engine_setup.ansible_utils ansible_utils._process_output:111 TASK [ovirt.ovirt.hosted_engine_setup : include_tasks]
2021-05-26 10:15:12,647-0400 INFO otopi.ovirt_hosted_engine_setup.ansible_utils ansible_utils._process_output:111 ok: [localhost]
2021-05-26 10:15:13,851-0400 INFO otopi.ovirt_hosted_engine_setup.ansible_utils ansible_utils._process_output:111 TASK [ovirt.ovirt.hosted_engine_setup : Obtain SSO token using username/password credentials]
2021-05-26 10:15:15,261-0400 INFO otopi.ovirt_hosted_engine_setup.ansible_utils ansible_utils._process_output:111 ok: [localhost]
2021-05-26 10:15:17,275-0400 INFO otopi.ovirt_hosted_engine_setup.ansible_utils ansible_utils._process_output:111 TASK [ovirt.ovirt.hosted_engine_setup : Wait for the host to be up]
2021-05-26 10:22:54,758-0400 ERROR otopi.context context._executeMethod:154 Failed to execute stage 'Closing up': SIG1
2021-05-26 10:22:54,762-0400 DEBUG otopi.context context.dumpEnvironment:765 ENVIRONMENT DUMP - BEGIN
2021-05-26 10:22:54,763-0400 DEBUG otopi.context context.dumpEnvironment:775 ENV BASE/error=bool:'True'
2021-05-26 10:22:54,763-0400 DEBUG otopi.context context.dumpEnvironment:775 ENV BASE/exceptionInfo=list:'[(<class 'RuntimeError'>, RuntimeError('SIG1',), <traceback object at 0x7fa8fc1ca1c8>)]'
thank you
3 years, 10 months
Adding a Ubuntu Host's NFS share to oVirt
by David White
Hello,
Is it possible to use Ubuntu to share an NFS export with oVirt?I'm trying to setup a Backup Domain for my environment.
I got to the point of actually adding the new Storage Domain.
When I click OK, I see the storage domain appear momentarily before disappearing, at which point I get a message about oVirt not being able to obtain a lock.
It appears I'm running into the issue described in this thread: https://lists.ovirt.org/archives/list/users@ovirt.org/thread/BNVX.... Although the actual export is ext4, not xfs.
From what I'm reading on that thread and elsewhere, it sounds like this problem is a result of SELinux not being present, is that correct?
Is my only option here to install an OS that supports SELinux?
Sent with ProtonMail Secure Email.
3 years, 10 months
Can't remove snapshot
by David Johnson
Hi all,
I patched one of my Windows VM's yesterday. I started by snapshotting the
VM, then applied the Windows update. Now that the patch has been tested, I
want to remove the snapshot. I get this message:
Error while executing action:
win-sql-2019:
- Cannot remove Snapshot. The following attached disks are in ILLEGAL
status: win-2019-tmpl_Disk1 - please remove them and try again.
Does anyone have any thoughts how to recover from this? I really don't want
to keep this snapshot hanging around.
Thanks in advance,
*David Johnson*
3 years, 10 months
Cannot add new host to Ovirt
by pablo@miami.edu
I have an Ovirt installation with a hosted engine and three hosts. Using Gluster as the storage for the VMs.
Ovirt: 4.4.6.7
Hosts: CentOS Stream release 8 (updated to latest)
So far so good.
I am trying to add a new host to the cluster with the same OS and hardware as the others and I cannot get it to install, it gives me all kind of errors and it will not install.
I reinstalled the OS and I am getting the same results.
DNS is configured properly and working ok for all hosts.
I can see this error in this log file ansible-runner-service.log:
2021-05-30 15:46:38,319 - runner_service.services.hosts - ERROR - SSH - NOAUTH:SSH auth error - passwordless ssh not configured for 'ovirt4'
(sshd is configured exactly the same as all other hosts and I can login to this host without a password from the ovirt hosted engine)
I see these errors in the log engine.log:
2021-05-30 16:22:35,166Z ERROR [org.ovirt.vdsm.jsonrpc.client.reactors.Reactor] (SSL Stomp Reactor) [] Unable to process messages PKIX path building failed: sun.security.provider.certpath.SunCertPathBuilderException: unable to find valid certification path to requested target
2021-05-30 16:22:35,175Z ERROR [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (EE-ManagedScheduledExecutorService-engineScheduledThreadPool-Thread-32) [] EVENT_ID: VDS_BROKER_COMMAND_FAILURE(10,802), VDSM ovirt4.net.miami.edu command Get Host Capabilities failed: PKIX path building failed: sun.security.provider.certpath.SunCertPathBuilderException: unable to find valid certification path to requested target
2021-05-30 16:22:35,175Z ERROR [org.ovirt.engine.core.vdsbroker.monitoring.HostMonitoring] (EE-ManagedScheduledExecutorService-engineScheduledThreadPool-Thread-32) [] Unable to RefreshCapabilities: VDSNetworkException: VDSGenericException: VDSNetworkException: PKIX path building failed: sun.security.provider.certpath.SunCertPathBuilderException: unable to find valid certification path to requested target
2021-05-30 16:22:35,597Z ERROR [org.ovirt.engine.core.bll.gluster.GlusterSyncJob] (EE-ManagedScheduledExecutorService-engineScheduledThreadPool-Thread-23) [] Error while refreshing server data for cluster 'Default' from database: null
I tried reinstalling, rebooting, put it in maintenance, enroll the certificate, check for Upgrades, rebooted multiple times both the hosts and the ovirt engine:
nothing works.
What am I doing wrong?
Thank you in advance for your help.
3 years, 10 months
OSError: [Errno 24] Too many open files
by lejeczek
Hi guys
I'm trying to install HE on a KVM host and installer cannot
get pass this:
[ ERROR ] OSError: [Errno 24] Too many open files
[ ERROR ] fatal: [localhost]: FAILED! => {"msg": "Unexpected
failure during module execution.", "stdout": ""}
[ INFO ] TASK [ovirt.ovirt.hosted_engine_setup : Sync on
engine machine]
[ ERROR ] fatal: [localhost]: FAILED! => {"msg": "Using a
SSH password instead of a key is not possible because Host
Key checking is enabled and sshpass does not support this.
Please add this host's fingerprint to your known_hosts file
to manage this host."}
[ ERROR ] Failed to execute stage 'Closing up': Failed
executing ansible-playbook
[ INFO ] Stage: Clean up
KVM itself should be satisfying in terms of requirements, as
a setup of a HE version from 'master' repo worked previously
on it.
Any ideas & thoughts shared, on what is that cryptic error
message saying and how to troubleshoot I'll very much
appreciate.
many thanks, L.
3 years, 10 months