September 2022 - Users - oVirt List Archives

Hosted-Engine VM wont start after physical CPU change
by douglasddr8＠gmail.com 24 Sep '22

24 Sep '22

I currently have a selfhosted engine with two cascadelake xeon silver 4208 cpu with 8 cores I changed both to two skylake xeon platinum 8160 with 24cores After initialization the vm hosted-engine wont start, I found in the logs that the cpu is not compatible because its not contain the avx512vnni flag Is it possible to work around the problem or these processors cannot be used?

1 0

Certificate expiration
by Joseph Gelinas 23 Sep '22

23 Sep '22

Hi, The certificates on our oVirt stack recently expired, while all the VMs are still up, I can't put the cluster into global maintenance via ovirt-engine, or do anything via ovirt-engine for that matter. Just get event logs about cert validity. VDSM ovirt-1.xxxxx.com command Get Host Capabilities failed: PKIX path validation failed: java.security.cert.CertPathValidatorException: validity check failed VDSM ovirt-2.xxxxx.com command Get Host Capabilities failed: PKIX path validation failed: java.security.cert.CertPathValidatorException: validity check failed VDSM ovirt-3.xxxxx.com command Get Host Capabilities failed: PKIX path validation failed: java.security.cert.CertPathValidatorException: validity check failed Under Compute -> Hosts, all are status Unassigned. Default data center is status Non Responsive. I have tried a couple of solutions to regenerate the certificates without much luck and have copied the originals back in place. https://access.redhat.com/documentation/en-us/red_hat_virtualization/4.3/ht… https://access.redhat.com/solutions/2409751 I have seen things saying running engine-setup will generate new certs, however engine doesn't think the cluster is in global maintenance so won't run that, I believe I can get around the check with `engine-setup --otopi-environment=OVESETUP_CONFIG/continueSetupOnHEVM=bool:True` but is that the right thing to do? Will it deploy the certs on to the hosts as well so things communicate properly? Looks like one is supposed to put a node into maintenance and reenroll it after doing the engine-setup, but will it even be able to put the nodes into maintenance given I can't do anything with them now? Appreciate any ideas.

3 18

QXL vs VGA
by Andrea Chierici 23 Sep '22

23 Sep '22

Dear all, on latest 4.5, as many others have noticed, default is to use video type VGA and VNC as graphics protocol. Problem is, in my current environments, I can't seem to make this to work. When the machine power ups I get the "usual" error from virt-viewer: I found some docs on the web, like this from redhat: https://access.redhat.com/solutions/5695951 but it does not seem to work. Indeed I reinstalled a host and forced the creation of the vm on it, but the console is still unreachable. Any hint how I can solve this? I am quite surprised I could not find anything on standard documentation. Thanks, Andrea -- Andrea Chierici - INFN-CNAF Viale Berti Pichat 6/2, 40127 BOLOGNA Office Tel: +39 051 2095463 SkypeID ataruz --

2 2

Failed to deploy ovirt engine with CLI
by Pablo Olivera 23 Sep '22

23 Sep '22

Hi community, I'm trying to deploy the engine via CLI on an ovirt 4.5.2 (CentOS 8) node over a clean install. Previously I was trying to deploy it via cockpit but I got the following error: [ INFO ] TASK [ovirt.ovirt.hosted_engine_setup : Obtain SSO token using username/password credentials] [ ERROR ] ovirtsdk4.AuthError: Error during SSO authentication access_denied : Cannot authenticate user Invalid user credentials. [ ERROR ] fatal: [localhost]: FAILED! => {"attempts": 50, "changed": false, "msg": "Error during SSO authentication access_denied : Cannot authenticate user Invalid user credentials."} After researching in different threads of this forum, I decided to setup it via CLI by recommendation of different users, since it seems that cockpit hosted-engine deployment is broken. I attach the hosted-engine setup log but I am not sure where is the problem now. Can you help me? Thanks in advance. Pablo.

3 5

oVirt bug reports to move from bugzilla to github issues in future
by Michal Skrivanek 23 Sep '22

23 Sep '22

Hi all, as a final stage of out gerrit to github transition that started ~9 months ago we are planning to eliminate the use of bugzilla.redhat.com for all oVirt projects (bugs with Classification: "oVirt") and use the native issue tracking in github as well. We used to have integrations with gerrit and bugzilla that we moved to github actions instead, and the overhead (and notorious slowness) of bugzilla.redhat.com becomes the only "benefit" of using it these days. There's about 50 bugs total left in oVirt bugzilla so it's not that much to move, the biggest change would be that the new bugs are to be filed elsewhere. This is just a heads up for now, we haven't set a cut off date just yet, but you can expect this change in the coming weeks. Thanks, michal

1 1

oVirt API access when using Keycloak
by jonas＠rabe.ch 22 Sep '22

22 Sep '22

Hello all I just recently installed a fresh oVirt appliance. Now I want to use the API to perform some actions but fail to provide the correct username. Can you help? Here's my script: import ovirtsdk4 as sdk from getpass import getpass dc_name = "Default" clu_name = "Default" connection = sdk.Connection( url="https://ovirt-engine-test.admin.int.rabe.ch/ovirt-engine/api", username="admin@ovirt", password=getpass(), # ca_file="/etc/pki/ovirt-engine/ca.pem", insecure=True, ) ... That's the error I get: Traceback (most recent call last): File "/home/user/ovirt_iso_script/./setup_networks.py", line 25, in <module> dc = next(d for d in dcs_svc.list() if d.name == dc_name) File "/usr/lib64/python3.10/site-packages/ovirtsdk4/services.py", line 6863, in list return self._internal_get(headers, query, wait) File "/usr/lib64/python3.10/site-packages/ovirtsdk4/service.py", line 202, in _internal_get context = self._connection.send(request) File "/usr/lib64/python3.10/site-packages/ovirtsdk4/__init__.py", line 371, in send return self.__send(request) File "/usr/lib64/python3.10/site-packages/ovirtsdk4/__init__.py", line 389, in __send self.authenticate() File "/usr/lib64/python3.10/site-packages/ovirtsdk4/__init__.py", line 382, in authenticate self._sso_token = self._get_access_token() File "/usr/lib64/python3.10/site-packages/ovirtsdk4/__init__.py", line 624, in _get_access_token raise AuthError( ovirtsdk4.AuthError: Error during SSO authentication access_denied : Cannot authenticate user No valid profile found in credentials..

2 1

cancel export task - no qemu-img process on host
by goosesk blabla 22 Sep '22

22 Sep '22

Hi, i tried to find way how to cancel hanged export task. I found that there should be some hidden possibility, but it is still not released and i cannot find how to allow it. i found how to remove this task directly from pg database on engine, but i am not sure if this also will unlock that VM. i cannot kill it by qemu-img process, because this process is not running. stucked VM is ubuntu 20.04 with 3 disks 20 + 80 + 300GB . Already running more than 12 hours and still did not start making ova file (checking in export folder) . qemu-img process on the host is missing . i tried to export smaller VM 7GB and it was successfully done by 2 minutes. During export qemu-img process was running on the host. Thank you for any help. BR jan

1 2

Uncaught exception occurred
by nikkognt＠gmail.com 22 Sep '22

22 Sep '22

Hi all, recently I have upgraded from 4.4 to 4.5, since while a create or edit a new pool an error occurr in a pop up red: "Uncaught exception occurred. Please try reloading the page. Details: (TypeError) : Cannot read properties of undefined (reading 'Kh') Please have your administrator check the UI logs" and the pool is create but with wrong settings. I checked the ui.log file: 2022-09-21 16:33:21,382+02 ERROR [org.ovirt.engine.ui.frontend.server.gwt.OvirtRemoteLoggingService] (default task-23) [] Permutation name: 872EE01DC49AC3F81586694F1584D19D 2022-09-21 16:33:21,382+02 ERROR [org.ovirt.engine.ui.frontend.server.gwt.OvirtRemoteLoggingService] (default task-23) [] Uncaught exception: com.google.gwt.core.client.JavaScriptException: (TypeError) : Cannot read properties of undefined (reading 'Kh') at org.ovirt.engine.ui.common.widget.uicommon.storage.DisksAllocationView.$addDiskList(DisksAllocationView.java:190) at org.ovirt.engine.ui.common.widget.uicommon.storage.DisksAllocationView.$lambda$0(DisksAllocationView.java:179) at org.ovirt.engine.ui.common.widget.uicommon.storage.DisksAllocationView$lambda$0$Type.eventRaised(DisksAllocationView.java:179) at org.ovirt.engine.ui.uicompat.Event.$raise(Event.java:99) at org.ovirt.engine.ui.uicommonweb.models.storage.DisksAllocationModel.$onPropertyChanged(DisksAllocationModel.java:310) at org.ovirt.engine.ui.uicommonweb.models.storage.DisksAllocationModel.$setQuotaEnforcementType(DisksAllocationModel.java:121) at org.ovirt.engine.ui.uicommonweb.models.vms.UnitVmModel.$compatibilityVersionChanged(UnitVmModel.java:2384) at org.ovirt.engine.ui.uicommonweb.models.vms.UnitVmModel.eventRaised(UnitVmModel.java:2223) at org.ovirt.engine.ui.uicompat.Event.$raise(Event.java:99) at org.ovirt.engine.ui.uicommonweb.models.ListModel.$setSelectedItem(ListModel.java:82) at org.ovirt.engine.ui.uicommonweb.builders.vm.CoreVmBaseToUnitBuilder.$postBuild(CoreVmBaseToUnitBuilder.java:35) at org.ovirt.engine.ui.uicommonweb.builders.vm.CoreVmBaseToUnitBuilder.postBuild(CoreVmBaseToUnitBuilder.java:35) at org.ovirt.engine.ui.uicommonweb.builders.CompositeBuilder$LastBuilder.build(CompositeBuilder.java:45) at org.ovirt.engine.ui.uicommonweb.builders.BaseSyncBuilder.build(BaseSyncBuilder.java:13) at org.ovirt.engine.ui.uicommonweb.builders.BaseSyncBuilder.build(BaseSyncBuilder.java:13) at org.ovirt.engine.ui.uicommonweb.builders.vm.IconVmBaseToUnitBuilder.lambda$0(IconVmBaseToUnitBuilder.java:23) at org.ovirt.engine.ui.uicommonweb.builders.vm.IconVmBaseToUnitBuilder$lambda$0$Type.onSuccess(IconVmBaseToUnitBuilder.java:23) at org.ovirt.engine.ui.uicommonweb.models.vms.IconCache.$lambda$0(IconCache.java:57) at org.ovirt.engine.ui.uicommonweb.models.vms.IconCache$lambda$0$Type.onSuccess(IconCache.java:57) at org.ovirt.engine.ui.frontend.Frontend$1.$onSuccess(Frontend.java:239) at org.ovirt.engine.ui.frontend.Frontend$1.onSuccess(Frontend.java:239) at org.ovirt.engine.ui.frontend.communication.OperationProcessor$1.$onSuccess(OperationProcessor.java:133) at org.ovirt.engine.ui.frontend.communication.OperationProcessor$1.onSuccess(OperationProcessor.java:133) at org.ovirt.engine.ui.frontend.communication.GWTRPCCommunicationProvider$5$1.$onSuccess(GWTRPCCommunicationProvider.java:270) at org.ovirt.engine.ui.frontend.communication.GWTRPCCommunicationProvider$5$1.onSuccess(GWTRPCCommunicationProvider.java:270) at com.google.gwt.user.client.rpc.impl.RequestCallbackAdapter.onResponseReceived(RequestCallbackAdapter.java:198) at com.google.gwt.http.client.Request.$fireOnResponseReceived(Request.java:233) at com.google.gwt.http.client.RequestBuilder$1.onReadyStateChange(RequestBuilder.java:409) at Unknown.eval(webadmin-0.js) at com.google.gwt.core.client.impl.Impl.apply(Impl.java:306) at com.google.gwt.core.client.impl.Impl.entry0(Impl.java:345) at Unknown.eval(webadmin-0.js) Any suggestions for resolve the problem? Thank you

2 2

GUI snapshot issue
by Facundo Badaracco 22 Sep '22

22 Sep '22

Hi everyone. I have made several snapshot of my vm, the interface says everything is good. But, when i go to the snapshot section of any vm, there is no snapshot. But, in disk section, all my snapshot are there. Any hint?

1 0

VMs hang periodically: gluster problem?
by Diego Ercolani 22 Sep '22

22 Sep '22

Hello, I have a cluster made by 3 nodes in a "self-hosted-engine" topology. I implemented the storage with gluster implementation in 2 replica + arbiter topology. I have two gluster volumes glen - is the volume used by hosted-engine vm gv0 - is the volume used by VMs The physical disks are 4TB SSD used only to accomodate VMs (also hosted-engine) I have continuos VMs hangs, even hosted-engine, this give full of troubles as I have continuous hangs by hosted-engine and this happen asyncrounosly even while there is management operation on VMs (mobility, cloning...) after a while it happens that the VM is freed but in the VMs I have in console kernel complaining by CPU hang or timer hangs and the solution is only to shutdown/poweroff the VM... even hosted engine in fact it happens that hosted-engine -vm-status give "state=EngineUpBadHealth" This is the log during the event in the host while there is the event: Sep 01 07:48:58 ovirt-node3.ovirt NetworkManager[1923]: <info> [1662018538.0166] device (vnet73): state change: activated -> unmanaged (reason 'unmanaged', sys-iface-state: 'removed') Sep 01 07:48:58 ovirt-node3.ovirt NetworkManager[1923]: <info> [1662018538.0168] device (vnet73): released from master device ovirtmgmt Sep 01 07:48:58 ovirt-node3.ovirt libvirtd[2496]: Unable to read from monitor: Connection reset by peer Sep 01 07:48:58 ovirt-node3.ovirt libvirtd[2496]: internal error: qemu unexpectedly closed the monitor: 2022-09-01T07:48:57.930955Z qemu-kvm: -device virtio-blk-pci,iothread=iothread1,bus=pci.6,addr=0x0,drive=libvirt-1-format,id=ua-0a1a501c-fc45-430f-bfd3-076172cec406,bootindex=1,write-cache=on,serial=0a1a501c-fc45-430f-bfd3-076172cec406,werror=stop,rerror=stop: Failed to get "write" lock Is another process using the image [/run/vdsm/storage/3577c21e-f757-4405-97d1-0f827c9b4e22/0a1a501c-fc45-430f-bfd3-076172cec406/f65dab86-67f1-46fa-87c0-f9076f479741]? Sep 01 07:48:58 ovirt-node3.ovirt kvm[268578]: 5 guests now active Sep 01 07:48:58 ovirt-node3.ovirt systemd[1]: machine-qemu\x2d67\x2dHostedEngine.scope: Succeeded. Sep 01 07:48:58 ovirt-node3.ovirt systemd-machined[1613]: Machine qemu-67-HostedEngine terminated. Sep 01 07:49:08 ovirt-node3.ovirt systemd[1]: NetworkManager-dispatcher.service: Succeeded. Sep 01 07:49:08 ovirt-node3.ovirt ovirt-ha-agent[3338]: ovirt-ha-agent ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine ERROR Engine VM stopped on localhost Sep 01 07:49:14 ovirt-node3.ovirt vdsm[3335]: WARN Failed to retrieve Hosted Engine HA info: timed out Sep 01 07:49:18 ovirt-node3.ovirt vdsm[3335]: WARN Failed to retrieve Hosted Engine HA info: timed out Sep 01 07:49:18 ovirt-node3.ovirt vdsm[3335]: WARN Failed to retrieve Hosted Engine HA info: timed out Sep 01 07:49:28 ovirt-node3.ovirt vdsm[3335]: WARN Failed to retrieve Hosted Engine HA info: timed out Sep 01 07:49:28 ovirt-node3.ovirt sanlock[1633]: 2022-09-01 07:49:28 1706161 [5083]: s4 delta_renew long write time 11 sec Sep 01 07:49:28 ovirt-node3.ovirt sanlock[1633]: 2022-09-01 07:49:28 1706161 [5033]: s3 delta_renew long write time 11 sec Sep 01 07:49:34 ovirt-node3.ovirt libvirtd[2496]: Domain id=65 name='ocr-Brain-28-ovirt.dmz.ssis' uuid=00425fb1-c24b-4eaa-9683-534d66b2cb04 is tainted: custom-ga-command Sep 01 07:49:47 ovirt-node3.ovirt sudo[268984]: root : TTY=unknown ; PWD=/ ; USER=root ; COMMAND=/bin/privsep-helper --privsep_context os_brick.privileged.default --privsep_sock_path /tmp/tmp1iolt06i/privsep.sock This is the indication I have on gluster: [root@ovirt-node3 ~]# gluster volume heal gv0 info Brick ovirt-node2.ovirt:/brickgv0/_gv0 Status: Connected Number of entries: 0 Brick ovirt-node3.ovirt:/brickgv0/gv0_1 Status: Connected Number of entries: 0 Brick ovirt-node4.ovirt:/dati/_gv0 Status: Connected Number of entries: 0 [root@ovirt-node3 ~]# gluster volume heal glen info Brick ovirt-node2.ovirt:/brickhe/_glen Status: Connected Number of entries: 0 Brick ovirt-node3.ovirt:/brickhe/glen Status: Connected Number of entries: 0 Brick ovirt-node4.ovirt:/dati/_glen Status: Connected Number of entries: 0 So it seem healty. I don't know how to address the issue but this is a great problem.

2 19