4.4.9 -> 4.4.10 Cannot start or migrate any VM (hotpluggable cpus requested exceeds the maximum cpus supported by KVM)
by Jillian Morgan
After upgrading the engine from 4.4.9 to 4.4.10, and then upgrading one
host, any attempt to migrate a VM to that host or start a VM on that host
results in the following error:
Number of hotpluggable cpus requested (16) exceeds the maximum cpus
supported by KVM (8)
While the version of qemu is the same across hosts, (
qemu-kvm-6.0.0-33.el8s.x86_64), I traced the difference to the upgraded
kernel on the new host. I have always run elrepo's kernel-ml on these hosts
to support bcache which RHEL's kernel doesn't support. The working hosts
still run kernel-ml-5.15.12. The upgraded host ran kernel-ml-5.17.0.
In case anyone else runs kernel-ml, have you run into this issue?
Does anyone know why KVM's KVM_CAP_MAX_VCPUS value is lowered on the new
kernel?
Does anyone know how to query the KVM capabilities from userspace without
writing a program leveraging kvm_ioctl()'s?
Related to this, it seems that ovirt and/or libvirtd always runs qmu-kvm
with an -smp argument of "maxcpus=16". This causes qemu's built-in check to
fail on the new kernel which is supporting max_vpus of 8.
Why does ovirt always request maxcpus=16?
And yes, before you say it, I know you're going to say that running
kernel-ml isn't supported.
--
Jillian Morgan (she/her) 🏳️⚧️
Systems & Networking Specialist
Primordial Software Group & I.T. Consultancy
https://www.primordial.ca
1 year, 7 months
Out-of-sync networks can only be detached
by Sakhi Hadebe
Hi,
I have a 3-node oVirt cluster. I have configured 2 logical networks:
ovirtmgmt and public. Public logical network is attached in only 2 nodes
and failing to attach on the 3rd node with the below error
Invalid operation, out-of-sync network 'public' can only be detached.
Please have been stuck on this for almost the whole day now. How do I fix
this error?
--
Regards,
Sakhi Hadebe
1 year, 9 months
oVirt + TrueNAS: Unable to create iSCSI domain - I am missing something obvious
by David Johnson
Good morning folks, and thank you in advance.
I am working on migrating my oVirt backing store from NFS to iSCSI.
*oVirt Environment:*
oVirt Open Virtualization Manager
Software Version:4.4.4.7-1.el8
*TrueNAS environment:*
FreeBSD truenas.local 12.2-RELEASE-p11 75566f060d4(HEAD) TRUENAS amd64
The iSCSI share is on a TrueNAS server, exposed to user VDSM and group 36.
oVirt sees the targeted share, but is unable to make use of it.
The latest issue is "Error while executing action New SAN Storage Domain:
Volume Group block size error, please check your Volume Group
configuration, Supported block size is 512 bytes."
As near as I can tell, oVirt does not support any block size other than 512
bytes, while TrueNAS's smallest OOB block size is 4k.
I know that oVirt on TrueNAS is a common configuration, so I expect I am
missing something really obvious here, probably a TrueNAS configuration
needed to make TrueNAS work with 512 byte blocks.
Any advice would be helpful.
*David Johnson*
1 year, 10 months
Network Address Change
by Paul.LKW
Hi All:
I just has a case, I need to change the oVirt host and engine IP address
due to data center decommission I checked in the hosted-engine host
there are some files I could change ;
in ovirt-hosted-engine/hosted-engine.conf
ca_subject="O=simple.com, CN=1.2.3.4"
gateway=1.2.3.254
and of course I need to change the ovirtmgmt interface IP too, I think
just change the above line could do the tick, but where could I change
the other host IP in the cluster ?
I think I have to be lost all the host as once changed the hosted-engine
host IP as it is in diff. sub net.
Does there any command line tools could do that or someone has such
experience could share?
Best Regards,
Paul.LKW
1 year, 11 months
Re: Migrated a VM from VMware to Ovirt using import ova option. Task stuck and running for day 2 now.
by Saif Abu Saleh
+ @users <users(a)ovirt.org>
So it will be more visible to others
Thanks,
Saif
On Wed, Dec 8, 2021 at 3:53 PM Sudhakar S <sudhakar.sankaralingam(a)gmail.com>
wrote:
> Hi Team,
>
> It automatically failed at last.
> Thanks for the help guys.
>
> [image: image.png]
>
> On Wed, Dec 8, 2021 at 6:34 PM Tomáš Golembiovský <tgolembi(a)redhat.com>
> wrote:
>
>> This is another instance of bug
>> https://bugzilla.redhat.com/show_bug.cgi?id=2026809
>>
>> To get rid of the import task you have to:
>>
>> 1) move the host to maintenance
>> 2) restart VDSM with: systemctl restart vdsmd
>> 3) move the host back UP
>> 4) restart engine
>>
>>
>> By the way, the VMware imports finished fine but the OVA is broken and
>> will fail the import.
>>
>> Tomas
>>
>> On Wed, Dec 08, 2021 at 03:27:32PM +0530, Sudhakar S wrote:
>> > Hi Team,
>> >
>> > not intentionally, did mistakenly.
>> >
>> > Please find attached /var/log/vdsm/import FYR.
>> >
>> > On Wed, Dec 8, 2021 at 3:12 PM Arik Hadas <ahadas(a)redhat.com> wrote:
>> >
>> > > Hi,
>> > > Did you intentionally send it directly to me and not to the list?
>> > > Adding Saif and Tomas
>> > > But I think they'll need the import logs (AFAIR in
>> /var/log/vdsm/import)
>> > > to investigate it further
>> > >
>> > > On Wed, Dec 8, 2021 at 11:34 AM Sudhakar S <
>> > > sudhakar.sankaralingam(a)gmail.com> wrote:
>> > >
>> > >> HI Team,
>> > >>
>> > >> Please find the logs FYR.
>> > >>
>> > >> I just want to kill this task alone since it is creating 7000 events
>> per
>> > >> day. Please give me commands to stop this task.
>> > >>
>> > >> [image: image.png]
>> > >>
>> > >> Regards,
>> > >> Sudhakar
>> > >>
>> > >> On Wed, Dec 8, 2021 at 2:07 PM Arik Hadas <ahadas(a)redhat.com> wrote:
>> > >>
>> > >>>
>> > >>>
>> > >>> On Wed, Dec 8, 2021 at 8:44 AM <sudhakar.sankaralingam(a)gmail.com>
>> wrote:
>> > >>>
>> > >>>> 1. First node that was upgraded now continuously logs this error:
>> > >>>> VDSM <node name> command Get Host Statistics failed: Internal
>> JSON-RPC
>> > >>>> error:
>> > >>>> {'reason': "'str' object has no attribute 'decode'"}
>> > >>>> 2. During import of two virtual machines from VMware (has done
>> multible
>> > >>>> before upgrade)
>> > >>>> the import seems to never finish - currently running on day 2-3...
>> any
>> > >>>> clues how to fix
>> > >>>> this?
>> > >>>>
>> > >>>> Events VDSM <node name> command Get Host Statistics failed:
>> Internal
>> > >>>> JSON-RPC error:
>> > >>>> {'reason': "'str' object has no attribute 'decode'"} running for 2
>> days
>> > >>>> every 20 seconds. Please suggest
>> > >>>>
>> > >>>
>> > >>> Looks like [1], however in that report the import process succeeded
>> > >>> despite those errors.
>> > >>> Can you please share the import logs?
>> > >>>
>> > >>> [1] https://bugzilla.redhat.com/show_bug.cgi?id=2026809
>> > >>>
>> > >>>
>> > >>>> _______________________________________________
>> > >>>> Users mailing list -- users(a)ovirt.org
>> > >>>> To unsubscribe send an email to users-leave(a)ovirt.org
>> > >>>> Privacy Statement: https://www.ovirt.org/privacy-policy.html
>> > >>>> oVirt Code of Conduct:
>> > >>>> https://www.ovirt.org/community/about/community-guidelines/
>> > >>>> List Archives:
>> > >>>>
>> https://lists.ovirt.org/archives/list/users@ovirt.org/message/37YK2JLWZQX...
>> > >>>>
>> > >>>
>>
>>
>>
>>
>>
>>
>>
>>
>> --
>> Tomáš Golembiovský <tgolembi(a)redhat.com>
>>
>>
1 year, 11 months
what happens to vms when a host shutdowns?
by Nathanaël Blanchet
Hi,
We are installing UPS powerchute client on hypervisors.
What is the default vms behaviour of running vms when an hypervisor is
ordered to shutdown: do the vms live migrate or do they shutdown
properly (even the restart on an other host because of HA) ?
--
Nathanaël Blanchet
Supervision réseau
SIRE
227 avenue Professeur-Jean-Louis-Viala
34193 MONTPELLIER CEDEX 5
Tél. 33 (0)4 67 54 84 55
Fax 33 (0)4 67 54 84 14
blanchet(a)abes.fr
2 years
[4.4.7] The latest guest agent needs to be installed and running on the guest
by Christoph Timm
Hi list,
I have upgraded my oVirt from 4.4.4 to 4.4.7 and notice that some VMs
have issues to report the performance information via the guest agent.
I see the following message on the VMs: The latest guest agent needs to
be installed and running on the guest
The qemu-guest agent is installed on VMs (the VM OS is CentOS 8).
Any advice how to troubleshoot this?
I can see the stats will be presented in the GUI, if I migrate the VM to
a different host.
Best regards
Christoph
2 years
VM Disk extend not reflected in VM oS
by simon@justconnect.ie
Hi All,
I extended a VM disk via oVirt Manager (4.4.8) without error, but when I checked the VM (fdisk & lsblk) the size has not changed.
Any help as always would be appreciated.
Kind Regards
Simon...
2 years, 1 month
Certificate expiration
by Joseph Gelinas
Hi,
The certificates on our oVirt stack recently expired, while all the VMs are still up, I can't put the cluster into global maintenance via ovirt-engine, or do anything via ovirt-engine for that matter. Just get event logs about cert validity.
VDSM ovirt-1.xxxxx.com command Get Host Capabilities failed: PKIX path validation failed: java.security.cert.CertPathValidatorException: validity check failed
VDSM ovirt-2.xxxxx.com command Get Host Capabilities failed: PKIX path validation failed: java.security.cert.CertPathValidatorException: validity check failed
VDSM ovirt-3.xxxxx.com command Get Host Capabilities failed: PKIX path validation failed: java.security.cert.CertPathValidatorException: validity check failed
Under Compute -> Hosts, all are status Unassigned. Default data center is status Non Responsive.
I have tried a couple of solutions to regenerate the certificates without much luck and have copied the originals back in place.
https://access.redhat.com/documentation/en-us/red_hat_virtualization/4.3/...
https://access.redhat.com/solutions/2409751
I have seen things saying running engine-setup will generate new certs, however engine doesn't think the cluster is in global maintenance so won't run that, I believe I can get around the check with `engine-setup --otopi-environment=OVESETUP_CONFIG/continueSetupOnHEVM=bool:True` but is that the right thing to do? Will it deploy the certs on to the hosts as well so things communicate properly? Looks like one is supposed to put a node into maintenance and reenroll it after doing the engine-setup, but will it even be able to put the nodes into maintenance given I can't do anything with them now?
Appreciate any ideas.
2 years, 2 months