October 2017 - Users - oVirt List Archives

Ovirt 4.2 Fencing problems
by Maton, Brett 10 Oct '17

10 Oct '17

Hi, I recently upgraded my testlab to oVirt 4.2-pre and have been having some issues. One of which is fencing with Dell Drac 8 remote access cards. The configuration I'm using works fine on my other (4.1.6) cluster... Error: Health check on Host host.example.com indicates that future attempts to Start this host using Power-Management are expected to fail. Fence agent setup for the host appears to be working though: Which logs would be helpful in debugging this issue ? Regards, Brett

2 7

VM-Clone issue
by Shamam Amir 09 Oct '17

09 Oct '17

Dear All, We are using oVirt release *Version 4.1.5.2. *I was trying to make a clone of a VM but I got the following message* "*Error while executing action CloneVm: Internal Engine Error", I found that when cloud-init is used in the vm's edit section, this error occurs, but when the vm is configured without cloud init, the clone was completed successfully. I wondered if this is a bug or something else? The log file is in this link https://paste.fedoraproject.org/paste/f~sHprL93BNZ-iJzoFDyww Best Regards

2 1

Dead hosts
by Maton, Brett 09 Oct '17

09 Oct '17

Hi, I've replaced some hardware, and didn't remote 'hosted engine deploy' before retireing the servers. (actually I failed to remove them properly...) How can I get rid of the 'old' hosts from the output of hosted-engine --vm-status ? oVirt 4.2 - pre centOS 7.4 Regards, Brett

2 2

Engine crash, storage won't activate, hosts won't shutdown, template locked, gpu passthrough failed
by M R 09 Oct '17

09 Oct '17

Hello! I have been using Ovirt for last four weeks, testing and trying to get things working. I have collected here the problems I have found and this might be a bit long but help to any of these or maybe to all of them from several people would be wonderful. My version is ovirt node 4.1.5 and 4.1.6 downloaded from website latest stable release at the time. Also tested with CentOS minimal +ovirt repo. In this case, 3. is solved, but other problems persist. 1. Power off host First day after installing ovirt node, it was able to reboot and shutdown clean. No problems at all. After few days of using ovir, I have noticed that hosts are unable to shutdown. I have tested this in several different ways and come to the following conclusion. IF engine has not been started after boot, all hosts are able to shutdown clean. But if engine is started even once, none of the hosts are able to shutdown anymore. The only way to get power off is to unplug or press power button for a longer time as hard reset. I have failed to find a way to have the engine running and then shutdown host. This effects to all hosts in the cluster. 2. Glusterfs failed Every time I have booted hosts, glusterfs has failed. For some reason, it turns inactive state even if I have setup systemctl enable glusterd. Before this command it was just inactive. After this command, it will say "failed (inactive). There is still a way to get glusterfs working. I have to give command systemctl start glusterd manually and everything starts working. Why do I have to give manual commands to start glusterfs? I have used this for CentOS before and never had this problem before. Node installer is that much different from the CentOS core? 3. Epel As I said that I have used CentOS before, I would like to able to install some packets from repo. But even if I install epel-release, it won't find packets such as nano or htop. I have read about how to add epel-release to ovirt node from here: https://www.ovirt.org/release/4.1.1/#epel I have tested even manually edit repolist, but it will fail to find normal epel packets. I have setup additional exclude=collectd* as guided in the link above. This doesn't make any difference. All being said I am able to install manually packets which are downloaded with other CentOS machine and transferred with scp to ovirt node. Still, this once again needs a lot of manual input and is just a workaround for the bug. 4. Engine startup When I try to start the engine when glusterfs is up, it will say vm doesn't exist, starting up. Still, it won't startup automatically. I have to give several times command hosted-engine --vm-start. I wait for about 5minutes until I give it next time. This will take usually about 30minutes and then randomly. Completely randomly after one of the times, I give this command engine shoots up and is up in 1minute. This has happened every time I boot up. And the times that I have to give a command to start the engine, has been changing. At best it's been 3rd time at worst it has been 7th time. Calculating from there it might take from 15minutes to 35minutes to get the engine up.Nevertheless, it will eventually come up every time. If there is a way to get it up on the first try or even better, automatically up, it would be great. 5. Activate storage Once the engine is up, there has been a problem with storage. When I go to storage tab, it will show all sources red. Even if I wait for 15~20minutes, it won't get storage green itself. I have to go and press active button from main data storage. Then it will get main storage up in 2~3munutes.Sometimes it fails it once, but will definitely get main data storage up on the seconds try. And then magically at the same time all other storages instantly go green. Main storage is glusterfs and I have 3 NFS storages as well. This is only a problem when starting up and once storages are on green they stay green. Still annoying that it cannot get it done by itself. 6.Template locked I try to create a template from existing VM and it resulted in original VM going into locked state and template being locked. I have read that some other people had a similar problem and they were suggested to restart engine to see if it solves it. For me it has been now a week and several restarts of engine and hosts, but there is still one VM locked and template locked as well. This is not a big problem, but still a problem. Everything is grey and cannot delete this bugged VM or template. 7. unable to use GPU I have been trying to do GPU passthrough with my VM. First, there was a problem with qemu cmd line, but once I figure out a way to get commands, it maybe is working(?). Log shows up fine, but it still doesn't give functionality I¨m looking for. As I mentioned in the other email that I have found this: https://www.mail-archive.com/users@ovirt.org/msg40422.html . It will give right syntax in log, but still, won't fix error 43 with nvidia drivers. If anybody got this working or has ideas how to do it, would really like to know how it's done properly. I have also tested with AMD graphics cards such as vega, but as soon as drivers have installed, I will get a black screen. Even if I restart VM or hosts or both. I will only see black screen and unable to use VM at all. I might be able to live with the other six things listed above, but this one is a bit of a problem for me. My use of VMs will eventually need graphical performance and therefore I will have to get this working or find an alternative to ovirt..I have found several things that I really like in ovirt and would prefer to use it. Best regards Mikko <https://www.avast.com/sig-email?utm_medium=email&utm_source=link&utm_campai…> Ei viruksia. www.avast.com <https://www.avast.com/sig-email?utm_medium=email&utm_source=link&utm_campai…> <#DAB4FAD8-2DD7-40BB-A1B8-4E2AA1F9FDF2>

3 2

Help - Host unresponsive!
by Wesley Stewart 09 Oct '17

09 Oct '17

I have a single server that is running ovirt as a host and a storage domain. I ran an update when the GUI told me that there was one. I shutdown all of the VM's, placed the host in maintenance mode and did the update. Upon rebooting the server, I was able to pull my VM's online without any issue, but then, all of a sudden, vm's stopped responding as well as the host. The host is showing as "Down" and I cannot "Activate" it. Also "Maintenance Mode" is greyed out. Storage pools are "down" but eventually come back online, but the host still will not come online. Every few minutes, the host goes to "UP" right before failing and going back "Down". To make it more confusing, like I mentioned before, this is all in one box, so it isn't like their is a network issue between the storage and the host boxes or anything. Currently trying to reinstall the host and see if that helps, but I would very much appreciate any sort of guidance, support or ideas! Fencing failed on Storage Pool Manager OVIRT-Host for Data Center OVIRT-Datacenter. Setting status to Non-Operational. Host OVIRT-Host failed to recover.

2 5

VM remote noVNC console
by Alex K 06 Oct '17

06 Oct '17

Hi all, I am trying to get the VM console of a VM through SSH socks proxy. This is a scenario I will frequently face, as the ovirt cluster will be available only though a remote SSH tunnel. I am trying several console options without success. With SPICE or VNC I get issue with virt-viewer saying "Unable to connect to libvirt with URI [none]' With noVNC I get a separate tab on browser where it is stuck showing "loading". Has anyone success with this kind of remote console access? Thanx, Alex

3 5

Debugging warning messages about bonding mode 4
by Gianluca Cecchi 06 Oct '17

06 Oct '17

Hello, on a 2 nodes cluster in 4.1.6 I have this situation. Every node has 3 bonds, each one composed by 2 network adapters and each one of type mode=4 (actually in setup networks I have configured custom and then the value: "mode=4 miimon=100" ) At this moment only one of the servers has access to FC storage, while the other is currently on maintenance. On 2 of the 3 bonds of the active server I get an exclamation point in "Network Interfaces" subtab with this mouseover popup Bond is in link aggregation mode (mode 4), but no partner mac has been reported for it What is the exact meaning of this message? Do I have to care about (I think so..)? What should I report to network guys? Eg, one of these two warning bonds status is: # cat /proc/net/bonding/bond2 Ethernet Channel Bonding Driver: v3.7.1 (April 27, 2011) Bonding Mode: IEEE 802.3ad Dynamic link aggregation Transmit Hash Policy: layer2 (0) MII Status: up MII Polling Interval (ms): 100 Up Delay (ms): 0 Down Delay (ms): 0 802.3ad info LACP rate: slow Min links: 0 Aggregator selection policy (ad_select): stable System priority: 65535 System MAC address: 48:df:37:0c:7f:5a Active Aggregator Info: Aggregator ID: 5 Number of ports: 2 Actor Key: 9 Partner Key: 6 Partner Mac Address: b8:38:61:9c:75:80 Slave Interface: ens2f2 MII Status: up Speed: 1000 Mbps Duplex: full Link Failure Count: 2 Permanent HW addr: 48:df:37:0c:7f:5a Slave queue ID: 0 Aggregator ID: 5 Actor Churn State: none Partner Churn State: none Actor Churned Count: 2 Partner Churned Count: 3 details actor lacp pdu: system priority: 65535 system mac address: 48:df:37:0c:7f:5a port key: 9 port priority: 255 port number: 1 port state: 61 details partner lacp pdu: system priority: 32768 system mac address: b8:38:61:9c:75:80 oper key: 6 port priority: 32768 port number: 293 port state: 61 Slave Interface: ens2f3 MII Status: up Speed: 1000 Mbps Duplex: full Link Failure Count: 2 Permanent HW addr: 48:df:37:0c:7f:5b Slave queue ID: 0 Aggregator ID: 5 Actor Churn State: none Partner Churn State: none Actor Churned Count: 0 Partner Churned Count: 3 details actor lacp pdu: system priority: 65535 system mac address: 48:df:37:0c:7f:5a port key: 9 port priority: 255 port number: 2 port state: 61 details partner lacp pdu: system priority: 32768 system mac address: b8:38:61:9c:75:80 oper key: 6 port priority: 32768 port number: 549 port state: 61 Also, the other node (that is currently in maintenance) shows one of the 2 interfaces of bond2 (ens2f2) as down (red arrow) but on the host # ip link show ens2f2 6: ens2f2: <BROADCAST,MULTICAST,SLAVE,UP,LOWER_UP> mtu 1500 qdisc mq master bond2 state UP mode DEFAULT qlen 1000 link/ether 48:df:37:0c:85:4e brd ff:ff:ff:ff:ff:ff # Does this depend on the host being in maintenance? Perhaps when a host is in maintenance, the warnings on it are not checked/updated again from engine? Thanks in advance, Gianluca

2 1

Unable to grant permissions to AD users
by Michael Watters 06 Oct '17

06 Oct '17

I'm having some issues granting permissions to AD users in ovirt-engine 4.1. Users can log in but receive an error as below. The user user@example.com@example.com is not authorized to perform login I am also not able to grant this user any permissions through the admin console. Entering a user name in the search field for the System Permissions section results in a blank list. Attached is a screenshot for reference. Does anybody have an idea on what would cause this? The log files aren't very useful and don't show any errors.

2 3

VDI whit Ovirt Spice and NVIDIA passtrought
by codignotto . 06 Oct '17

06 Oct '17

Good night friends, I'm trying to make the passtrought GPU run using OVIRT with an NVIDIA M2000 / K2200 and a K4000, I made several configurations, the VM with windows recognizes the card, I install the NVIDIA drivers and after restarting the windows and access using the client SPICE remote viwer the mouse to work, everything works because the mouse does not work for anything, already tested with Win8, Win10 and the problem persists, the card is recognized but the mouse does not work. Has anyone used it this way? Passtrought from GPU to a VM using SPICE? I'm going to use these clients with 3d and adobe products. Thank you for your help Deny

1 0

IOPS stats/reports from all hosts to storage
by Neil 05 Oct '17

05 Oct '17

Hi guys, I'm running FC storage with 4 hosts on oVirt 3.6 and we've been having some IOPS issues recently and the SAN provider has asked me to provide them with the following info... Datastore Stripe Size Default VM Disk Stripe Size Average IO Size Average THROUGHPUT (MB/s) Average IOPS Maximum IOPS Read/Write Percentage of IO Datastore Average Latency VM Disk Average Latency All of this is from across all hosts and VM's to the storage domain. Is there any way to get this kind of info from oVirt? I've been looking at oVirt-reports but I don't see much as far as IO/throughput reporting goes. Apologies if I've missed something obvious. Thanks. Regards. Neil Wilson.

3 6