September 2018 - Users - oVirt List Archives

VM poor iops
by Leo David 01 Mar '19

01 Mar '19

Hi Everyone, I am encountering the following issue on a single instance hyper-converged 4.2 setup. The following fio test was done: fio --randrepeat=1 --ioengine=libaio --direct=1 --gtod_reduce=1 --name=test --filename=test --bs=4k --iodepth=64 --size=4G --readwrite=randwrite The results are very poor doing the test inside of a vm with a prealocated disk on the ssd store: ~2k IOPS Same test done on the oVirt node directly on the mounted ssd_lvm: ~30k IOPS Same test done, this time on the gluster mount path: ~20K IOPS What could be the issue that the vms have this slow hdd performance ( 2k on ssd !! )? Thank you very much ! -- Best regards, Leo David

4 11

Feature: Hosted engine VM management
by Roy Golan 27 Feb '19

27 Feb '19

Hi all, Upcoming in 3.6 is enhancement for managing the hosted engine VM. In short, we want to: * Allow editing the Hosted engine VM, storage domain, disks, networks etc * Have a shared configuration for the hosted engine VM * Have a backup for the hosted engine VM configuration please review and comment on the wiki below: http://www.ovirt.org/Hosted_engine_VM_management Thanks, Roy

2 3

Re: [ovirt-users] Large DWH Database, how to empty
by Matt . 26 Feb '19

26 Feb '19

Hi, OK thanks! I saw that after upgrading to 4.0.5 from 4.0.4 the DB already dropped with around 500MB directly and is now at 2GB smaller. Does this sounds familiar to you with other settings in 4.0.5 ? Thanks, Matt 2017-01-08 10:45 GMT+01:00 Shirly Radco <sradco(a)redhat.com>: > No. That will corrupt your database. > > Are you using the full dwh or the smaller version for the dashboards? > > Please set the delete thresholds to save less data and the data older then > the time you set will be deleted. > Add a file to /ovirt-engine-dwhd.conf.d/ > update_time_to_keep_records.conf > > Add these lines with the new configurations. The numbers represent the hours > to keep the data. > > DWH_TABLES_KEEP_SAMPLES=24 > DWH_TABLES_KEEP_HOURLY=1440 > DWH_TABLES_KEEP_DAILY=43800 > > > These are the configurations for a full dwh. > > The smaller version configurations are: > DWH_TABLES_KEEP_SAMPLES=24 > DWH_TABLES_KEEP_HOURLY=720 > DWH_TABLES_KEEP_DAILY=0 > > The delete process by default at 3am every day (DWH_DELETE_JOB_HOUR=3) > > Best regards, > > Shirly Radco > > BI Software Engineer > Red Hat Israel Ltd. > 34 Jerusalem Road > Building A, 4th floor > Ra'anana, Israel 4350109 > > > On Fri, Jan 6, 2017 at 6:35 PM, Matt . <yamakasi.014(a)gmail.com> wrote: >> >> Hi, >> >> I seem to have some large database for the DWH logging and I wonder >> how I can empty it safely. >> >> Can I just simply empty the database ? >> >> Have a good weekend! >> >> Cheers, >> >> Matt >> _______________________________________________ >> Users mailing list >> Users(a)ovirt.org >> http://lists.ovirt.org/mailman/listinfo/users > >

1 0

Re: [ovirt-users] Packet loss
by Doron Fediuck 26 Feb '19

26 Feb '19

----_com.android.email_640187878761650 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: base64 SGkgS3lsZSzCoApXZSBtYXkgaGF2ZSBzZWVuIHNvbWV0aGluZyBzaW1pbGFyIGluIHRoZSBwYXN0 IGJ1dCBJIHRoaW5rIHRoZXJlIHdlcmUgdmxhbnMgaW52b2x2ZWQuwqAKSXMgaXQgdGhlIHNhbWUg Zm9yIHlvdT/CoApUb255IC8gRGFuLCBkb2VzIGl0IHJpbmcgYSBiZWxsP8Kg ----_com.android.email_640187878761650 Content-Type: text/html; charset=utf-8 Content-Transfer-Encoding: base64 PGh0bWw+PGhlYWQ+PG1ldGEgaHR0cC1lcXVpdj0iQ29udGVudC1UeXBlIiBjb250ZW50PSJ0ZXh0 L2h0bWw7IGNoYXJzZXQ9VVRGLTgiPjwvaGVhZD48Ym9keSA+PGRpdj5IaSBLeWxlLCZuYnNwOzwv ZGl2PjxkaXY+V2UgbWF5IGhhdmUgc2VlbiBzb21ldGhpbmcgc2ltaWxhciBpbiB0aGUgcGFzdCBi dXQgSSB0aGluayB0aGVyZSB3ZXJlIHZsYW5zIGludm9sdmVkLiZuYnNwOzwvZGl2PjxkaXY+SXMg aXQgdGhlIHNhbWUgZm9yIHlvdT8mbmJzcDs8L2Rpdj48ZGl2PlRvbnkgLyBEYW4sIGRvZXMgaXQg cmluZyBhIGJlbGw/Jm5ic3A7PC9kaXY+PC9ib2R5PjwvaHRtbD4= ----_com.android.email_640187878761650--

2 1

Using the web-ui VM portal through a proxy failing
by Callum Smith 15 Feb '19

15 Feb '19

Dear oVirt Gurus, Using the oVirt user VM portal seems to not work through the squid proxy setup (configured as per the guide). The page loads and login works fine through the proxy, but the asynchronous requests just hang. I've attached a screenshot, but you can see the "api" endpoint just hanging in a web inspector: "https://proxyfqdn/ovirt-engine/api/" [cid:CA42E493-3AD9-45F8-B4C3-C914F059390C@well.ox.ac.uk] This works fine when not going through the proxy. Is there a way to force noVNC HTML as the console mode through the web-ui, or at least have it as an option if not default? The console seems not to work when logged in with a base 'user role'. Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. callum(a)well.ox.ac.uk<mailto:callum@well.ox.ac.uk>

2 14

sun.security.validator
by suporte＠logicworks.pt 06 Feb '19

06 Feb '19

Hi, I'm running Version 4.2.3.8-1.el7, and after reboot the engine machine no longer could login into administration portal with this error: sun.security.validator.ValidatorException: PKIX path validation faile java.security.cert.CertPathValidatorException: validity check failed I'm using a self signed cert. Any idea? Thanks -- Jose Ferradeira http://www.logicworks.pt

5 5

Ovirt cluster unstable; gluster to blame (again)
by Jim Kusznir 04 Feb '19

04 Feb '19

hi all: Once again my production ovirt cluster is collapsing in on itself. My servers are intermittently unavailable or degrading, customers are noticing and calling in. This seems to be yet another gluster failure that I haven't been able to pin down. I posted about this a while ago, but didn't get anywhere (no replies that I found). The problem started out as a glusterfsd process consuming large amounts of ram (up to the point where ram and swap were exhausted and the kernel OOM killer killed off the glusterfsd process). For reasons not clear to me at this time, that resulted in any VMs running on that host and that gluster volume to be paused with I/O error (the glusterfs process is usually unharmed; why it didn't continue I/O with other servers is confusing to me). I have 3 servers and a total of 4 gluster volumes (engine, iso, data, and data-hdd). The first 3 are replica 2+arb; the 4th (data-hdd) is replica 3. The first 3 are backed by an LVM partition (some thin provisioned) on an SSD; the 4th is on a seagate hybrid disk (hdd + some internal flash for acceleration). data-hdd is the only thing on the disk. Servers are Dell R610 with the PERC/6i raid card, with the disks individually passed through to the OS (no raid enabled). The above RAM usage issue came from the data-hdd volume. Yesterday, I cought one of the glusterfsd high ram usage before the OOM-Killer had to run. I was able to migrate the VMs off the machine and for good measure, reboot the entire machine (after taking this opportunity to run the software updates that ovirt said were pending). Upon booting back up, the necessary volume healing began. However, this time, the healing caused all three servers to go to very, very high load averages (I saw just under 200 on one server; typically they've been 40-70) with top reporting IO Wait at 7-20%. Network for this volume is a dedicated gig network. According to bwm-ng, initially the network bandwidth would hit 50MB/s (yes, bytes), but tailed off to mostly in the kB/s for a while. All machines' load averages were still 40+ and gluster volume heal data-hdd info reported 5 items needing healing. Server's were intermittently experiencing IO issues, even on the 3 gluster volumes that appeared largely unaffected. Even the OS activities on the hosts itself (logging in, running commands) would often be very delayed. The ovirt engine was seemingly randomly throwing engine down / engine up / engine failed notifications. Responsiveness on ANY VM was horrific most of the time, with random VMs being inaccessible. I let the gluster heal run overnight. By morning, there were still 5 items needing healing, all three servers were still experiencing high load, and servers were still largely unstable. I've noticed that all of my ovirt outages (and I've had a lot, way more than is acceptable for a production cluster) have come from gluster. I still have 3 VMs who's hard disk images have become corrupted by my last gluster crash that I haven't had time to repair / rebuild yet (I believe this crash was caused by the OOM issue previously mentioned, but I didn't know it at the time). Is gluster really ready for production yet? It seems so unstable to me.... I'm looking at replacing gluster with a dedicated NFS server likely FreeNAS. Any suggestions? What is the "right" way to do production storage on this (3 node cluster)? Can I get this gluster volume stable enough to get my VMs to run reliably again until I can deploy another storage solution? --Jim

11 23

Backup & Restore
by suporte＠logicworks.pt 27 Dec '18

27 Dec '18

------=_Part_20329569_1409874801.1519819859027 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit Hi, I'm testing backup & restore on Ovirt 4.2. I follow this doc https://www.ovirt.org/documentation/admin-guide/chap-Backups_and_Migration/ Try to restore to a fresh installation but always get this error message: restore-permissions Preparing to restore: - Unpacking file 'back_file' Restoring: - Files Provisioning PostgreSQL users/databases: - user 'engine', database 'engine' Restoring: FATAL: Can't connect to database 'ovirt_engine_history'. Please see '/usr/bin/engine-backup --help'. On the live engine I run # engine-backup --scope=all --mode=backup --file=file_name --log=log_file_name And try to restore on a fresh installation: # engine-backup --mode=restore --file=file_name --log=log_file_name --provision-db --restore-permissions Any Idea? Thanks -- Jose Ferradeira http://www.logicworks.pt ------=_Part_20329569_1409874801.1519819859027 Content-Type: text/html; charset=utf-8 Content-Transfer-Encoding: quoted-printable <html><body><div style=3D"font-family: trebuchet ms,sans-serif; font-size: = 12pt; color: #000000"><div data-marker=3D"__QUOTED_TEXT__"><div style=3D"fo= nt-family: Times New Roman; font-size: 10pt; color: #000000;" data-mce-styl= e=3D"font-family: Times New Roman; font-size: 10pt; color: #000000;"><div>H= i,<br></div><br><div><span style=3D"font-size: 11pt; font-family: terminal,= monaco;" data-mce-style=3D"font-size: 11pt; font-family: terminal, monaco;= ">I'm testing backup & restore on Ovirt 4.2.</span><br></div><div><span= style=3D"font-size: 11pt; font-family: terminal, monaco;" data-mce-style= =3D"font-size: 11pt; font-family: terminal, monaco;">I follow this doc http= s://www.ovirt.org/documentation/admin-guide/chap-Backups_and_Migration/</sp= an><br></div><div><span style=3D"font-size: 11pt; font-family: terminal, mo= naco;" data-mce-style=3D"font-size: 11pt; font-family: terminal, monaco;">T= ry to restore to a fresh installation but always get this error message:</s= pan><br></div><br><div><span style=3D"font-size: 11pt; font-family: termina= l, monaco;" data-mce-style=3D"font-size: 11pt; font-family: terminal, monac= o;">restore-permissions</span><br><span style=3D"font-size: 11pt; font-fami= ly: terminal, monaco;" data-mce-style=3D"font-size: 11pt; font-family: term= inal, monaco;">Preparing to restore:</span><br><span style=3D"font-size: 11= pt; font-family: terminal, monaco;" data-mce-style=3D"font-size: 11pt; font= -family: terminal, monaco;">- Unpacking file 'back_file'</span><br><span st= yle=3D"font-size: 11pt; font-family: terminal, monaco;" data-mce-style=3D"f= ont-size: 11pt; font-family: terminal, monaco;">Restoring:</span><br><span = style=3D"font-size: 11pt; font-family: terminal, monaco;" data-mce-style=3D= "font-size: 11pt; font-family: terminal, monaco;">- Files</span><br><span s= tyle=3D"font-size: 11pt; font-family: terminal, monaco;" data-mce-style=3D"= font-size: 11pt; font-family: terminal, monaco;">Provisioning PostgreSQL us= ers/databases:</span><br><span style=3D"font-size: 11pt; font-family: termi= nal, monaco;" data-mce-style=3D"font-size: 11pt; font-family: terminal, mon= aco;">- user 'engine', database 'engine'</span><br><span style=3D"font-size= : 11pt; font-family: terminal, monaco;" data-mce-style=3D"font-size: 11pt; = font-family: terminal, monaco;">Restoring:</span><br><span style=3D"font-si= ze: 11pt; font-family: terminal, monaco;" data-mce-style=3D"font-size: 11pt= ; font-family: terminal, monaco;">FATAL: Can't connect to database 'ovirt_e= ngine_history'. Please see '/usr/bin/engine-backup --help'.</span><br></div= ><br><div><span style=3D"font-size: 11pt; font-family: terminal, monaco;" d= ata-mce-style=3D"font-size: 11pt; font-family: terminal, monaco;">On the li= ve engine I run # engine-backup --scope=3Dall --mode=3Dbackup --file= =3Dfile_name --log=3Dlog_file_name</span></div><div><span style=3D"font-siz= e: 11pt; font-family: terminal, monaco;" data-mce-style=3D"font-size: 11pt;= font-family: terminal, monaco;"><br data-mce-bogus=3D"1"></span></div><div= ><span style=3D"font-size: 11pt; font-family: terminal, monaco;" data-mce-s= tyle=3D"font-size: 11pt; font-family: terminal, monaco;">And try to restore= on a fresh installation:</span><br></div><div><span style=3D"font-size: 11= pt; font-family: terminal, monaco;" data-mce-style=3D"font-size: 11pt; font= -family: terminal, monaco;"># engine-backup --mode=3Drestore --file=3Dfile_= name --log=3Dlog_file_name --provision-db --restore-permissions</span></div= ><div><span style=3D"font-size: 11pt; font-family: terminal, monaco;" data-= mce-style=3D"font-size: 11pt; font-family: terminal, monaco;"><br data-mce-= bogus=3D"1"></span></div><div><span style=3D"font-size: 11pt; font-family: = terminal, monaco;" data-mce-style=3D"font-size: 11pt; font-family: terminal= , monaco;">Any Idea?<br data-mce-bogus=3D"1"></span></div><div><span style= =3D"font-size: 11pt; font-family: terminal, monaco;" data-mce-style=3D"font= -size: 11pt; font-family: terminal, monaco;"><br data-mce-bogus=3D"1"></spa= n></div><div><span style=3D"font-size: 11pt; font-family: terminal, monaco;= " data-mce-style=3D"font-size: 11pt; font-family: terminal, monaco;">Thanks= <br data-mce-bogus=3D"1"></span></div><div><span style=3D"font-size: 11pt; = font-family: terminal, monaco;" data-mce-style=3D"font-size: 11pt; font-fam= ily: terminal, monaco;"><br data-mce-bogus=3D"1"></span></div><div><span st= yle=3D"font-size: 11pt; font-family: terminal, monaco;" data-mce-style=3D"f= ont-size: 11pt; font-family: terminal, monaco;"><br data-mce-bogus=3D"1"></= span></div><br><div>-- <br></div><div><hr style=3D"width: 100%; height: 2px= ;" data-mce-style=3D"width: 100%; height: 2px;">Jose Ferradeira<br>http://w= ww.logicworks.pt</div></div><br></div></div></body></html> ------=_Part_20329569_1409874801.1519819859027--

4 16

Out of sync, hosts network config differs from DC
by femi adegoke 02 Dec '18

02 Dec '18

Property: default route, host - true, DC - false I have 4 nics. bond0 = 2 x 10g eno1 = ovirtmgmt eno2 = for vm traffic. eno2 says it's out of sync, hosts network config differs from DC, default route: host - true, DC - false. I have tried "sync all networks" but the message still remains See attached. Where can I look to fix the issue?

3 2

oVirt Node on CentOS 7.5 and AMD EPYC Support
by Tobias Scheinert 30 Nov '18

30 Nov '18

Hi, I am currently building a new virtualization cluster with oVirt, using AMD EPYC processors (AMD EPYC 7351P). At the moment I'm running oVirt Node Version 4.2.3 @ CentOS 7.4.1708. We have the situation that the processor type is recognized as "AMD Opteron G3". With this type of instruction set the VMs are not able to do AES in hardware, this results in poor performance in our case. I found some information that tells me that this problem should be solved with CentOS 7.5 --> <https://access.redhat.com/errata/RHEA-2018:1488> My actual questions: - Are there any further information about the AMD EPYC support? - Any information about an update of the oVirt node to CentOS 7.5? Greeting Tobias

7 10