sun.security.validator
by suporte@logicworks.pt
Hi,
I'm running Version 4.2.3.8-1.el7, and after reboot the engine machine no longer could login into administration portal with this error:
sun.security.validator.ValidatorException: PKIX path validation faile
java.security.cert.CertPathValidatorException: validity check failed
I'm using a self signed cert.
Any idea?
Thanks
--
Jose Ferradeira
http://www.logicworks.pt
5 years, 9 months
Ovirt cluster unstable; gluster to blame (again)
by Jim Kusznir
hi all:
Once again my production ovirt cluster is collapsing in on itself. My
servers are intermittently unavailable or degrading, customers are noticing
and calling in. This seems to be yet another gluster failure that I
haven't been able to pin down.
I posted about this a while ago, but didn't get anywhere (no replies that I
found). The problem started out as a glusterfsd process consuming large
amounts of ram (up to the point where ram and swap were exhausted and the
kernel OOM killer killed off the glusterfsd process). For reasons not
clear to me at this time, that resulted in any VMs running on that host and
that gluster volume to be paused with I/O error (the glusterfs process is
usually unharmed; why it didn't continue I/O with other servers is
confusing to me).
I have 3 servers and a total of 4 gluster volumes (engine, iso, data, and
data-hdd). The first 3 are replica 2+arb; the 4th (data-hdd) is replica
3. The first 3 are backed by an LVM partition (some thin provisioned) on
an SSD; the 4th is on a seagate hybrid disk (hdd + some internal flash for
acceleration). data-hdd is the only thing on the disk. Servers are Dell
R610 with the PERC/6i raid card, with the disks individually passed through
to the OS (no raid enabled).
The above RAM usage issue came from the data-hdd volume. Yesterday, I
cought one of the glusterfsd high ram usage before the OOM-Killer had to
run. I was able to migrate the VMs off the machine and for good measure,
reboot the entire machine (after taking this opportunity to run the
software updates that ovirt said were pending). Upon booting back up, the
necessary volume healing began. However, this time, the healing caused all
three servers to go to very, very high load averages (I saw just under 200
on one server; typically they've been 40-70) with top reporting IO Wait at
7-20%. Network for this volume is a dedicated gig network. According to
bwm-ng, initially the network bandwidth would hit 50MB/s (yes, bytes), but
tailed off to mostly in the kB/s for a while. All machines' load averages
were still 40+ and gluster volume heal data-hdd info reported 5 items
needing healing. Server's were intermittently experiencing IO issues, even
on the 3 gluster volumes that appeared largely unaffected. Even the OS
activities on the hosts itself (logging in, running commands) would often
be very delayed. The ovirt engine was seemingly randomly throwing engine
down / engine up / engine failed notifications. Responsiveness on ANY VM
was horrific most of the time, with random VMs being inaccessible.
I let the gluster heal run overnight. By morning, there were still 5 items
needing healing, all three servers were still experiencing high load, and
servers were still largely unstable.
I've noticed that all of my ovirt outages (and I've had a lot, way more
than is acceptable for a production cluster) have come from gluster. I
still have 3 VMs who's hard disk images have become corrupted by my last
gluster crash that I haven't had time to repair / rebuild yet (I believe
this crash was caused by the OOM issue previously mentioned, but I didn't
know it at the time).
Is gluster really ready for production yet? It seems so unstable to
me.... I'm looking at replacing gluster with a dedicated NFS server likely
FreeNAS. Any suggestions? What is the "right" way to do production
storage on this (3 node cluster)? Can I get this gluster volume stable
enough to get my VMs to run reliably again until I can deploy another
storage solution?
--Jim
5 years, 9 months
Backup & Restore
by suporte@logicworks.pt
------=_Part_20329569_1409874801.1519819859027
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: 7bit
Hi,
I'm testing backup & restore on Ovirt 4.2.
I follow this doc https://www.ovirt.org/documentation/admin-guide/chap-Backups_and_Migration/
Try to restore to a fresh installation but always get this error message:
restore-permissions
Preparing to restore:
- Unpacking file 'back_file'
Restoring:
- Files
Provisioning PostgreSQL users/databases:
- user 'engine', database 'engine'
Restoring:
FATAL: Can't connect to database 'ovirt_engine_history'. Please see '/usr/bin/engine-backup --help'.
On the live engine I run # engine-backup --scope=all --mode=backup --file=file_name --log=log_file_name
And try to restore on a fresh installation:
# engine-backup --mode=restore --file=file_name --log=log_file_name --provision-db --restore-permissions
Any Idea?
Thanks
--
Jose Ferradeira
http://www.logicworks.pt
------=_Part_20329569_1409874801.1519819859027
Content-Type: text/html; charset=utf-8
Content-Transfer-Encoding: quoted-printable
<html><body><div style=3D"font-family: trebuchet ms,sans-serif; font-size: =
12pt; color: #000000"><div data-marker=3D"__QUOTED_TEXT__"><div style=3D"fo=
nt-family: Times New Roman; font-size: 10pt; color: #000000;" data-mce-styl=
e=3D"font-family: Times New Roman; font-size: 10pt; color: #000000;"><div>H=
i,<br></div><br><div><span style=3D"font-size: 11pt; font-family: terminal,=
monaco;" data-mce-style=3D"font-size: 11pt; font-family: terminal, monaco;=
">I'm testing backup & restore on Ovirt 4.2.</span><br></div><div><span=
style=3D"font-size: 11pt; font-family: terminal, monaco;" data-mce-style=
=3D"font-size: 11pt; font-family: terminal, monaco;">I follow this doc http=
s://www.ovirt.org/documentation/admin-guide/chap-Backups_and_Migration/</sp=
an><br></div><div><span style=3D"font-size: 11pt; font-family: terminal, mo=
naco;" data-mce-style=3D"font-size: 11pt; font-family: terminal, monaco;">T=
ry to restore to a fresh installation but always get this error message:</s=
pan><br></div><br><div><span style=3D"font-size: 11pt; font-family: termina=
l, monaco;" data-mce-style=3D"font-size: 11pt; font-family: terminal, monac=
o;">restore-permissions</span><br><span style=3D"font-size: 11pt; font-fami=
ly: terminal, monaco;" data-mce-style=3D"font-size: 11pt; font-family: term=
inal, monaco;">Preparing to restore:</span><br><span style=3D"font-size: 11=
pt; font-family: terminal, monaco;" data-mce-style=3D"font-size: 11pt; font=
-family: terminal, monaco;">- Unpacking file 'back_file'</span><br><span st=
yle=3D"font-size: 11pt; font-family: terminal, monaco;" data-mce-style=3D"f=
ont-size: 11pt; font-family: terminal, monaco;">Restoring:</span><br><span =
style=3D"font-size: 11pt; font-family: terminal, monaco;" data-mce-style=3D=
"font-size: 11pt; font-family: terminal, monaco;">- Files</span><br><span s=
tyle=3D"font-size: 11pt; font-family: terminal, monaco;" data-mce-style=3D"=
font-size: 11pt; font-family: terminal, monaco;">Provisioning PostgreSQL us=
ers/databases:</span><br><span style=3D"font-size: 11pt; font-family: termi=
nal, monaco;" data-mce-style=3D"font-size: 11pt; font-family: terminal, mon=
aco;">- user 'engine', database 'engine'</span><br><span style=3D"font-size=
: 11pt; font-family: terminal, monaco;" data-mce-style=3D"font-size: 11pt; =
font-family: terminal, monaco;">Restoring:</span><br><span style=3D"font-si=
ze: 11pt; font-family: terminal, monaco;" data-mce-style=3D"font-size: 11pt=
; font-family: terminal, monaco;">FATAL: Can't connect to database 'ovirt_e=
ngine_history'. Please see '/usr/bin/engine-backup --help'.</span><br></div=
><br><div><span style=3D"font-size: 11pt; font-family: terminal, monaco;" d=
ata-mce-style=3D"font-size: 11pt; font-family: terminal, monaco;">On the li=
ve engine I run # engine-backup --scope=3Dall --mode=3Dbackup --file=
=3Dfile_name --log=3Dlog_file_name</span></div><div><span style=3D"font-siz=
e: 11pt; font-family: terminal, monaco;" data-mce-style=3D"font-size: 11pt;=
font-family: terminal, monaco;"><br data-mce-bogus=3D"1"></span></div><div=
><span style=3D"font-size: 11pt; font-family: terminal, monaco;" data-mce-s=
tyle=3D"font-size: 11pt; font-family: terminal, monaco;">And try to restore=
on a fresh installation:</span><br></div><div><span style=3D"font-size: 11=
pt; font-family: terminal, monaco;" data-mce-style=3D"font-size: 11pt; font=
-family: terminal, monaco;"># engine-backup --mode=3Drestore --file=3Dfile_=
name --log=3Dlog_file_name --provision-db --restore-permissions</span></div=
><div><span style=3D"font-size: 11pt; font-family: terminal, monaco;" data-=
mce-style=3D"font-size: 11pt; font-family: terminal, monaco;"><br data-mce-=
bogus=3D"1"></span></div><div><span style=3D"font-size: 11pt; font-family: =
terminal, monaco;" data-mce-style=3D"font-size: 11pt; font-family: terminal=
, monaco;">Any Idea?<br data-mce-bogus=3D"1"></span></div><div><span style=
=3D"font-size: 11pt; font-family: terminal, monaco;" data-mce-style=3D"font=
-size: 11pt; font-family: terminal, monaco;"><br data-mce-bogus=3D"1"></spa=
n></div><div><span style=3D"font-size: 11pt; font-family: terminal, monaco;=
" data-mce-style=3D"font-size: 11pt; font-family: terminal, monaco;">Thanks=
<br data-mce-bogus=3D"1"></span></div><div><span style=3D"font-size: 11pt; =
font-family: terminal, monaco;" data-mce-style=3D"font-size: 11pt; font-fam=
ily: terminal, monaco;"><br data-mce-bogus=3D"1"></span></div><div><span st=
yle=3D"font-size: 11pt; font-family: terminal, monaco;" data-mce-style=3D"f=
ont-size: 11pt; font-family: terminal, monaco;"><br data-mce-bogus=3D"1"></=
span></div><br><div>-- <br></div><div><hr style=3D"width: 100%; height: 2px=
;" data-mce-style=3D"width: 100%; height: 2px;">Jose Ferradeira<br>http://w=
ww.logicworks.pt</div></div><br></div></div></body></html>
------=_Part_20329569_1409874801.1519819859027--
5 years, 11 months
Out of sync, hosts network config differs from DC
by femi adegoke
Property: default route, host - true, DC - false
I have 4 nics.
bond0 = 2 x 10g
eno1 = ovirtmgmt
eno2 = for vm traffic.
eno2 says it's out of sync, hosts network config differs from DC,
default route: host - true, DC - false.
I have tried "sync all networks" but the message still remains
See attached.
Where can I look to fix the issue?
5 years, 11 months
oVirt Node on CentOS 7.5 and AMD EPYC Support
by Tobias Scheinert
Hi,
I am currently building a new virtualization cluster with oVirt, using
AMD EPYC processors (AMD EPYC 7351P). At the moment I'm running oVirt
Node Version 4.2.3 @ CentOS 7.4.1708.
We have the situation that the processor type is recognized as "AMD
Opteron G3". With this type of instruction set the VMs are not able to
do AES in hardware, this results in poor performance in our case.
I found some information that tells me that this problem should be
solved with CentOS 7.5
--> <https://access.redhat.com/errata/RHEA-2018:1488>
My actual questions:
- Are there any further information about the AMD EPYC support?
- Any information about an update of the oVirt node to CentOS 7.5?
Greeting Tobias
5 years, 12 months
VLAN tagging with external provider networks
by anurag.porripireddi@bigswitch.com
Hi,
I see that checking the "External provider" box whilst creating networks clears and grays out VLAN tagging. Is VLAN tagging for external provider networks not supported?
Thanks,
Anurag
6 years, 1 month
Info on HCI single host
by Gianluca Cecchi
Hello,
following this reference guide:
https://www.ovirt.org/documentation/gluster-hyperconverged/chap-Single_no...
after having run with success the gdeploy based gluster setup, there is
Setting up Hosted Engine
Use the Ansible based installation flow of Hosted Engine to set up oVirt
within a virtual machine. The storage details should be provided as type:
glusterfs and connection path as: <hostname>:/engine (Replace hostname with
address of host on which installation is carried out)
What does exactly mean "Use the Ansible based instalation" ? Does it mean
using cockpit web ui? In this casae I suppose I have to choose:
Hosted Engine
Deploy oVirt hosted engine on storage that has already been provisioned
Correct?
Thanks,
Gianluca
6 years, 1 month
[OT] transferring kvm raw mage from filesystem to lvm based
by Gianluca Cecchi
Hello,
I have a windows 7 VM that is composed by a raw disk on filesystem on
Fedora 28.
I would like to transfer this VM to another environment, always based on
same version of Fedora 28, but where the storage is configured in
virt-manager as lvm based, so similar to what happens in oVirt block based
storage domains.
I'm trying to figure how to transfer the image.
I presume I can create a new LV and then leave some bytes for the LVM
header into the created LV and then make a sort of dd with offset?
Anyone has any hint?
Thanks in advance,
Gianluca
6 years, 1 month
Slow vm transfer speed from vmware esxi 5
by Bernhard Dick
Hi,
currently I'm trying to move VMs from our vsphere 5 environment to
oVirt. While the io performance on oVirt and on the esxi platform is
quite well (about 100MByte/sec on a 1GBit storage link) the transfer
speed using the integrated v2v-feature is very slow (only 10MByte/sec).
That would result in transfer time of >24h for some machines.
Do you have any ideas how I can improve the transfer speed?
Regards
Bernhard Dick
6 years, 2 months
Error installing ovirt node
by Junaid Jadoon
Dear All,
I have Ovirt engine 4.2 and node version is 4.2.
After installing node in in ovirt engine when i try to install node it
gives following error
14:25:37,410+05 ERROR
[org.ovirt.engine.core.bll.hostdeploy.InstallVdsInternalCommand]
(EE-ManagedThreadFactory-engine-Thread-19) [52669850] Host installation
failed for host 'bd8d007a-be92-4075-bba9-6cbeb890a1e5', 'node_2': Command
returned failure code 1 during SSH session 'root(a)192.168.20.20'
2018-02-27 14:25:37,416+05 INFO
[org.ovirt.engine.core.vdsbroker.SetVdsStatusVDSCommand]
(EE-ManagedThreadFactory-engine-Thread-19) [52669850] START,
SetVdsStatusVDSCommand(HostName = node_2,
SetVdsStatusVDSCommandParameters:{hostId='bd8d007a-be92-4075-bba9-6cbeb890a1e5',
status='InstallFailed', nonOperationalReason='NONE',
stopSpmFailureLogged='false', maintenanceReason='null'}), log id: 2b138e87
2018-02-27 14:25:37,423+05 INFO
[org.ovirt.engine.core.vdsbroker.SetVdsStatusVDSCommand]
(EE-ManagedThreadFactory-engine-Thread-19) [52669850] FINISH,
SetVdsStatusVDSCommand, log id: 2b138e87
2018-02-27 14:25:37,429+05 ERROR
[org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector]
(EE-ManagedThreadFactory-engine-Thread-19) [52669850] EVENT_ID:
VDS_INSTALL_FAILED(505), Host node_2 installation failed. Command returned
failure code 1 during SSH session 'root(a)192.168.20.20'.
2018-02-27 14:25:37,433+05 INFO
[org.ovirt.engine.core.bll.hostdeploy.InstallVdsInternalCommand]
(EE-ManagedThreadFactory-engine-Thread-19) [52669850] Lock freed to object
'EngineLock:{exclusiveLocks='[bd8d007a-be92-4075-bba9-6cbeb890a1e5=VDS]',
sharedLocks=''}'
I have attached log file for your reference
Please help me out.
6 years, 2 months