Hi Diggy,
I'm not sure if it's an oVirt issue, but it can be a network or firewall issue.
Did you test the connection between oVirt hosts and the iLO interfaces?
Simple tests like ping to ensure one host can reach others iLO interfaces and ipmitool to
ensure you can connect to the management interfaces?
Marcos
-----Original Message-----
From: Diggy Mc <d03(a)bornfree.org>
Sent: quinta-feira, 30 de dezembro de 2021 15:02
To: users(a)ovirt.org
Subject: [External] : [ovirt-users] Unrecoverable NMI error on HP Gen8 hosts.
I have oVirt Node v4.4.8.3 running on several HP ProLiant Gen8 servers. I receive the
following error under certain circumstances:
"An Unrecoverable System Error (NMI) has occurred (iLO application watchdog timeout
NMI, Service Information: 0x0000002B, 0x00000000)"
When a host starts taking a load (but nowhere near a threshold), I encounter the above
iLO-logged error and the host locks-up. I have had to grossly under-utilize my hosts to
avoid this problem. I'm hoping for a better fix or work-around.
I've had the same problem beginning with my oVirt 4.3.x hosts, so it isn't oVirt
version specific.
The little information I could find on the error wasn't helpful. Red Hat acknowledges
the issue, but limited to shutdown/reboot operations; not during "normal"
operations.
Anyone else experienced this problem? How did you fix it or work around it? I'd like
to better utilize my servers if possible.
In advance, thank you to anyone and everyone who offers help.
_______________________________________________
Users mailing list -- users(a)ovirt.org
To unsubscribe send an email to users-leave(a)ovirt.org Privacy Statement:
https://urldefense.com/v3/__https://www.ovirt.org/privacy-policy.html__;!...
oVirt Code of Conduct:
https://urldefense.com/v3/__https://www.ovirt.org/community/about/communi...
List Archives:
https://urldefense.com/v3/__https://lists.ovirt.org/archives/list/users@o...