I was testing fence kdump integration.
Due to the following issues I was not able to test over JSON RPC and all tests were done
with XML RPC
1) Manually applied :
http://gerrit.ovirt.org/#/c/32843 (this was not included yet
in 3.5 and cause an exception in VDSM when fenceNode is called)
2) JSON : does not return status correctly : Test Succeeded, null
Therefor , can not stop/start/restart host
Related to engine patch :
http://gerrit.ovirt.org/#/c/32855/
not back-ported yet to 3.5 (blocker IMO)
--------------------------------------------------------------
ALL tests below should be redone after resolving JSON issues !
--------------------------------------------------------------
-------------------
General comment
-------------------
All fence kdump flows are related to a scenario in which host became non responsive.
In the case of manual PM action (stop/restart) fence kdump is not taking place.
I think this should be fully documented in order to prevent misunderstanding when the
kdump flag
is checked and host is rebooted/stopped manual while kdumping' which will result in
losing the dump file.
*********************************
Tests using XML RPC
*********************************
------------------------
Installation tests :
------------------------
1) Adding a host with Detect kdump flow set to on and without crashkernel command line
parameter
Result: host installation is OK, but warning message is displayed in Events tab and
Audit log
TEST PASSED
2) Adding a host with Detect kdump flow set to on, with crashkernel command line
parameter, but without
required version of kexec-tools package
Result: host installation is OK, but warning message is displayed in Events tab and
Audit log
TEST PASSED
3) Adding a host with Detect kdump flow set to on, with crashkernel command line parameter
and with required
version of kexec-tools package
Result: host installation is OK, in General tab of host detail view you should see
Kdump Status: Enabled
TEST PASSED
------------------------
Kdump detection tests:
------------------------
1) Crashdumping a host with kdump detection disabled
Prerequisites: host was successfully deployed with Detect kdump flow set to off,
fence_kdump listener
is running
Result: Host changes its status Up -> Connecting -> Non Responsive -> Reboot
-> Non Responsive -> Up,
hard fencing is executed
TEST PASSED
2) Crashdumping a host with kdump detection enabled
Prerequisites: host was successfully deployed with Detect kdump flow set to on,
fence_kdump listener
is running
Result: Host changes its status Up -> Connecting -> Non Responsive ->
Kdumping -> Non Responsive -> Up,
hard fencing is not executed, there are messages in Events tab Kdump flow detected on
host and Kdump
flow finished on host
TEST PASSED
3) Crashdumping a host with kdump detection enabled but fence_kdump listener down
Prerequisites: host was successfully deployed with Detect kdump flow set to on,
fence_kdump listener
is not running
Result: Host changes its status Up -> Connecting -> Non Responsive -> Reboot
-> Non Responsive -> Up,
hard fencing is executed, there's message in Events tab Kdump detection for host
had started,
but fence_kdump listener is not running
TEST PASSED
4) Host with kdump detection enabled, fence_kdump listener is running, but network between
engine
and host is down
Prerequisites: host was successfully deployed with Detect kdump flow set to on,
fence_kdump listener
is running, alter firewall rules on engine to drop everything coming from host's
IP address
Result: Host changes its status Up -> Connecting -> Non Responsive -> Reboot
-> Non Responsive -> Up,
hard fencing is executed, there's message in Events tab Kdump flow not detected on
host
TEST PASSED
5) Crashdumping a host with kdump detection enabled, fence_kdump listener is running, stop
fence_kdump
listener during kdump
Prerequisites: host was successfully deployed with Detect kdump flow set to on,
fence_kdump listener
is running
Actions: When host status is changed to Kdumping, stop fence_kdump listener
Result: Host changes its status Up -> Connecting -> Non Responsive ->
Kdumping -> Reboot -> Non Responsive
-> Up, hard fencing is executed, there are messages in Events tab Kdump flow
detected on host and Kdump
detection for host had started, but fence_kdump listener is not running
TEST PASSED I got this message in event log :
Unable to determine if Kdump is in progress on host
'pluto-vdsf', because fence_kdump listener is not running.
Is this OK ?
6) Crashdumping a host with kdump detection enabled, fence_kdump listener is running,
restart engine during kdump
Prerequisites: host was successfully deployed with Detect kdump flow set to on,
fence_kdump listener is running
Actions: When host status is changed to Kdumping, restart engine
Result: Host changes its status Up -> Connecting -> Non Responsive ->
Kdumping, hard fencing is not
executed, there are messages in Events tab Kdump flow detected on host, after engine
restart host
stays in Kdumping status for the period of DisableFenceAtStartupInSec seconds, after
that there
are messages in Events tab Kdump flow detected on host and Kdump flow finished on host
and
changes status Kdumping -> Non Responsive -> Up
TEST PASSED I got only this message in event log :
Kdump flow is in progress on host 'pluto-vdsf'.
Is this OK?
Thanks
Eli Mesika