[ovirt-users] power outage: HA vms not restarted

Yuriy Demchenko demchenko.ya at gmail.com
Mon May 19 08:34:15 UTC 2014


Hi,

i'm running ovirt-3.2.2-el6 on 18 el6 hosts with FC san storage, 46 HA 
vms in 2 datacenters (3 hosts uses different storage with no 
connectivity to first storage, that's why second DC)
Recently (2014-05-17) i had a double power outage: first blackout at 
00:16, went back at ~00:19, second blackout at 00:26, went back at 10:06
When finally all went up (after approx. 10:16) - only 2 vms were 
restarted from 46.
 From browsing engine log i saw failed restart attemts of almost all vms 
after first blackout with error 'Failed with error ENGINE and code 
5001', but after second blackout i saw no attempts to restart vms, and 
only error was 'connect timeout' (probably to srv5 - that host 
physically died after blackouts).
And i cant figure why HA vms were not restarted? Please advice

engine and (supposedly) spm host logs in attach.

-- 
Yuriy Demchenko

-------------- next part --------------
A non-text attachment was scrubbed...
Name: engine.log.17052014.gz
Type: application/gzip
Size: 488457 bytes
Desc: not available
URL: <http://lists.ovirt.org/pipermail/users/attachments/20140519/227b13ce/attachment-0002.bin>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: vdsm.log.10.gz
Type: application/gzip
Size: 1177518 bytes
Desc: not available
URL: <http://lists.ovirt.org/pipermail/users/attachments/20140519/227b13ce/attachment-0003.bin>


More information about the Users mailing list