--Apple-Mail=_888E7FA5-6914-4728-8646-862772EBB5CF
Content-Transfer-Encoding: quoted-printable
Content-Type: text/plain;
charset=utf-8

On 16 Sep 2016, at 14:23, Martin Perina <mperina(a)redhat.com> wrote:

On Fri, Sep 16, 2016 at 1:54 PM, Simone Tiraboschi <stirabos(a)redhat.com> wrote:

On Fri, Sep 16, 2016 at 12:50 PM, Martin Perina <mperina(a)redhat.com> wrote:

On Fri, Sep 16, 2016 at 9:26 AM, Michal Skrivanek <michal.skrivanek(a)redhat.com> wrote:

> On 16 Sep 2016, at 08:29, aleksey.maksimov(a)it-kb.ru wrote:
>
> Are there any more ideas?
>
> 15.09.2016, 14:40, "aleksey.maksimov(a)it-kb.ru" <aleksey.maksimov(a)it-kb.ru>:
>> Martin, I physically powered off the server through iLO2. See the screenshots.
>> I did not touch the virtual machine (KOM-AD01-PBX02) at that time.
>> The virtual machine was running at the time the host shut down.
>>
>> 15.09.2016, 14:27, "Martin Perina" <mperina(a)redhat.com>:
>>> Hi,
>>>
>>> I found this in the log:
>>>
>>> 2016-09-15 12:02:04,661 INFO [org.ovirt.engine.core.vdsbroker.monitoring.VmAnalyzer] (ForkJoinPool-1-worker-6) [] VM '660bafca-e9c3-4191-99b4-295ff8553488'(KOM-AD01-PBX02) moved from 'Up' --> 'Down'
>>> 2016-09-15 12:02:04,788 INFO [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (ForkJoinPool-1-worker-6) [] Correlation ID: null, Call Stack: null, Custom Event ID: -1, Message: VM KOM-AD01-PBX02 is down. Exit message: User shut down from within the guest

Since it shut down cleanly, can you please check the guest's logs to see what triggered the shutdown? In such cases it is considered a user-requested shutdown, and such VMs are not restarted automatically.
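
For anyone triaging a similar report, the exit reason can be pulled out of engine.log lines like the ones quoted in this thread with a few lines of Python. This is a minimal sketch, not an oVirt tool: the regular expression and the function name `find_vm_down_events` are assumptions based only on the message format shown here, and other oVirt versions may word the message differently.

```python
import re

# Pattern modelled on the AuditLogDirector line quoted in this thread.
# It is an assumption, not an official log format specification.
VM_DOWN = re.compile(
    r"^(?P<ts>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}),\d+ .*"
    r"Message: VM (?P<vm>\S+) is down\. Exit message: (?P<reason>.+)$"
)

def find_vm_down_events(lines):
    """Return (timestamp, vm_name, exit_message) for each matching line."""
    events = []
    for line in lines:
        m = VM_DOWN.match(line.strip())
        if m:
            events.append((m.group("ts"), m.group("vm"), m.group("reason")))
    return events

# The log line from this thread, joined back into a single line.
sample = [
    "2016-09-15 12:02:04,788 INFO "
    "[org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] "
    "(ForkJoinPool-1-worker-6) [] Correlation ID: null, Call Stack: null, "
    "Custom Event ID: -1, Message: VM KOM-AD01-PBX02 is down. "
    "Exit message: User shut down from within the guest",
]

for ts, vm, reason in find_vm_down_events(sample):
    print(ts, vm, reason)
```

In practice the lines would come from the engine.log file on the engine machine rather than from an inline list.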

That's exactly what I meant in my response. From the log it's obvious that the VM was shut down properly, so the engine will not restart it on a different host. Also, on most modern hosts, executing the power management off action sends a signal to the OS to perform a regular shutdown, so the VMs are also shut down properly.

I understand the reason, but is it really what the user expects?

I mean, if I set HA mode on a VM, I'd expect the engine to take care of keeping it up, or restarting it if needed, regardless of the shutdown reason.

No, that's not how HA works today. When you log into a guest and issue "shutdown", we do not restart the VM under your hands. We can argue about how it should or may work, but this has been the defined behavior since the dawn of oVirt.

AFAIK that's correct: we need to be able to shut down an HA VM without it being immediately restarted on a different host. We want to restart an HA VM only if the host where it is running is non-responsive.

We try to restart it in all cases other than a user-initiated shutdown, e.g. a QEMU process crash on an otherwise-healthy host.
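
The rule described here can be summarised in a few lines of Python. This is only an illustrative model of the behaviour stated in this thread, not the engine's code: the `ExitReason` enum, its members, and `should_restart` are hypothetical names invented for the example.

```python
from enum import Enum, auto

class ExitReason(Enum):
    """Hypothetical categories for why a VM went down."""
    USER_SHUTDOWN = auto()        # clean shutdown, e.g. issued inside the guest
    QEMU_CRASH = auto()           # QEMU process died on an otherwise-healthy host
    HOST_NON_RESPONSIVE = auto()  # host was lost and then fenced

def should_restart(highly_available: bool, reason: ExitReason) -> bool:
    """Model of the stated rule: an HA VM is restarted in every case
    except a user-initiated shutdown."""
    if not highly_available:
        return False
    return reason is not ExitReason.USER_SHUTDOWN

for reason in ExitReason:
    print(reason.name, should_restart(True, reason))
```

Under this model, the case reported in this thread falls into the first category: the engine recorded "User shut down from within the guest", so the VM is not restarted even though it is marked highly available.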

For instance, on hosted-engine, the HA agent, if not in global maintenance mode, will restart the engine VM regardless of who shut it down or why.

Well, the HE VM is definitely not a standard HA VM :-)

We are aware of a similar issue on specific hardware - https://bugzilla.redhat.com/show_bug.cgi?id=1341106

>>>
>>> If I'm not mistaken, this means that the VM was properly shut down from within the guest, and in that case it's not restarted automatically. So I'm curious: what did you do to make host KOM-AD01-VM31 non-responsive?
>>>
>>> If you want to test fencing properly, then I suggest you either block the connection between host and engine on the host side or forcibly stop the ovirtmgmt network interface on the host, and watch whether fencing is applied.

Try the above if you want to test fencing. Of course, you can always configure a firewall rule to drop all packets between the engine and the host, or unplug the host's network cable.
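
For the firewall variant, the rules might look like the output of the sketch below. It only prints `iptables` commands, to be run as root on the host under test, and does not execute anything itself. The engine address `192.0.2.10` is a placeholder from the documentation range, the helper name `fence_test_rules` is made up for this example, and an IPv4 address is assumed (IPv6 would need `ip6tables`).

```python
import ipaddress
from typing import List

def fence_test_rules(engine_ip: str, undo: bool = False) -> List[str]:
    """Build iptables commands that drop all traffic between this host
    and the engine. With undo=True, build the commands that delete
    those same rules again."""
    ip = ipaddress.ip_address(engine_ip)  # raises ValueError on a bad address
    action = "-D" if undo else "-I"
    return [
        f"iptables {action} INPUT -s {ip} -j DROP",
        f"iptables {action} OUTPUT -d {ip} -j DROP",
    ]

# Placeholder engine address; substitute the real one.
for cmd in fence_test_rules("192.0.2.10"):
    print(cmd)

print("# once the test is finished:")
for cmd in fence_test_rules("192.0.2.10", undo=True):
    print(cmd)
```

Unlike a power-off through iLO, this leaves the host and its VMs running while the engine loses contact, which is the non-responsive situation discussed in this thread.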

>>>
>>> Martin
>>>
>>> On Thu, Sep 15, 2016 at 1:16 PM, <aleksey.maksimov(a)it-kb.ru> wrote:
>>>> engine.log for this period.
>>>>
>>>> 15.09.2016, 14:01, "Martin Perina" <mperina(a)redhat.com>:
>>>>> On Thu, Sep 15, 2016 at 12:47 PM, <aleksey.maksimov(a)it-kb.ru> wrote:
>>>>>> Hi Martin.
>>>>>> I have a stupid question. Is a watchdog device mandatory for a virtual machine to be started automatically as part of the host fencing process?
>>>>>
>>>>> AFAIK it's not, but I'm not an expert, adding Arik.
>>>>>
>>>>> You need a correct power management setup for the hosts, and the VM has to be marked as highly available, for sure.
>>>>>
>>>>>> 15.09.2016, 13:43, "Martin Perina" <mperina(a)redhat.com>:
>>>>>>> Hi,
>>>>>>>
>>>>>>> could you please share the whole engine.log?
>>>>>>>
>>>>>>> Thanks
>>>>>>>
>>>>>>> Martin Perina
>>>>>>>
>>>>>>> On Thu, Sep 15, 2016 at 12:01 PM, <aleksey.maksimov(a)it-kb.ru> wrote:
>>>>>>>> Hello oVirt gurus!
>>>>>>>>
>>>>>>>> I have oVirt Hosted Engine 4.0.3-1.el7.centos on two CentOS 7.2 hosts (HP ProLiant DL 360 G5) connected to shared FC SAN storage.
>>>>>>>>
>>>>>>>> 1. I configured Power Management for the hosts (successfully added a fencing agent for iLO2 on my hosts).
>>>>>>>>
>>>>>>>> 2. I created a new VM (KOM-AD01-PBX02) and installed the guest OS (Ubuntu Server 16.04 LTS) and the oVirt Guest Agent
>>>>>>>> (as described here: https://blog.it-kb.ru/2016/09/14/install-ovirt-4-0-part-2-about-data-center-iso-domain-logical-network-vlan-vm-settings-console-guest-agent-live-migration/).
>>>>>>>> In the VM settings, under "High Availability", I turned on the "Highly Available" option and changed "Priority" to "High".
>>>>>>>>
>>>>>>>> 3. Now I'm trying to test hard fencing, so I powered off my first host (KOM-AD01-VM31) from its iLO (KOM-AD01-ILO31).
>>>>>>>>
>>>>>>>> Fencing works successfully and the server is automatically turned back on, but my HA VM is not started on the second host (KOM-AD01-VM32).
>>>>>>>>
>>>>>>>> These are the events I see in the oVirt web console:
>>>>>>>>
>>>>>>>> Sep 15, 2016 12:08:13 PM Host KOM-AD01-VM31 power management was verified successfully.
>>>>>>>> Sep 15, 2016 12:08:13 PM Status of host KOM-AD01-VM31 was set to Up.
>>>>>>>> Sep 15, 2016 12:08:05 PM Executing power management status on Host KOM-AD01-VM31 using Proxy Host KOM-AD01-VM32 and Fence Agent ilo:KOM-AD01-ILO31.holding.com.
>>>>>>>> Sep 15, 2016 12:05:48 PM Host KOM-AD01-VM31 is rebooting.
>>>>>>>> Sep 15, 2016 12:05:48 PM Host KOM-AD01-VM31 was started by SYSTEM.
>>>>>>>> Sep 15, 2016 12:05:48 PM Power management start of Host KOM-AD01-VM31 succeeded.
>>>>>>>> Sep 15, 2016 12:05:41 PM Executing power management status on Host KOM-AD01-VM31 using Proxy Host KOM-AD01-VM32 and Fence Agent ilo:KOM-AD01-ILO31.holding.com.
>>>>>>>> Sep 15, 2016 12:05:19 PM Executing power management start on Host KOM-AD01-VM31 using Proxy Host KOM-AD01-VM32 and Fence Agent ilo:KOM-AD01-ILO31.holding.com.
>>>>>>>> Sep 15, 2016 12:05:19 PM Power management start of Host KOM-AD01-VM31 initiated.
>>>>>>>> Sep 15, 2016 12:05:19 PM Auto fence for host KOM-AD01-VM31 was started.
>>>>>>>> Sep 15, 2016 12:05:11 PM Executing power management status on Host KOM-AD01-VM31 using Proxy Host KOM-AD01-VM32 and Fence Agent ilo:KOM-AD01-ILO31.holding.com.
>>>>>>>> Sep 15, 2016 12:05:04 PM Executing power management status on Host KOM-AD01-VM31 using Proxy Host KOM-AD01-VM32 and Fence Agent ilo:KOM-AD01-ILO31.holding.com.
>>>>>>>> Sep 15, 2016 12:05:04 PM Host KOM-AD01-VM31 is non responsive.
>>>>>>>> Sep 15, 2016 12:02:32 PM Host KOM-AD01-VM31 is not responding. It will stay in Connecting state for a grace period of 60 seconds and after that an attempt to fence the host will be issued.
>>>>>>>> Sep 15, 2016 12:02:32 PM VDSM KOM-AD01-VM31 command failed: Heartbeat exeeded
>>>>>>>> Sep 15, 2016 12:02:04 PM VM KOM-AD01-PBX02 is down. Exit message: User shut down from within the guest
>>>>>>>>
>>>>>>>> What am I doing wrong? Why does the HA VM not start on the second host?
>>>>>>>> _______________________________________________
>>>>>>>> Users mailing list
>>>>>>>> Users(a)ovirt.org
>>>>>>>> http://lists.ovirt.org/mailman/listinfo/users
--Apple-Mail=_888E7FA5-6914-4728-8646-862772EBB5CF--