This is a multi-part message in MIME format.
--------------85B0C37C08377D8D314D9F67
Content-Type: text/plain; charset=utf-8; format=flowed
Content-Transfer-Encoding: 7bit
Hello folks.
Ou oVirt (4.1.7.3-1.el7.centos) which runs in one Datacenter and
controls Nodes locally and also remotelly lost communication with the
remote Nodes in another Datacenter.
To this point nothing wrong as the Nodes can continue working as
expected and running their Virtual Machines each without dependency of
the oVirt Engine.
What happened at some point is that when the communication between
Engine and Hosts came back Hosts in the remote Datacenter got confused
and initiated a Live Migration of ALL VMs from one of the hosts to
another. I had also to restart vdsmd agent on all Hosts in order to get
sanity my environment.
What adds up even more strangeness to this scenario is that one of the
Hosts affected by the need of restarting VDSM doesn't belong to the same
Cluster as the others and had to have the vdsmd restarted.
I understand the Hosts can survive without the Engine online with
reduced possibilities but can communicated between them, but without
affecting the VMs or even needing to do what happened in this scenario.
Am I wrong on any of the assumptions ?
Fernando
--------------85B0C37C08377D8D314D9F67
Content-Type: text/html; charset=utf-8
Content-Transfer-Encoding: 7bit
<html>
<head>
<meta http-equiv="content-type" content="text/html;
charset=utf-8">
</head>
<body bgcolor="#FFFFFF" text="#000000">
<font face="arial, helvetica, sans-serif">Hello folks.<br>
<br>
</font><font face="arial, helvetica, sans-serif"><font
face="arial,
helvetica, sans-serif">Ou oVirt (</font></font><font
face="arial, helvetica, sans-serif"><font face="arial,
helvetica,
sans-serif"><span
class="version-text">4.1.7.3-1.el7.centos)</span>
which runs in one Datacenter and controls Nodes locally and also
remotelly lost communication with the remote Nodes in another
Datacenter.<br>
To this point nothing wrong as the Nodes can continue working as
expected and running their Virtual Machines each without
dependency of the oVirt Engine.<br>
<br>
What happened at some point is that when the communication
between Engine and Hosts came back Hosts in the remote
Datacenter got confused and initiated a Live Migration of ALL
VMs from one of the hosts to another. I had also to restart
vdsmd agent on all Hosts in order to get sanity my environment.<br>
<br>
What adds up even more strangeness to this scenario is that one
of the Hosts affected by the need of restarting VDSM doesn't
belong to the same Cluster as the others and had to have the
vdsmd restarted.<br>
<br>
I understand the Hosts can survive without the Engine online
with reduced possibilities but can communicated between them,
but without affecting the VMs or even needing to do what
happened in this scenario.<br>
<br>
Am I wrong on any of the assumptions ?<br>
<br>
Fernando</font></font>
</body>
</html>
--------------85B0C37C08377D8D314D9F67--