
Hi all, I have a 3-node oVirt 4.1 cluster, self-hosted on top of GlusterFS. The cluster is used to host several VMs. I have observed that when the gateway is lost (say the gateway device is down), the oVirt cluster goes down. This seems a bit extreme, especially when one does not care whether the hosted VMs have connectivity to the Internet or not. Can this behavior be disabled? Thanx, Alex

Hi Alex, Please provide the Engine logs from when this occurs and mention the date/time we should focus on. Thanks, Edy.
(In reply to Alex K <rightkicktech@gmail.com>, Mon, Feb 5, 2018 at 2:19 PM.)
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users

Hi Edward, So this is not expected behavior? I will collect logs as soon as I reproduce it. Thanx, Alex
(In reply to Edward Haas <ehaas@redhat.com>, Tue, Feb 6, 2018 at 9:36 AM.)

On Feb 5, 2018 2:21 PM, "Alex K" <rightkicktech@gmail.com> wrote:
> I have observed that when gateway is lost (say the gateway device is down) the ovirt cluster goes down.
Is the cluster down, or just the self-hosted engine?
> It seems a bit extreme behavior especially when one does not care if the hosted VMs have connectivity to Internet or not.
Are the VMs down? The hosts?
Y.

I’ve seen this sort of thing happen on my systems: the gateway IP goes down for some reason and the engine restarts repeatedly, rendering it unusable, even though it’s on the same IP subnet as all the host boxes and can still talk to the VDSMs. In my case, it doesn’t hurt the cluster or DC, but it’s annoying and unnecessary in my environment, where the gateway isn’t important for cluster communications.
I can understand why using the IP of the gateway became a test as a proxy for network connectivity, but it seems like it’s something that isn’t always valid, and maybe the local admin should have a choice of how it’s used. Something like the current fencing option for “50% hosts down” as a double check: if you can still reach the VDSM hosts, don’t restart the engine VM.
-Darrell
(In reply to Yaniv Kaul <ykaul@redhat.com>, Subject: Re: [ovirt-users] ovirt and gateway behavior, February 6, 2018 at 2:40:14 AM CST.)
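For illustration only, the double check Darrell proposes could be sketched roughly as below. This is not how the hosted-engine HA code works; the function name, its parameters, and the 0.5 threshold (borrowed from the "50% hosts down" wording) are all assumptions made up for this sketch.

```python
def should_restart_engine(gateway_ok: bool, hosts_reachable: int,
                          hosts_total: int, min_fraction: float = 0.5) -> bool:
    """Hypothetical policy: restart the engine VM only when the gateway
    check fails AND too few hosts can still be reached."""
    if gateway_ok:
        # Gateway answers, so there is no reason to restart.
        return False
    if hosts_total <= 0:
        # No hosts to cross-check against: fall back to the gateway result.
        return True
    # Gateway is down: restart only if fewer than min_fraction of hosts answer.
    return (hosts_reachable / hosts_total) < min_fraction


# Gateway down but all three hosts reachable: leave the engine alone.
print(should_restart_engine(False, 3, 3))
# Gateway down and only one of three hosts reachable: restart.
print(should_restart_engine(False, 1, 3))
```

How the number of reachable hosts would be obtained is left open here; it would have to come from whatever host monitoring is already in place.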

Hi, I have seen hosts rendered unresponsive when the gateway is lost. I will be able to provide more info once I prepare an environment and test this further. Thanx, Alex
(In reply to Yaniv Kaul <ykaul@redhat.com>, Tue, Feb 6, 2018 at 10:40 AM.)

This is expected behaviour, even if it’s not very bright. It’s being used as a way to detect that the network is operating correctly. I got this trying to install on a network without a gateway. It is insane, as there are so many ways it breaks. My network admin turns off ICMP responses, and it is death to the network.
(In reply to Alex K <rightkicktech@gmail.com>, Tue 6. Feb 2018 at 16:27.)

On Tue, Feb 6, 2018 at 4:32 PM, Ben De Luca <bdeluca@gmail.com> wrote:
> This is expected behaviour, even if it’s not very bright. It’s being used as a way to detect that the network is operating correctly.
Correct, it is used to check whether users can reach the host and the VM that runs on it. There aren't many ways to check that, and all of them require a data exchange of some kind (ICMP request/response, TCP SYN/ACK, some UDP echo, ...).
> It is insane, as there are so many ways it breaks. My network admin turns off ICMP responses, and it is death to the network.
ICMP is an important signaling mechanism. Seriously, it is usually a bad idea to block it.
> I got this trying to install on a network without a gateway.
How were your users accessing the VMs? Was this some kind of super-secure deployment with no outside connectivity?
Best regards, Martin Sivak
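As an aside on the probe types Martin lists: where ICMP is filtered, a TCP handshake is one of the alternatives he mentions. A minimal sketch using only the Python standard library follows; the function name is hypothetical, and it says nothing about what the hosted-engine monitoring actually supports.

```python
import socket


def tcp_reachable(host: str, port: int, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to host:port completes within timeout.

    A completed connection means the SYN / SYN-ACK / ACK handshake worked.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name resolution failures.
        return False
```

A call would look like `tcp_reachable("host1.example.com", 22)`, where both the host name and the port are placeholders. Unlike a ping, this needs a service listening on the chosen port, so a refused connection is reported as unreachable even though the machine itself is up.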

Hi, we use the ping check to see whether the host running the hosted engine has connectivity with the rest of the cluster and with users. We kill the VM in the hope that some other host will make the engine available to users again. We use the gateway by default, as it is pretty common to have a separate network for the data center, but you can change the address if your topology is different. Best regards, Martin Sivak
(In reply to Alex K <rightkicktech@gmail.com>, Tue, Feb 6, 2018 at 4:27 PM.)
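To make the idea of a ping check against a configurable address concrete, here is a rough sketch. It is not the hosted-engine implementation: the function names, the retry count, and the "any success counts" rule are assumptions, and the `ping` flags are the Linux iputils ones.

```python
import subprocess
from typing import Callable


def ping_once(address: str, timeout_s: int = 1) -> bool:
    """Send one ICMP echo request with the system ping (Linux iputils flags)."""
    try:
        result = subprocess.run(
            ["ping", "-c", "1", "-W", str(timeout_s), address],
            stdout=subprocess.DEVNULL,
            stderr=subprocess.DEVNULL,
        )
    except OSError:
        # ping binary missing or not executable
        return False
    return result.returncode == 0


def network_ok(address: str, attempts: int = 3,
               probe: Callable[[str], bool] = ping_once) -> bool:
    """Treat the network as up if any of `attempts` probes of `address` succeeds."""
    return any(probe(address) for _ in range(attempts))
```

The address is just a parameter, so it can be the gateway or any other host that better represents connectivity in a given topology. The probe is passed in as a function so a different check can be swapped in where ICMP is blocked.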
participants (6)
- Alex K
- Ben De Luca
- Darrell Budic
- Edward Haas
- Martin Sivak
- Yaniv Kaul