
We have an ovirt master (engine) host in Los Angeles and some remote servers in the UK. Normally they work fine, but when there is a heavy load on the UK servers the management engine has problems with heartbeat and ends up trying to restart the nodes. I saw in this thread that I can change vdsHeartbeatInSeconds (https://www.mail-archive.com/users@ovirt.org/msg41695.html) but I don't really want to change it globally, just for the nodes in UK. Also not sure how to get the current setting of that value, only how to change it. How do I tell current value? I heard default is 30 seconds. ovirt-engine-4.1.0.4-1.el7.centos.noarch Or maybe its not best practice to have a cluster that far from the engine? 2017-08-24 11:27:51,921-07 WARN [org.ovirt.engine.core.vdsbroker.VdsManager] (DefaultQuartzScheduler3) [feefbf3f-d0e2-4a64-b008-80838d04f130] Failed to refresh VDS, network error, continuing, vds='ovirt1.evuk.j2noc.com'(d0482635-93fd-4cc3-9c78-523078845f11): VDSGenericException: VDSNetworkException: Heartbeat exceeded

On Thu, Aug 24, 2017 at 9:55 PM, Bill James <bill.james@j2.com> wrote:
We have an ovirt master (engine) host in Los Angeles and some remote servers in the UK. Normally they work fine, but when there is a heavy load on the UK servers the management engine has problems with heartbeat and ends up trying to restart the nodes.
Perhaps the mgmt interface is used for traffic other than mgmt? On small scale it's OK. For bigger scale and workloads, it's best to separate traffic to dedicated NICs.
I saw in this thread that I can change vdsHeartbeatInSeconds ( https://www.mail-archive.com/users@ovirt.org/msg41695.html) but I don't really want to change it globally, just for the nodes in UK. Also not sure how to get the current setting of that value, only how to change it. How do I tell current value? I heard default is 30 seconds.
To change it: usr/share/ovirt-engine/dbscripts/engine-psql.sh -c "update vdc_options set option_value = 90 where option_name = 'vdsHeartbeatInSeconds'
ovirt-engine-4.1.0.4-1.el7.centos.noarch
I recommend upgrade, though not specifically due to the above issue.
Or maybe its not best practice to have a cluster that far from the engine?
We have an Engien in Israel managing hosts in Europe and the US. Y.
2017-08-24 11:27:51,921-07 WARN [org.ovirt.engine.core.vdsbroker.VdsManager] (DefaultQuartzScheduler3) [feefbf3f-d0e2-4a64-b008-80838d04f130] Failed to refresh VDS, network error, continuing, vds='ovirt1.evuk.j2noc.com'(d0 482635-93fd-4cc3-9c78-523078845f11): VDSGenericException: VDSNetworkException: Heartbeat exceeded
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users

--------------70E5210877A04E5B4397BEEC Content-Type: text/plain; charset="utf-8"; format=flowed Content-Transfer-Encoding: 8bit thanks, we found when we switched to a different internet provider that the network issues went away. I'll look into upgrading, but it hard for us to keep up. We just got to 4.1.0 recently. :-) On 8/24/17 10:57 PM, Yaniv Kaul wrote:
On Thu, Aug 24, 2017 at 9:55 PM, Bill James <bill.james@j2.com <mailto:bill.james@j2.com>> wrote:
We have an ovirt master (engine) host in Los Angeles and some remote servers in the UK. Normally they work fine, but when there is a heavy load on the UK servers the management engine has problems with heartbeat and ends up trying to restart the nodes.
Perhaps the mgmt interface is used for traffic other than mgmt? On small scale it's OK. For bigger scale and workloads, it's best to separate traffic to dedicated NICs.
I saw in this thread that I can change vdsHeartbeatInSeconds (https://www.mail-archive.com/users@ovirt.org/msg41695.html <https://www.mail-archive.com/users@ovirt.org/msg41695.html>) but I don't really want to change it globally, just for the nodes in UK. Also not sure how to get the current setting of that value, only how to change it. How do I tell current value? I heard default is 30 seconds.
To change it: usr/share/ovirt-engine/dbscripts/engine-psql.sh -c "update vdc_options set option_value = 90 where option_name = 'vdsHeartbeatInSeconds'
ovirt-engine-4.1.0.4-1.el7.centos.noarch
I recommend upgrade, though not specifically due to the above issue.
Or maybe its not best practice to have a cluster that far from the engine?
We have an Engien in Israel managing hosts in Europe and the US. Y.
2017-08-24 11:27:51,921-07 WARN [org.ovirt.engine.core.vdsbroker.VdsManager] (DefaultQuartzScheduler3) [feefbf3f-d0e2-4a64-b008-80838d04f130] Failed to refresh VDS, network error, continuing, vds='ovirt1.evuk.j2noc.com <http://ovirt1.evuk.j2noc.com>'(d0482635-93fd-4cc3-9c78-523078845f11): VDSGenericException: VDSNetworkException: Heartbeat exceeded
_______________________________________________ Users mailing list Users@ovirt.org <mailto:Users@ovirt.org> http://lists.ovirt.org/mailman/listinfo/users <http://lists.ovirt.org/mailman/listinfo/users>
Cloud Services for Business www.j2.com j2 | eFax | eVoice | FuseMail | Campaigner | KeepItSafe | Onebox This email, including its contents and attachments, contains information from j2 Global, Inc. and/or its affiliates that may be privileged, confidential, or otherwise protected from disclosure. The information is intended for the addressee(s) only. If you are not an addressee, any disclosure, copy, distribution, or use of this message is prohibited. If you have received this email in error, please immediately notify the sender by reply email and delete the message and any copies. ©2017 j2 Global, Inc. and affiliates. All rights reserved. eFax®, eVoice®, Campaigner®, FuseMail®, KeepItSafe®, and Onebox® are trademarks of j2 Global, Inc. and affiliates. --------------70E5210877A04E5B4397BEEC Content-Type: text/html; charset="utf-8" Content-Transfer-Encoding: 8bit <html> <head> <meta http-equiv="Content-Type" content="text/html; charset=utf-8"> </head> <body text="#000000" bgcolor="#FFFFFF"> thanks, we found when we switched to a different internet provider that the network issues went away.<br> I'll look into upgrading, but it hard for us to keep up. We just got to 4.1.0 recently. :-)<br> <br> <br> <div class="moz-cite-prefix">On 8/24/17 10:57 PM, Yaniv Kaul wrote:<br> </div> <blockquote type="cite" cite="mid:CAJgorsbdG23AmO2ZThC7jaFcOugiEKMN5w77GXkata21d1uCyw@mail.gmail.com"> <meta http-equiv="Content-Type" content="text/html; charset=utf-8"> <div dir="ltr"><br> <div class="gmail_extra"><br> <div class="gmail_quote">On Thu, Aug 24, 2017 at 9:55 PM, Bill James <span dir="ltr"><<a href="mailto:bill.james@j2.com" target="_blank" moz-do-not-send="true">bill.james@j2.com</a>></span> wrote:<br> <blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">We have an ovirt master (engine) host in Los Angeles and some remote servers in the UK.<br> Normally they work fine, but when there is a heavy load on the UK servers the management engine has problems with heartbeat and ends up trying to restart the nodes.<br> </blockquote> <div><br> </div> <div>Perhaps the mgmt interface is used for traffic other than mgmt? On small scale it's OK. For bigger scale and workloads, it's best to separate traffic to dedicated NICs.</div> <div> </div> <blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"> <br> I saw in this thread that I can change vdsHeartbeatInSeconds (<a href="https://www.mail-archive.com/users@ovirt.org/msg41695.html" rel="noreferrer" target="_blank" moz-do-not-send="true">https://www.mail-archive.com/<wbr>users@ovirt.org/msg41695.html</a>)<br> but I don't really want to change it globally, just for the nodes in UK.<br> Also not sure how to get the current setting of that value, only how to change it. How do I tell current value? I heard default is 30 seconds.<br> </blockquote> <div><br> </div> <div>To change it:</div> <div>usr/share/ovirt-engine/dbscripts/engine-psql.sh -c "update vdc_options set option_value = 90 where option_name = 'vdsHeartbeatInSeconds'</div> <div><br> </div> <div> </div> <blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"> <br> ovirt-engine-4.1.0.4-1.el7.cen<wbr>tos.noarch<br> </blockquote> <div><br> </div> <div>I recommend upgrade, though not specifically due to the above issue.</div> <div> </div> <blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"> <br> Or maybe its not best practice to have a cluster that far from the engine?<br> </blockquote> <div><br> </div> <div>We have an Engien in Israel managing hosts in Europe and the US.</div> <div>Y.</div> <div> </div> <blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"> <br> <br> 2017-08-24 11:27:51,921-07 WARN [org.ovirt.engine.core.vdsbrok<wbr>er.VdsManager] (DefaultQuartzScheduler3) [feefbf3f-d0e2-4a64-b008-80838<wbr>d04f130] Failed to refresh VDS, network error, continuing, vds='<a href="http://ovirt1.evuk.j2noc.com" rel="noreferrer" target="_blank" moz-do-not-send="true">ovirt1.evuk.j2noc.com</a>'(d0<wbr>482635-93fd-4cc3-9c78-52307884<wbr>5f11): VDSGenericException: VDSNetworkException: Heartbeat exceeded<br> <br> ______________________________<wbr>_________________<br> Users mailing list<br> <a href="mailto:Users@ovirt.org" target="_blank" moz-do-not-send="true">Users@ovirt.org</a><br> <a href="http://lists.ovirt.org/mailman/listinfo/users" rel="noreferrer" target="_blank" moz-do-not-send="true">http://lists.ovirt.org/mailman<wbr>/listinfo/users</a><br> </blockquote> </div> <br> </div> </div> </blockquote> <br> <p><a href="http://www.j2global.com/?utm_source=j2global&utm_medium=xsell-referral&utm_campaign=employeeemail"><span style='color:windowtext; text-decoration:none'><img border=0 width=391 height=46 src="http://home.j2.com/j2_Global_Cloud_Services/j2_Global_Email_Footer.jpg" alt="www.j2global.com"></span></a></p> <p><span style='font-size:8.0pt;font-family:"Arial","sans-serif"; color:gray'>This email, including its contents and attachments contains information from <a href="http://www.j2global.com/?utm_source=j2global&utm_medium=xsell-referral&utm_campaign=employemail">j2 Global, Inc</a>. and/or its affiliates that may be privileged, confidential or otherwise protected from disclosure. The information is intended for the addressee(s) only. If you are not an addressee, any disclosure, copy, distribution, or use of this message is prohibited. If you have received this email in error, please immediately notify the sender by reply email and delete the message and any copies. © 2017 <a href="http://www.j2global.com/">j2 Global, Inc</a>, and affiliates. All rights reserved. <a href="http://www.efax.com/">eFax ®</a>, <a href="http://www.evoice.com/">eVoice ®</a>, <a href="http://www.campaigner.com/">Campaigner ®</a>, <a href="http://www.fusemail.com/">FuseMail ®</a>, <a href="http://www.keepitsafe.com/">KeepItSafe ®</a> and <a href="http://www.onebox.com/">Onebox ®</a> are trademarks of <a href="http://www.j2global.com/">j2 Global, Inc</a>. and its affiliates.</span></p></body> </html> --------------70E5210877A04E5B4397BEEC--
participants (2)
-
Bill James
-
Yaniv Kaul