On Thu, Aug 24, 2017 at 9:55 PM, Bill James <bill.james(a)j2.com> wrote:
We have an ovirt master (engine) host in Los Angeles and some remote
servers in the UK.
Normally they work fine, but when there is a heavy load on the UK servers
the management engine has problems with heartbeat and ends up trying to
restart the nodes.
Perhaps the mgmt interface is used for traffic other than mgmt? On small
scale it's OK. For bigger scale and workloads, it's best to separate
traffic to dedicated NICs.
I saw in this thread that I can change vdsHeartbeatInSeconds (
https://www.mail-archive.com/users@ovirt.org/msg41695.html)
but I don't really want to change it globally, just for the nodes in UK.
Also not sure how to get the current setting of that value, only how to
change it. How do I tell current value? I heard default is 30 seconds.
To change it:
usr/share/ovirt-engine/dbscripts/engine-psql.sh -c "update vdc_options set
option_value = 90 where option_name = 'vdsHeartbeatInSeconds'
ovirt-engine-4.1.0.4-1.el7.centos.noarch
I recommend upgrade, though not specifically due to the above issue.
Or maybe its not best practice to have a cluster that far from the engine?
We have an Engien in Israel managing hosts in Europe and the US.
Y.
2017-08-24 11:27:51,921-07 WARN [org.ovirt.engine.core.vdsbroker.VdsManager]
(DefaultQuartzScheduler3) [feefbf3f-d0e2-4a64-b008-80838d04f130] Failed
to refresh VDS, network error, continuing, vds='ovirt1.evuk.j2noc.com'(d0
482635-93fd-4cc3-9c78-523078845f11): VDSGenericException:
VDSNetworkException: Heartbeat exceeded
_______________________________________________
Users mailing list
Users(a)ovirt.org
http://lists.ovirt.org/mailman/listinfo/users