[ovirt-users] New Feature: engine NIC health check

Martin Mucha mmucha at redhat.com
Mon Oct 13 07:49:14 UTC 2014


ad a)
ab b) good, I will make it configurable via engine-config.

ad c) ok, we probably should think about what should be in that table. I've proposed, that only failures should be recorded. Since there's currently no use for other values/NIC states, and NICs will be probably functional most of the time. This will minimize the need to write anything to db, and therefore name "EngineNicFailures". I also wanted to store information in format: "This NIC, was not up during check at 'yyyy.MM.dd HH:mm:ss '". Which gives more information than what you're talking about. But there's no need for this data, so yes currently it'd be possible to store NIC status along with last failure timestamp. So the table could look like:

CREATE TABLE nics_health ( id UUID primary key, name VARCHAR(255), is_healthy BOOL, last_failure TIMESTAMP);

this way, we have to update record during each health check, while in previously proposed [by me] approach only when there's failure, and 'purge' of obsolete data can occur only when there's fence request and/or on some time period. What you're proposing provides less information (which is fine, since we don't need them), it's simpler, but it generates some unnecessary load for db. It's not big deal, but it's not optimal/necessary, since state, when nic is not up is not probable(if I understand it correctly).

Sorry, have no idea what DWH stands for.

ad d) I have no problem extending this feature with providing data over rest, but others should probably agree with that. I don't think any client(external system) need this information, and I also don't think engine nic health is any clients business in first place.

——
I will update feature page asap.
M.


----- Original Message -----
From: "Yair Zaslavsky" <yzaslavs at redhat.com>
To: "Martin Mucha" <mmucha at redhat.com>
Cc: engine-devel at ovirt.org, users at ovirt.org
Sent: Friday, October 10, 2014 3:14:37 PM
Subject: Re: [ovirt-users] New Feature: engine NIC health check



----- Original Message -----
> From: "Martin Mucha" <mmucha at redhat.com>
> To: engine-devel at ovirt.org, users at ovirt.org
> Sent: Wednesday, October 8, 2014 2:33:06 PM
> Subject: [ovirt-users] New Feature: engine NIC health check
> 
> Hi,
> 
> here's link for new feature, related to monitoring engine's NIC, trying to
> detect failure on engine itself and it that case block fencing.
> http://www.ovirt.org/Features/engine_NIC_health_check
> 
> thanks for every input, namely for one addressing some of opened issues.
> 
> M.

I was curious  on how you perform the health check, so I read the feature page - good to learn more Java :)
Regarding open issues -
a. Yes, IMHO the scanning interval should be configured via engine-config - do you see a reason why not to do that? Maybe we should set a minimal interval value and enforSce it?
b. Same for the "no faiures since.." interval
c. I dont like the name of the table you're suggesting. Please consider an alternative. Also you may want to consider having a view that returns you the "static infomration" of the nic + the "stats" part (dynamic part? maybe just nic_state ? ) Why would u like to purge old data and not just hold a record per nic and update per each interval? in this case, no purging is required.
Maybe for DWH you will want some info on the history of the status of the nics... but I'm not sure if this is relevant for now.
d. If you go with my view suggestion, you  might consider displaying the "state" at REST-API

Yair

> _______________________________________________
> Users mailing list
> Users at ovirt.org
> http://lists.ovirt.org/mailman/listinfo/users
>



More information about the Users mailing list