[Users] Hosted Engine always reports "unknown stale-data"
Andrew Lau
andrew at andrewklau.com
Mon Feb 3 07:25:35 EST 2014
On Mon, Feb 3, 2014 at 11:23 PM, Itamar Heim <iheim at redhat.com> wrote:
> On 02/03/2014 01:19 PM, Andrew Lau wrote:
>
>> The issue was a split-brain issue on the dom_md/ids file causing an
>> input/output error, thanks!
>>
>
> is this with gluster?
>
Yup, a 2-brick replicated Gluster volume serving the NFS server. Sorry, I
meant to say that I resolved it too.
>
>
>> On Mon, Feb 3, 2014 at 10:43 PM, Andrew Lau <andrew at andrewklau.com> wrote:
>>
>> On Mon, Feb 3, 2014 at 10:40 PM, Doron Fediuck <dfediuck at redhat.com> wrote:
>>
>>
>>
>> ----- Original Message -----
>> > From: "Andrew Lau" <andrew at andrewklau.com>
>> > To: "Doron Fediuck" <dfediuck at redhat.com>
>> > Cc: "users" <users at ovirt.org>, "Jiri Moskovcak" <jmoskovc at redhat.com>, "Greg Padgett" <gpadgett at redhat.com>
>> > Sent: Monday, February 3, 2014 1:35:01 PM
>> > Subject: Re: [Users] Hosted Engine always reports "unknown stale-data"
>> >
>> > On Mon, Feb 3, 2014 at 9:53 PM, Doron Fediuck <dfediuck at redhat.com> wrote:
>> >
>> > >
>> > >
>> > > ----- Original Message -----
>> > > > From: "Andrew Lau" <andrew at andrewklau.com>
>> > > > To: "users" <users at ovirt.org>
>> > > > Sent: Monday, February 3, 2014 12:32:45 PM
>> > > > Subject: [Users] Hosted Engine always reports "unknown stale-data"
>> > > >
>> > > > Hi,
>> > > >
>> > > > I was wondering if anyone has this same notice when they run:
>> > > > hosted-engine --vm-status
>> > > >
>> > > > The "engine status" will always be "unknown stale-data" even when the VM is
>> > > > powered on and the engine is online. engine-health will actually report the
>> > > > correct status.
>> > > >
>> > > > eg.
>> > > >
>> > > > --== Host 1 status ==--
>> > > >
>> > > > Status up-to-date : False
>> > > > Hostname : 172.16.0.11
>> > > > Host ID : 1
>> > > > Engine status : unknown stale-data
>> > > >
>> > > > Is it some sort of blocked port causing this or is this by design?
>> > > >
>> > > > Thanks,
>> > > > Andrew
>> > > >
>> > > > _______________________________________________
>> > > > Users mailing list
>> > > > Users at ovirt.org
>> > > > http://lists.ovirt.org/mailman/listinfo/users
>> > > >
>> > >
>> > > Hi Andrew,
>> > > it looks like an issue with the time stamp.
>> > > Which time stamp do you have? How relevant is it?
>> > >
>> >
>> > timestamps seem to be outdated by a lot, interesting error in the broker.log
>> >
>> > Thread-24::INFO::2014-02-03 22:33:14,801::engine_health::90::engine_health.CpuLoadNoEngine::(action) VM not running on this host, status down
>> > Thread-22::INFO::2014-02-03 22:33:14,834::mem_free::53::mem_free.MemFree::(action) memFree: 27382
>> > Thread-23::ERROR::2014-02-03 22:33:14,922::cpu_load_no_engine::156::cpu_load_no_engine.EngineHealth::(update_stat_file) Failed to getVmStats: 'pid'
>> > Thread-23::INFO::2014-02-03 22:33:14,923::cpu_load_no_engine::121::cpu_load_no_engine.EngineHealth::(calculate_load) System load total=0.0124, engine=0.0000, non-engine=0.0124
>> >
>> > I'm assuming that update_stat_file is the metadata file the vm-status is
>> > getting pulled from?
>> >
>>
>> Yep.
>> Can you please verify the time your host actually has?
>> ie- we have a known issue with time, since we assume all
>> hosts are in sync. So if one of your hosts has a time sync
>> issue, this can explain the problem you see.
>>
>>
>> --== Host 1 status ==--
>>
>> Status up-to-date : False
>> Hostname : 172.16.0.11
>> Host ID : 1
>> Engine status : unknown stale-data
>> Score : 0
>> Local maintenance : False
>> Host timestamp : 1391417611
>>
>> --== Host 2 status ==--
>>
>> Status up-to-date : False
>> Hostname : 172.16.0.12
>> Host ID : 2
>> Engine status : unknown stale-data
>> Score : 0
>> Local maintenance : False
>> Host timestamp : 1391417171
>>
>>
>>
>> [root at hv01 ~]# date +%s
>> 1391427754
>>
>> [root at hv02 ~]# date +%s
>> 1391427755
>>
>>
>>
>>
>>
>>
>>
>>
>
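For anyone reading the numbers quoted above, here is a rough sketch of the arithmetic in Python. It takes the two "Host timestamp" values from the --vm-status output and the two `date +%s` results, and works out how old each host's published metadata is. The function name `metadata_age` and the `NOW` mapping are made up for illustration and are not part of hosted-engine. It also assumes the two `date +%s` commands were run at about the same moment as each other.

```python
# Status text copied from the --vm-status output quoted in this thread.
STATUS_TEXT = """
--== Host 1 status ==--
Status up-to-date : False
Hostname : 172.16.0.11
Host ID : 1
Engine status : unknown stale-data
Score : 0
Local maintenance : False
Host timestamp : 1391417611

--== Host 2 status ==--
Status up-to-date : False
Hostname : 172.16.0.12
Host ID : 2
Engine status : unknown stale-data
Score : 0
Local maintenance : False
Host timestamp : 1391417171
"""

# `date +%s` results from hv01 and hv02, keyed by Host ID.
# Assumption: hv01 is Host ID 1 and hv02 is Host ID 2.
NOW = {1: 1391427754, 2: 1391427755}


def metadata_age(status_text, now_by_host):
    """Return {host_id: seconds between 'Host timestamp' and the host's clock}."""
    ages = {}
    host_id = None
    for line in status_text.splitlines():
        key, sep, value = line.partition(":")
        if not sep:
            continue  # section banners and blank lines carry no "key : value"
        key, value = key.strip(), value.strip()
        if key == "Host ID":
            host_id = int(value)
        elif key == "Host timestamp" and host_id is not None:
            ages[host_id] = now_by_host[host_id] - int(value)
    return ages


ages = metadata_age(STATUS_TEXT, NOW)
clock_skew = abs(NOW[1] - NOW[2])

for host_id, age in sorted(ages.items()):
    print(f"Host {host_id}: metadata is {age} s old")
print(f"Clock difference between hosts: {clock_skew} s")
```

With these inputs the two clocks differ by only one second, while the published timestamps lag by 10143 s and 10584 s (close to three hours). That suggests the metadata was not being refreshed rather than the clocks drifting apart, which would fit the input/output error on dom_md/ids mentioned at the top of this message.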