[Users] Hosted Engine always reports "unknown stale-data"

Andrew Lau andrew at andrewklau.com
Mon Feb 3 12:29:55 UTC 2014


On Mon, Feb 3, 2014 at 11:27 PM, Itamar Heim <iheim at redhat.com> wrote:

> On 02/03/2014 01:25 PM, Andrew Lau wrote:
>
>> On Mon, Feb 3, 2014 at 11:23 PM, Itamar Heim <iheim at redhat.com
>> <mailto:iheim at redhat.com>>wrote:
>>
>>
>>     On 02/03/2014 01:19 PM, Andrew Lau wrote:
>>
>>         The issue was a split-brain issue on the dom_md/ids file causing
>> an
>>         input/output error, thanks!
>>
>>
>>     is this with gluster?
>>
>>
>> ​Yup a 2 brick gluster replicated instance serving the NFS server, sorry
>> was meant to say I resolved it too.​
>>
>
> you have to use a gluster with quorum, or this will happen often
>>

​Yeah, I disabled quorum temporarily because I'm only using a two host
scenario and I need to have the case scenario where one is to be shutdown
the VMs won't end up in a paused state.


>>
>
>
>>
>>
>>         On Mon, Feb 3, 2014 at 10:43 PM, Andrew Lau
>>         <andrew at andrewklau.com <mailto:andrew at andrewklau.com>
>>         <mailto:andrew at andrewklau.com <mailto:andrew at andrewklau.com>>__>
>>         wrote:
>>
>>              On Mon, Feb 3, 2014 at 10:40 PM, Doron Fediuck
>>         <dfediuck at redhat.com <mailto:dfediuck at redhat.com>
>>              <mailto:dfediuck at redhat.com <mailto:dfediuck at redhat.com>>>
>>         wrote:
>>
>>
>>
>>                  ----- Original Message -----
>>                   > From: "Andrew Lau" <andrew at andrewklau.com
>>         <mailto:andrew at andrewklau.com>
>>                  <mailto:andrew at andrewklau.com
>>         <mailto:andrew at andrewklau.com>>__>
>>                   > To: "Doron Fediuck" <dfediuck at redhat.com
>>         <mailto:dfediuck at redhat.com>
>>                  <mailto:dfediuck at redhat.com <mailto:dfediuck at redhat.com
>> >>>
>>                   > Cc: "users" <users at ovirt.org
>>         <mailto:users at ovirt.org> <mailto:users at ovirt.org
>>         <mailto:users at ovirt.org>>>, "Jiri
>>                  Moskovcak" <jmoskovc at redhat.com
>>         <mailto:jmoskovc at redhat.com> <mailto:jmoskovc at redhat.com
>>         <mailto:jmoskovc at redhat.com>>>,
>>                  "Greg Padgett" <gpadgett at redhat.com
>>         <mailto:gpadgett at redhat.com> <mailto:gpadgett at redhat.com
>>         <mailto:gpadgett at redhat.com>>>
>>                   > Sent: Monday, February 3, 2014 1:35:01 PM
>>                   > Subject: Re: [Users] Hosted Engine always reports
>>         "unknown
>>                  stale-data"
>>                   >
>>                   > On Mon, Feb 3, 2014 at 9:53 PM, Doron Fediuck
>>                  <dfediuck at redhat.com <mailto:dfediuck at redhat.com>
>>         <mailto:dfediuck at redhat.com <mailto:dfediuck at redhat.com>>> wrote:
>>                   >
>>                   > >
>>                   > >
>>                   > > ----- Original Message -----
>>                   > > > From: "Andrew Lau" <andrew at andrewklau.com
>>         <mailto:andrew at andrewklau.com>
>>                  <mailto:andrew at andrewklau.com
>>         <mailto:andrew at andrewklau.com>>__>
>>                   > > > To: "users" <users at ovirt.org
>>         <mailto:users at ovirt.org> <mailto:users at ovirt.org
>>         <mailto:users at ovirt.org>>>
>>                   > > > Sent: Monday, February 3, 2014 12:32:45 PM
>>                   > > > Subject: [Users] Hosted Engine always reports
>>         "unknown
>>                  stale-data"
>>                   > > >
>>                   > > > Hi,
>>                   > > >
>>                   > > > I was wondering if anyone has this same notice
>>         when they run:
>>                   > > > hosted-engine --vm-status
>>                   > > >
>>                   > > > The "engine status" will always be "unknown
>>         stale-data"
>>                  even when the VM
>>                   > > is
>>                   > > > powered on and the engine is online.
>>         engine-health will
>>                  actually report
>>                   > > the
>>                   > > > correct status.
>>                   > > >
>>                   > > > eg.
>>                   > > >
>>                   > > > --== Host 1 status ==--
>>                   > > >
>>                   > > > Status up-to-date : False
>>                   > > > Hostname : 172.16.0.11
>>                   > > > Host ID : 1
>>                   > > > Engine status : unknown stale-data
>>                   > > >
>>                   > > > Is it some sort of blocked port causing this or
>>         is this
>>                  by design?
>>                   > > >
>>                   > > > Thanks,
>>                   > > > Andrew
>>                   > > >
>>                   > > > _________________________________________________
>>                   > > > Users mailing list
>>                   > > > Users at ovirt.org <mailto:Users at ovirt.org>
>>         <mailto:Users at ovirt.org <mailto:Users at ovirt.org>>
>>
>>                   > > > http://lists.ovirt.org/__mailman/listinfo/users
>>         <http://lists.ovirt.org/mailman/listinfo/users>
>>                   > > >
>>                   > >
>>                   > > Hi Andrew,
>>                   > > it looks like an issue with the time stamp.
>>                   > > Which time stamp do you have? How relevant is it?
>>                   > >
>>                   >
>>                   > timestamps seem to be outdated by a lot, interesting
>>         error in
>>                  the broker.log
>>                   >
>>                   > Thread-24::INFO::2014-02-03
>>                   >
>>
>>         22:33:14,801::engine_health::__90::engine_health.__
>> CpuLoadNoEngine::(action)
>>                  VM
>>                   > not running on this host, status down
>>                   > Thread-22::INFO::2014-02-03
>>                   > 22:33:14,834::mem_free::53::__
>> mem_free.MemFree::(action)
>>                  memFree: 27382
>>                   > Thread-23::ERROR::2014-02-03
>>                   >
>>
>>         22:33:14,922::cpu_load_no___engine::156::cpu_load_no___
>> engine.EngineHealth::(update___stat_file)
>>                   > Failed to getVmStats: 'pid'
>>                   > Thread-23::INFO::2014-02-03
>>                   >
>>
>>         22:33:14,923::cpu_load_no___engine::121::cpu_load_no___
>> engine.EngineHealth::(__calculate_load)
>>                   > System load total=0.0124, engine=0.0000,
>>         non-engine=0.0124
>>                   >
>>                   > I'm assuming that update_stat_file is the metadata
>>         file the
>>                  vm-status is
>>                   > getting pulled from?
>>                   >
>>
>>                  Yep.
>>                  Can you please verify the time your host actually has?
>>                  ie- we have a known issue with time, since we assume all
>>                  hosts are in sync. So if one of your hosts has a time
>> sync
>>                  issue, this can explain the problem you see.
>>
>>
>>              --== Host 1 status ==--
>>
>>              Status up-to-date                  : False
>>              Hostname                           : 172.16.0.11
>>              Host ID                            : 1
>>              Engine status                      : unknown stale-data
>>              Score                              : 0
>>              Local maintenance                  : False
>>              Host timestamp                     : 1391417611
>>
>>              --== Host 2 status ==--
>>
>>              Status up-to-date                  : False
>>              Hostname                           : 172.16.0.12
>>              Host ID                            : 2
>>              Engine status                      : unknown stale-data
>>              Score                              : 0
>>              Local maintenance                  : False
>>              Host timestamp                     : 1391417171
>>
>>
>>>>              [root at hv01 ~]# date +%s
>>                                       │[root at hv02 ~]# date +%s
>>              ​​
>>              1391427754
>>                                      │139142775
>>              ​5​
>>
>>              ​​
>>
>>
>>
>>
>>         _________________________________________________
>>         Users mailing list
>>         Users at ovirt.org <mailto:Users at ovirt.org>
>>         http://lists.ovirt.org/__mailman/listinfo/users
>>         <http://lists.ovirt.org/mailman/listinfo/users>
>>
>>
>>
>>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.ovirt.org/pipermail/users/attachments/20140203/62f7333b/attachment-0001.html>


More information about the Users mailing list