
--Apple-Mail=_F6BEE682-0833-4DDF-AC85-B3A2BC1EDFF6 Content-Transfer-Encoding: quoted-printable Content-Type: text/plain; charset=utf-8
On 3 Jul 2017, at 15:35, Shlomo Ben David <sbendavi@redhat.com> wrote: =20 Hi, =20 Test failed: [ 006_migrations.migrate_vm ] Link to suspected patches: N/A Link to Job: = http://jenkins.ovirt.org/job/test-repo_ovirt_experimental_master/7431/ = <http://jenkins.ovirt.org/job/test-repo_ovirt_experimental_master/7431/> Link to all logs:=20 Error snippet from the log: = http://jenkins.ovirt.org/job/test-repo_ovirt_experimental_master/7431/arti= fact/exported-artifacts/basic-suit-master-el7/test_logs/basic-suite-master= /post-006_migrations.py/ = <http://jenkins.ovirt.org/job/test-repo_ovirt_experimental_master/7431/art= ifact/exported-artifacts/basic-suit-master-el7/test_logs/basic-suite-maste= r/post-006_migrations.py/> =20 <error> =20 "Fault reason is "Operation Failed". Fault detail is "[Cannot migrate = VM. There is no host that satisfies current scheduling constraints. See = below for details:, The host lago-basic-suite-master-host0 did not = satisfy internal filter CPUOverloaded because its CPU is too loaded.]" =20 </error> =20 <engine log> =20 2017-07-02 16:43:22,829-04 INFO = [org.ovirt.engine.core.bll.MigrateVmToServerCommand] (default task-27) = [87508047-fdc5-4a2f-9692-c83f7b55bbc2] Lock Acquired to object = 'EngineLock:{exclusiveLocks=3D'[2b34910d-cef2-44d6-a274-30e8473eb5d9=3DVM]= ', sharedLocks=3D''}' 2017-07-02 16:43:22,833-04 DEBUG = [org.ovirt.engine.core.dal.dbbroker.PostgresDbEngineDialect$PostgresSimple= JdbcCall] (default task-27) [87508047-fdc5-4a2f-9692-c83f7b55bbc2] = Compiled stored procedure. Call string is [{call = getdiskvmelementspluggedtovm(?)}] 2017-07-02 16:43:22,833-04 DEBUG = [org.ovirt.engine.core.dal.dbbroker.PostgresDbEngineDialect$PostgresSimple= JdbcCall] (default task-27) [87508047-fdc5-4a2f-9692-c83f7b55bbc2] = SqlCall for procedure [GetDiskVmElementsPluggedToVm] compiled 2017-07-02 16:43:22,843-04 DEBUG = [org.ovirt.engine.core.dal.dbbroker.PostgresDbEngineDialect$PostgresSimple= JdbcCall] (default task-27) [87508047-fdc5-4a2f-9692-c83f7b55bbc2] = Compiled stored procedure. Call string is [{call = getattacheddisksnapshotstovm(?, ?)}] 2017-07-02 16:43:22,843-04 DEBUG = [org.ovirt.engine.core.dal.dbbroker.PostgresDbEngineDialect$PostgresSimple= JdbcCall] (default task-27) [87508047-fdc5-4a2f-9692-c83f7b55bbc2] = SqlCall for procedure [GetAttachedDiskSnapshotsToVm] compiled 2017-07-02 16:43:22,919-04 INFO = [org.ovirt.engine.core.bll.scheduling.SchedulingManager] (default = task-27) [87508047-fdc5-4a2f-9692-c83f7b55bbc2] Candidate host = 'lago-basic-suite-master-host0' ('46bdc63d-98f5-4eee-81aa-2fb88b8f7cbe') = was filtered out by 'VAR__FILTERTYPE__INTERNAL' filter 'CPUOverloaded' = (correlation id: null) 2017-07-02 16:43:22,920-04 WARN = [org.ovirt.engine.core.bll.MigrateVmToServerCommand] (default task-27) = [87508047-fdc5-4a2f-9692-c83f7b55bbc2] Validation of action = 'MigrateVmToServer' failed for user admin@internal-authz. Reasons: = VAR__ACTION__MIGRATE,VAR__TYPE__VM,SCHEDULING_ALL_HOSTS_FILTERED_OUT,VAR__= FILTERTYPE__INTERNAL,$hostName lago-basic-suite-master-host0,$filterName = CPUOverloaded,VAR__DETAIL__CPU_OVERLOADED,SCHEDULING_HOST_FILTERED_REASON_= WITH_DETAIL
This has nothing to do with migration The CPUOverload is a scheduling policy, unless there was any change in = that area the obvious explanation would be that the host has a CPU = overload condition. I briefly looked at logs and see ""cpuUser": "83.40", "cpuSys": "16.59", = "cpuIdle": =E2=80=9C0.08=E2=80=9D=E2=80=9D which indeed suggests an = overload, from the same sample I can see it=E2=80=99s vdsm = ("cpuUserVdsmd": =E2=80=9C77.38=E2=80=9D, cpuSysVdsmd": =E2=80=9C18.44" Since similar values are consistently being reported for some time, and = there is a setupNetworks and storage rescan prior to the the failure, = and there is no other indication of anything wrong, I=E2=80=99d just say = the environment or the order of tests or timing has changed, but nothing = wrong with the oVirt code Did any of that changed recently? Does it reproduce locally? Thanks, michal
2017-07-02 16:43:22,920-04 INFO = [org.ovirt.engine.core.bll.MigrateVmToServerCommand] (default task-27) = [87508047-fdc5-4a2f-9692-c83f7b55bbc2] Lock freed to object = 'EngineLock:{exclusiveLocks=3D'[2b34910d-cef2-44d6-a274-30e8473eb5d9=3DVM]= ', sharedLocks=3D''}' 2017-07-02 16:43:22,929-04 DEBUG = [org.ovirt.engine.core.utils.timer.FixedDelayJobListener] = (DefaultQuartzScheduler7) [] Rescheduling = DEFAULT.org.ovirt.engine.core.bll.ColdRebootAutoStartVmsRunner.startFailed= AutoStartVms#-9223372036854775733 as there is no unfired trigger. 2017-07-02 16:43:22,932-04 ERROR = [org.ovirt.engine.api.restapi.resource.AbstractBackendResource] (default = task-27) [] Operation Failed: [Cannot migrate VM. There is no host that = satisfies current scheduling constraints. See below for details:, The = host lago-basic-suite-master-host0 did not satisfy internal filter = CPUOverloaded because its CPU is too loaded.] 2017-07-02 16:43:23,331-04 DEBUG = [org.ovirt.engine.core.utils.timer.FixedDelayJobListener] = (DefaultQuartzScheduler2) [] Rescheduling = DEFAULT.org.ovirt.engine.core.bll.HaAutoStartVmsRunner.startFailedAutoStar= tVms#-9223372036854775793 as there is no unfired trigger. 2017-07-02 16:43:23,332-04 DEBUG = [org.ovirt.engine.core.utils.timer.FixedDelayJobListener] = (DefaultQuartzScheduler2) [] Rescheduling = DEFAULT.org.ovirt.engine.core.bll.tasks.CommandCallbacksPoller.invokeCallb= ackMethods#-9223372036854775783 as there is no unfired trigger. =20 <engine log> =20 =20 =20 Best Regards, =20 Shlomi Ben-David | Software Engineer | Red Hat ISRAEL RHCSA | RHCVA | RHCE IRC: shlomibendavid (on #rhev-integ, #rhev-dev, #rhev-ci) =20 OPEN SOURCE - 1 4 011 && 011 4 1 =20 _______________________________________________ Devel mailing list Devel@ovirt.org http://lists.ovirt.org/mailman/listinfo/devel
--Apple-Mail=_F6BEE682-0833-4DDF-AC85-B3A2BC1EDFF6 Content-Transfer-Encoding: quoted-printable Content-Type: text/html; charset=utf-8 <html><head><meta http-equiv=3D"Content-Type" content=3D"text/html = charset=3Dutf-8"></head><body style=3D"word-wrap: break-word; = -webkit-nbsp-mode: space; -webkit-line-break: after-white-space;" = class=3D""><br class=3D""><div><blockquote type=3D"cite" class=3D""><div = class=3D"">On 3 Jul 2017, at 15:35, Shlomo Ben David <<a = href=3D"mailto:sbendavi@redhat.com" class=3D"">sbendavi@redhat.com</a>>= wrote:</div><br class=3D"Apple-interchange-newline"><div class=3D""><div = dir=3D"ltr" class=3D"">Hi,<br class=3D""><br class=3D"">Test failed: [ = 006_migrations.migrate_vm ]<br class=3D"">Link to suspected patches: = N/A<br class=3D"">Link to Job: <a = href=3D"http://jenkins.ovirt.org/job/test-repo_ovirt_experimental_master/7= 431/" = class=3D"">http://jenkins.ovirt.org/job/test-repo_ovirt_experimental_maste= r/7431/</a><br class=3D"">Link to all logs: <br class=3D"">Error = snippet from the log: <a = href=3D"http://jenkins.ovirt.org/job/test-repo_ovirt_experimental_master/7= 431/artifact/exported-artifacts/basic-suit-master-el7/test_logs/basic-suit= e-master/post-006_migrations.py/" = class=3D"">http://jenkins.ovirt.org/job/test-repo_ovirt_experimental_maste= r/7431/artifact/exported-artifacts/basic-suit-master-el7/test_logs/basic-s= uite-master/post-006_migrations.py/</a><br class=3D""><br = class=3D""><error><br class=3D""><br class=3D""> "Fault = reason is "Operation Failed". Fault detail is "[Cannot migrate VM. There = is no host that satisfies current scheduling constraints. See below for = details:, The host lago-basic-suite-master-host0 did not satisfy = internal filter CPUOverloaded because its CPU is too loaded.]"<div = class=3D""><br class=3D""></div><div class=3D""></error><br = class=3D""></div><div class=3D""><br class=3D""></div><div = class=3D""><engine log><br class=3D""></div><div class=3D""><br = class=3D""></div><div class=3D""><div class=3D"">2017-07-02 = 16:43:22,829-04 INFO = [org.ovirt.engine.core.bll.MigrateVmToServerCommand] (default = task-27) [87508047-fdc5-4a2f-9692-c83f7b55bbc2] Lock Acquired to object = 'EngineLock:{exclusiveLocks=3D'[2b34910d-cef2-44d6-a274-30e8473eb5d9=3DVM]= ', sharedLocks=3D''}'</div><div class=3D"">2017-07-02 16:43:22,833-04 = DEBUG = [org.ovirt.engine.core.dal.dbbroker.PostgresDbEngineDialect$PostgresSimple= JdbcCall] (default task-27) [87508047-fdc5-4a2f-9692-c83f7b55bbc2] = Compiled stored procedure. Call string is [{call = getdiskvmelementspluggedtovm(?)}]</div><div class=3D"">2017-07-02 = 16:43:22,833-04 DEBUG = [org.ovirt.engine.core.dal.dbbroker.PostgresDbEngineDialect$PostgresSimple= JdbcCall] (default task-27) [87508047-fdc5-4a2f-9692-c83f7b55bbc2] = SqlCall for procedure [GetDiskVmElementsPluggedToVm] compiled</div><div = class=3D"">2017-07-02 16:43:22,843-04 DEBUG = [org.ovirt.engine.core.dal.dbbroker.PostgresDbEngineDialect$PostgresSimple= JdbcCall] (default task-27) [87508047-fdc5-4a2f-9692-c83f7b55bbc2] = Compiled stored procedure. Call string is [{call = getattacheddisksnapshotstovm(?, ?)}]</div><div class=3D"">2017-07-02 = 16:43:22,843-04 DEBUG = [org.ovirt.engine.core.dal.dbbroker.PostgresDbEngineDialect$PostgresSimple= JdbcCall] (default task-27) [87508047-fdc5-4a2f-9692-c83f7b55bbc2] = SqlCall for procedure [GetAttachedDiskSnapshotsToVm] compiled</div><div = class=3D"">2017-07-02 16:43:22,919-04 INFO = [org.ovirt.engine.core.bll.scheduling.SchedulingManager] (default = task-27) [87508047-fdc5-4a2f-9692-c83f7b55bbc2] Candidate host = 'lago-basic-suite-master-host0' ('46bdc63d-98f5-4eee-81aa-2fb88b8f7cbe') = was filtered out by 'VAR__FILTERTYPE__INTERNAL' filter 'CPUOverloaded' = (correlation id: null)</div><div class=3D"">2017-07-02 16:43:22,920-04 = WARN [org.ovirt.engine.core.bll.MigrateVmToServerCommand] (default = task-27) [87508047-fdc5-4a2f-9692-c83f7b55bbc2] Validation of action = 'MigrateVmToServer' failed for user admin@internal-authz. Reasons: = VAR__ACTION__MIGRATE,VAR__TYPE__VM,SCHEDULING_ALL_HOSTS_FILTERED_OUT,VAR__= FILTERTYPE__INTERNAL,$hostName lago-basic-suite-master-host0,$filterName = CPUOverloaded,VAR__DETAIL__CPU_OVERLOADED,SCHEDULING_HOST_FILTERED_REASON_= WITH_DETAIL</div></div></div></div></blockquote><div><br = class=3D""><div><br class=3D""></div><div>This has nothing to do with = migration<br class=3D"">The CPUOverload is a scheduling policy, unless = there was any change in that area the obvious explanation would be that = the host has a CPU overload condition.<br class=3D"">I briefly looked at = logs and see ""cpuUser": "83.40", "cpuSys": "16.59", "cpuIdle": = =E2=80=9C0.08=E2=80=9D=E2=80=9D which indeed suggests an overload, from = the same sample I can see it=E2=80=99s vdsm ("cpuUserVdsmd": = =E2=80=9C77.38=E2=80=9D, cpuSysVdsmd": =E2=80=9C18.44"<br = class=3D""><br class=3D""></div>Since similar values are consistently = being reported for some time, and there is a setupNetworks and storage = rescan prior to the the failure, and there is no other indication of = anything wrong, I=E2=80=99d just say the environment or the order of = tests or timing has changed, but nothing wrong with the oVirt = code</div><div>Did any of that changed recently? Does it reproduce = locally?</div><div><br = class=3D""></div><div>Thanks,</div><div>michal</div><div><br = class=3D""></div><blockquote type=3D"cite" class=3D""><div class=3D""><div= dir=3D"ltr" class=3D""><div class=3D""><div class=3D"">2017-07-02 = 16:43:22,920-04 INFO = [org.ovirt.engine.core.bll.MigrateVmToServerCommand] (default = task-27) [87508047-fdc5-4a2f-9692-c83f7b55bbc2] Lock freed to object = 'EngineLock:{exclusiveLocks=3D'[2b34910d-cef2-44d6-a274-30e8473eb5d9=3DVM]= ', sharedLocks=3D''}'</div><div class=3D"">2017-07-02 16:43:22,929-04 = DEBUG [org.ovirt.engine.core.utils.timer.FixedDelayJobListener] = (DefaultQuartzScheduler7) [] Rescheduling <a href=3D"http://DEFAULT.org" = class=3D"">DEFAULT.org</a>.ovirt.engine.core.bll.ColdRebootAutoStartVmsRun= ner.startFailedAutoStartVms#-9223372036854775733 as there is no unfired = trigger.</div><div class=3D"">2017-07-02 16:43:22,932-04 ERROR = [org.ovirt.engine.api.restapi.resource.AbstractBackendResource] (default = task-27) [] Operation Failed: [Cannot migrate VM. There is no host that = satisfies current scheduling constraints. See below for details:, The = host lago-basic-suite-master-host0 did not satisfy internal filter = CPUOverloaded because its CPU is too loaded.]</div><div = class=3D"">2017-07-02 16:43:23,331-04 DEBUG = [org.ovirt.engine.core.utils.timer.FixedDelayJobListener] = (DefaultQuartzScheduler2) [] Rescheduling <a href=3D"http://DEFAULT.org" = class=3D"">DEFAULT.org</a>.ovirt.engine.core.bll.HaAutoStartVmsRunner.star= tFailedAutoStartVms#-9223372036854775793 as there is no unfired = trigger.</div><div class=3D"">2017-07-02 16:43:23,332-04 DEBUG = [org.ovirt.engine.core.utils.timer.FixedDelayJobListener] = (DefaultQuartzScheduler2) [] Rescheduling <a href=3D"http://DEFAULT.org" = class=3D"">DEFAULT.org</a>.ovirt.engine.core.bll.tasks.CommandCallbacksPol= ler.invokeCallbackMethods#-9223372036854775783 as there is no unfired = trigger.</div><div class=3D""><br class=3D""></div><engine log><br = class=3D""><br class=3D""><div class=3D""><br class=3D""></div><br = clear=3D"all" class=3D""><div class=3D""><div = class=3D"gmail_signature"><div dir=3D"ltr" class=3D""><div class=3D""><div= dir=3D"ltr" class=3D""><div dir=3D"ltr" class=3D""><div dir=3D"ltr" = class=3D""><div dir=3D"ltr" class=3D""><div dir=3D"ltr" class=3D""><div = dir=3D"ltr" class=3D""><div class=3D"">Best Regards,</div><div dir=3D"ltr"= class=3D""><br class=3D""></div><div dir=3D"ltr" class=3D"">Shlomi = Ben-David | Software Engineer <span style=3D"font-size:small" = class=3D"">| </span><span style=3D"font-size:12.8px" class=3D"">Red = Hat ISRAEL</span></div><div class=3D"">RHCSA | <span = style=3D"font-size:small" class=3D"">RHCVA | </span><span = style=3D"font-size:small" class=3D"">RHCE</span></div><div dir=3D"ltr" = class=3D"">IRC: shlomibendavid <span style=3D"font-size:small" = class=3D"">(on #rhev-integ, #rhev-dev, #rhev-ci)</span><br class=3D""><br = class=3D"">OPEN SOURCE - 1 4 011 && 011 4 1<br class=3D""><br = class=3D""></div></div></div></div></div></div></div></div></div></div></d= iv> </div></div> _______________________________________________<br class=3D"">Devel = mailing list<br class=3D""><a href=3D"mailto:Devel@ovirt.org" = class=3D"">Devel@ovirt.org</a><br = class=3D"">http://lists.ovirt.org/mailman/listinfo/devel</div></blockquote=
</div><br class=3D""></body></html>=
--Apple-Mail=_F6BEE682-0833-4DDF-AC85-B3A2BC1EDFF6--