On 13 Nov 2018, at 12:20, Dominik Holler <dholler@redhat.com> wrote:

On Tue, 13 Nov 2018 11:56:37 +0100
Martin Perina <mperina@redhat.com> wrote:

On Tue, Nov 13, 2018 at 11:02 AM Dafna Ron <dron@redhat.com> wrote:

Martin? can you please look at the patch that Dominik sent?
We need to resolve this as we have not had an engine build for the last 11
days


Yesterday I've merged Dominik's revert patch https://gerrit.ovirt.org/95377
which should switch cluster level back to 4.2. Below mentioned change
https://gerrit.ovirt.org/95310 is relevant only to cluster level 4.3, am I
right Michal?

The build mentioned
https://jenkins.ovirt.org/view/Change%20queue%20jobs/job/ovirt-master_change-queue-tester/11121/
is from yesterday. Are we sure that it was executed only after #95377 was
merged? I'd like to see the results from latest
https://jenkins.ovirt.org/view/Change%20queue%20jobs/job/ovirt-master_change-queue-tester/11127/
but unfortunately it already waits more than an hour for available hosts ...





https://gerrit.ovirt.org/#/c/95283/ results in
http://jenkins.ovirt.org/job/ovirt-engine_master_build-artifacts-el7-x86_64/8071/
which is used in
https://jenkins.ovirt.org/view/oVirt%20system%20tests/job/ovirt-system-tests_manual/3489/parameters/
results in run_vms succeeding.

The next merged change
https://gerrit.ovirt.org/#/c/95310/ results in
http://jenkins.ovirt.org/job/ovirt-engine_master_build-artifacts-el7-x86_64/8072/
which is used in
https://jenkins.ovirt.org/view/oVirt%20system%20tests/job/ovirt-system-tests_manual/3490/parameters/
results in run_vms failing with
2018-11-12 17:35:10,109-05 INFO  [org.ovirt.engine.core.bll.RunVmOnceCommand] (default task-1) [6930b632-5593-4481-bf2a-a1d8b14a583a] Running command: RunVmOnceCommand internal: false. Entities affected :  ID: d10aa133-b9b6-455d-8137-ab822d1c1971 Type: VMAction group RUN_VM with role type USER
2018-11-12 17:35:10,113-05 DEBUG [org.ovirt.engine.core.common.di.interceptor.DebugLoggingInterceptor] (default task-1) [6930b632-5593-4481-bf2a-a1d8b14a583a] method: getVmManager, params: [d10aa133-b9b6-455d-8137-ab822d1c1971], timeElapsed: 4ms
2018-11-12 17:35:10,128-05 DEBUG [org.ovirt.engine.core.common.di.interceptor.DebugLoggingInterceptor] (default task-1) [6930b632-5593-4481-bf2a-a1d8b14a583a] method: getAllForClusterWithStatus, params: [2ca9ccd8-61f0-470c-ba3f-07766202f260, Up], timeElapsed: 7ms
2018-11-12 17:35:10,129-05 INFO  [org.ovirt.engine.core.bll.scheduling.SchedulingManager] (default task-1) [6930b632-5593-4481-bf2a-a1d8b14a583a] Candidate host 'lago-basic-suite-master-host-1' ('282860ab-8873-4702-a2be-100a6da111af') was filtered out by 'VAR__FILTERTYPE__INTERNAL' filter 'CPU-Level' (correlation id: 6930b632-5593-4481-bf2a-a1d8b14a583a)
2018-11-12 17:35:10,129-05 INFO  [org.ovirt.engine.core.bll.scheduling.SchedulingManager] (default task-1) [6930b632-5593-4481-bf2a-a1d8b14a583a] Candidate host 'lago-basic-suite-master-host-0' ('c48eca36-ea98-46b2-8473-f184833e68a8') was filtered out by 'VAR__FILTERTYPE__INTERNAL' filter 'CPU-Level' (correlation id: 6930b632-5593-4481-bf2a-a1d8b14a583a)
2018-11-12 17:35:10,130-05 ERROR [org.ovirt.engine.core.bll.RunVmCommand] (default task-1) [6930b632-5593-4481-bf2a-a1d8b14a583a] Can't find VDS to run the VM 'd10aa133-b9b6-455d-8137-ab822d1c1971' on, so this VM will not be run.
in
https://jenkins.ovirt.org/view/oVirt%20system%20tests/job/ovirt-system-tests_manual/3490/artifact/exported-artifacts/test_logs/basic-suite-master/post-004_basic_sanity.py/lago-basic-suite-master-engine/_var_log/ovirt-engine/engine.log/*view*/

Is this helpful for you?


actually, there ire two issues
1) cluster is still 4.3 even after Martin’s revert. 
2) the patch is wrong too, as in HandleVdsCpuFlagsOrClusterChangedCommand it just goes ahead and sets the cluster cpu to whatever the host reported regardless if it is valid or not. Steven, please fix that (line 96 in backend/manager/modules/bll/src/main/java/org/ovirt/engine/core/bll/HandleVdsCpuFlagsOrClusterChangedCommand.java). It needs to pass the validation or we need some other solution. 
3) regardless, we should make 4.3 work too , I tried to play with it a bit in https://gerrit.ovirt.org/#/c/95407/, let’s see…

Thanks,
michal



On Mon, Nov 12, 2018 at 3:58 PM Dominik Holler <dholler@redhat.com> wrote:

On Mon, 12 Nov 2018 13:45:54 +0100
Martin Perina <mperina@redhat.com> wrote:

On Mon, Nov 12, 2018 at 12:58 PM Dominik Holler <dholler@redhat.com>  
wrote:  

On Mon, 12 Nov 2018 12:29:17 +0100
Martin Perina <mperina@redhat.com> wrote:

On Mon, Nov 12, 2018 at 12:20 PM Dafna Ron <dron@redhat.com> wrote:

There are currently two issues failing ovirt-engine on CQ ovirt  
master:  

1. edit vm pool is causing failure in different tests. it has a  
patch  
*waiting  
to be merged*: https://gerrit.ovirt.org/#/c/95354/


Merged


2. we have a failure in upgrade suite as well to run vm but this  
seems  
to  
be related to the tests as well:
2018-11-12 05:41:07,831-05 WARN
[org.ovirt.engine.core.bll.validator.VirtIoRngValidator]  
(default  
task-1)  
[] Random number source URANDOM is not supported in cluster  
'test-cluster'  
compatibility version 4.0.

here is the full error from the upgrade suite failure in run vm:
https://pastebin.com/XLHtWGGx

Here is the latest failure:


https://jenkins.ovirt.org/view/Change%20queue%20jobs/job/ovirt-master_change-queue-tester/11118/



I will try to take a look later today


I have the idea that this might be related to
https://gerrit.ovirt.org/#/c/95377/ , and I check in

https://jenkins.ovirt.org/view/oVirt%20system%20tests/job/ovirt-system-tests_manual/3485/console  
, but I have to stop now, if not solved I can go on later today.


OK, both CI and above manual OST job went fine, so I've just merged the
revert patch. I will take a look at it later in detail, we should  
really be  
testing 4.3 on master and not 4.2


Ack.

Now

https://jenkins.ovirt.org/view/Change%20queue%20jobs/job/ovirt-master_change-queue-tester/11121/
is failing on
File
"/home/jenkins/workspace/ovirt-master_change-queue-tester/ovirt-system-tests/basic-suite-master/test-scenarios/004_basic_sanity.py",
line 698, in run_vms
   api.vms.get(VM0_NAME).start(start_params)
status: 400
reason: Bad Request

2018-11-12 10:06:30,722-05 INFO
[org.ovirt.engine.core.bll.scheduling.SchedulingManager] (default task-3)
[b8d11cb0-5be9-4b7e-b45a-c95fa1f18681] Candidate host
'lago-basic-suite-master-host-1' ('dbfe1b0c-f940-4dba-8fb1-0cfe5ca7ddfc')
was filtered out by 'VAR__FILTERTYPE__INTERNAL' filter 'CPU-Level'
(correlation id: b8d11cb0-5be9-4b7e-b45a-c95fa1f18681)
2018-11-12 10:06:30,722-05 INFO
[org.ovirt.engine.core.bll.scheduling.SchedulingManager] (default task-3)
[b8d11cb0-5be9-4b7e-b45a-c95fa1f18681] Candidate host
'lago-basic-suite-master-host-0' ('e83a63ca-381e-40db-acb2-65a3e7953e11')
was filtered out by 'VAR__FILTERTYPE__INTERNAL' filter 'CPU-Level'
(correlation id: b8d11cb0-5be9-4b7e-b45a-c95fa1f18681)
2018-11-12 10:06:30,723-05 ERROR [org.ovirt.engine.core.bll.RunVmCommand]
(default task-3) [b8d11cb0-5be9-4b7e-b45a-c95fa1f18681] Can't find VDS to
run the VM '57a66eff-8cbf-4643-b045-43d4dda80c66' on, so this VM will not
be run.

Is this related to
https://gerrit.ovirt.org/#/c/95310/
?






Thanks,
Dafna




On Mon, Nov 12, 2018 at 9:23 AM Dominik Holler <  
dholler@redhat.com>  
wrote:  

On Sun, 11 Nov 2018 19:04:40 +0200
Dan Kenigsberg <danken@redhat.com> wrote:

On Sun, Nov 11, 2018 at 5:27 PM Eyal Edri <eedri@redhat.com>  
wrote:  



On Sun, Nov 11, 2018 at 5:24 PM Eyal Edri <eedri@redhat.com>  

wrote:  



On Sun, Nov 11, 2018 at 5:20 PM Dan Kenigsberg <  
danken@redhat.com>  
wrote:  

On Sun, Nov 11, 2018 at 4:36 PM Ehud Yonasi <  
eyonasi@redhat.com>  

wrote:  

Hey,
I've seen that CQ Master is not passing ovirt-engine for  
10  
days  
and fails on test suite called restore_vm0_networking  
here's a snap error regarding it:

https://pastebin.com/7msEYqKT

Link to a sample job with the error:




http://jenkins.ovirt.org/job/ovirt-master_change-queue-tester/11113/artifact/basic-suite.el7.x86_64/004_basic_sanity.py.junit.xml



I cannot follow this link because I'm 4 minutes too late

jenkins.ovirt.org uses an invalid security certificate.  
The  
certificate expired on November 11, 2018, 5:13:25 PM  
GMT+2. The  
current time is November 11, 2018, 5:17 PM.  


Yes, we're looking into that issue now.  


Fixed, you should be able to access it now.  

OST fails during restore_vm0_networking in line 101 of
004_basic_sanity.py while comparing
vm_service.get().status == state

It seems that instead of reporting back the VM status, Engine  
set  
garbage  
"The response content type 'text/html; charset=iso-8859-1'  
isn't the  
expected XML"


The relevant line in



https://jenkins.ovirt.org/job/ovirt-master_change-queue-tester/11113/artifact/basic-suite.el7.x86_64/test_logs/basic-suite-master/post-004_basic_sanity.py/lago-basic-suite-master-engine/_var_log/httpd/ssl_access_log/*view*/

seems to be
192.168.201.1 - - [11/Nov/2018:04:27:43 -0500] "GET
/ovirt-engine/api/vms/26088164-d1a0-4254-a377-5d3c242c8105  
HTTP/1.1"  
503 299  
and I guess the 503 error message is sent in HTML instead of XML.

If I run manually
https://gerrit.ovirt.org/#/c/95354/
with latest build of engine-master



http://jenkins.ovirt.org/job/ovirt-engine_master_build-artifacts-el7-x86_64/8074/

basic suite seems to be happy:
https://jenkins.ovirt.org/view/oVirt system
tests/job/ovirt-system-tests_manual/3484/


I do not know what could cause that, and engine.log does not  
mention  
it. But it seems like a problem in engine API hence +Martin  
Perina  
and  
+Ondra Machacek .










Can some1 have a look at it and help to resolve the  
issue?  


_______________________________________________
Infra mailing list -- infra@ovirt.org
To unsubscribe send an email to infra-leave@ovirt.org
Privacy Statement:  
https://www.ovirt.org/site/privacy-policy/  
oVirt Code of Conduct:  
https://www.ovirt.org/community/about/community-guidelines/  
List Archives:  


https://lists.ovirt.org/archives/list/infra@ovirt.org/message/ZQAYWTLZJKGPJ25F33E6ICVDXQDYSKSQ/


_______________________________________________
Devel mailing list -- devel@ovirt.org
To unsubscribe send an email to devel-leave@ovirt.org
Privacy Statement:  
https://www.ovirt.org/site/privacy-policy/  
oVirt Code of Conduct:  
https://www.ovirt.org/community/about/community-guidelines/  
List Archives:  


https://lists.ovirt.org/archives/list/devel@ovirt.org/message/R5LOJH73XCLLFOUTKPM5GUCS6PNNKGTE/





--

Eyal edri


MANAGER

RHV/CNV DevOps

EMEA VIRTUALIZATION R&D


Red Hat EMEA

TRIED. TESTED. TRUSTED.
phone: +972-9-7692018
irc: eedri (on #tlv #rhev-dev #rhev-integ)  



--

Eyal edri


MANAGER

RHV/CNV DevOps

EMEA VIRTUALIZATION R&D


Red Hat EMEA

TRIED. TESTED. TRUSTED.
phone: +972-9-7692018
irc: eedri (on #tlv #rhev-dev #rhev-integ)  
_______________________________________________
Devel mailing list -- devel@ovirt.org
To unsubscribe send an email to devel-leave@ovirt.org
Privacy Statement: https://www.ovirt.org/site/privacy-policy/
oVirt Code of Conduct:  
https://www.ovirt.org/community/about/community-guidelines/  
List Archives:  


https://lists.ovirt.org/archives/list/devel@ovirt.org/message/DA6Q5RE5JO3FYIKN2QLKLWMCUBQA2HBX/