[NOTICE] Failing randomly on get_host_hooks with a NullPointerException

Dafna Ron

22 Mar 2019 22 Mar '19

9:34 a.m.

Hi, we are randomly failing on get_host_hooks test for at least 3 weeks. its not a specific branch or project and there are no commonalities that I can see, aside from not being able to communicate with the host. this week its started happening at least once a day (this morning, 2 out of 3 failures were due to that test). This test has been added by Yaniv Kaul over a year ago and he is no longer working on ovirt I think someone else should take ownership of this test and fix it. Please let me know if you are intending to investigate and either fix the failure or fix the test if not I will add a skip to the test, Thanks, Dafna

Attachments:

attachment.html (text/html — 883 bytes)

Show replies by date

Sandro Bonazzola

22 Mar 22 Mar

10:03 a.m.

Il giorno ven 22 mar 2019 alle ore 09:34 Dafna Ron <dron@redhat.com> ha scritto:

...

Hi,

we are randomly failing on get_host_hooks test for at least 3 weeks. its not a specific branch or project and there are no commonalities that I can see, aside from not being able to communicate with the host.

this week its started happening at least once a day (this morning, 2 out of 3 failures were due to that test).

This test has been added by Yaniv Kaul over a year ago and he is no longer working on ovirt I think someone else should take ownership of this test and fix it. Please let me know if you are intending to investigate and either fix the failure or fix the test if not I will add a skip to the test,

Please add a skip to the test and if someone will step in maintaining this test it will be re-enabled.

...

Thanks, Dafna

-- SANDRO BONAZZOLA MANAGER, SOFTWARE ENGINEERING, EMEA R&D RHV Red Hat EMEA <https://www.redhat.com/> sbonazzo@redhat.com <https://red.ht/sig>

Dafna Ron

10:27 a.m.

patch submitted: https://gerrit.ovirt.org/#/c/98773/ Thanks, Dafna On Fri, Mar 22, 2019 at 9:04 AM Sandro Bonazzola <sbonazzo@redhat.com> wrote:

...

Il giorno ven 22 mar 2019 alle ore 09:34 Dafna Ron <dron@redhat.com> ha scritto:

...
Hi,

we are randomly failing on get_host_hooks test for at least 3 weeks. its not a specific branch or project and there are no commonalities that I can see, aside from not being able to communicate with the host.

this week its started happening at least once a day (this morning, 2 out of 3 failures were due to that test).

This test has been added by Yaniv Kaul over a year ago and he is no longer working on ovirt I think someone else should take ownership of this test and fix it. Please let me know if you are intending to investigate and either fix the failure or fix the test if not I will add a skip to the test,

Please add a skip to the test and if someone will step in maintaining this test it will be re-enabled.

...
Thanks, Dafna

--

SANDRO BONAZZOLA

MANAGER, SOFTWARE ENGINEERING, EMEA R&D RHV

Red Hat EMEA <https://www.redhat.com/>

sbonazzo@redhat.com <https://red.ht/sig>

Dan Kenigsberg

10:51 a.m.

Yes, I'm repeating myself. SKIPPING TESTS IS BAD We have a test suite in order to fix bugs, not in order to kill itself. Host hooks are Infra. Infra is mperina, rnori and msobczik. Please consult with them before you shut our collective eyes. Please point them to a failing job, and record the failing traceback. On Fri, 22 Mar 2019, 11:27 Dafna Ron, <dron@redhat.com> wrote:

...

patch submitted: https://gerrit.ovirt.org/#/c/98773/

Thanks, Dafna

On Fri, Mar 22, 2019 at 9:04 AM Sandro Bonazzola <sbonazzo@redhat.com> wrote:

...
Il giorno ven 22 mar 2019 alle ore 09:34 Dafna Ron <dron@redhat.com> ha scritto:

...
Hi,

we are randomly failing on get_host_hooks test for at least 3 weeks. its not a specific branch or project and there are no commonalities that I can see, aside from not being able to communicate with the host.

this week its started happening at least once a day (this morning, 2 out of 3 failures were due to that test).

This test has been added by Yaniv Kaul over a year ago and he is no longer working on ovirt I think someone else should take ownership of this test and fix it. Please let me know if you are intending to investigate and either fix the failure or fix the test if not I will add a skip to the test,

Please add a skip to the test and if someone will step in maintaining this test it will be re-enabled.

...
Thanks, Dafna

--

SANDRO BONAZZOLA

MANAGER, SOFTWARE ENGINEERING, EMEA R&D RHV

Red Hat EMEA <https://www.redhat.com/>

sbonazzo@redhat.com <https://red.ht/sig>

Sandro Bonazzola

10:59 a.m.

Il giorno ven 22 mar 2019 alle ore 10:52 Dan Kenigsberg <danken@redhat.com> ha scritto:

...

Yes, I'm repeating myself. SKIPPING TESTS IS BAD

I agree. And having the suite failing on a broken test skipping all the following tests is even worse. This is why I would prefer the rest of the product being tested while someone take ownership of the broken test and fix it.

...

We have a test suite in order to fix bugs, not in order to kill itself.

Host hooks are Infra. Infra is mperina, rnori and msobczik. Please consult with them before you shut our collective eyes.

Please point them to a failing job, and record the failing traceback.

On Fri, 22 Mar 2019, 11:27 Dafna Ron, <dron@redhat.com> wrote:

...
patch submitted: https://gerrit.ovirt.org/#/c/98773/

Thanks, Dafna

On Fri, Mar 22, 2019 at 9:04 AM Sandro Bonazzola <sbonazzo@redhat.com> wrote:

...
Il giorno ven 22 mar 2019 alle ore 09:34 Dafna Ron <dron@redhat.com> ha scritto:

...
Hi,

we are randomly failing on get_host_hooks test for at least 3 weeks. its not a specific branch or project and there are no commonalities that I can see, aside from not being able to communicate with the host.

this week its started happening at least once a day (this morning, 2 out of 3 failures were due to that test).

This test has been added by Yaniv Kaul over a year ago and he is no longer working on ovirt I think someone else should take ownership of this test and fix it. Please let me know if you are intending to investigate and either fix the failure or fix the test if not I will add a skip to the test,

Please add a skip to the test and if someone will step in maintaining this test it will be re-enabled.

...
Thanks, Dafna

--

SANDRO BONAZZOLA

MANAGER, SOFTWARE ENGINEERING, EMEA R&D RHV

Red Hat EMEA <https://www.redhat.com/>

sbonazzo@redhat.com <https://red.ht/sig>

-- SANDRO BONAZZOLA MANAGER, SOFTWARE ENGINEERING, EMEA R&D RHV Red Hat EMEA <https://www.redhat.com/> sbonazzo@redhat.com <https://red.ht/sig>

Dan Kenigsberg

11:14 a.m.

On Fri, 22 Mar 2019, 12:00 Sandro Bonazzola, <sbonazzo@redhat.com> wrote:

...

Il giorno ven 22 mar 2019 alle ore 10:52 Dan Kenigsberg <danken@redhat.com> ha scritto:

...
Yes, I'm repeating myself. SKIPPING TESTS IS BAD

I agree. And having the suite failing on a broken test skipping all the following tests is even worse. This is why I would prefer the rest of the product being tested while someone take ownership of the broken test and fix it.

This is a good reason to rewrite OST with pytest, which continues on failure. And a good reason to ping mperina on IRC to debug this. And a good reason not to merge new code. It doesn't convince me that we should ignore the failure without due debugging.

...

...
We have a test suite in order to fix bugs, not in order to kill itself.

Host hooks are Infra. Infra is mperina, rnori and msobczik. Please consult with them before you shut our collective eyes.

Please point them to a failing job, and record the failing traceback.

On Fri, 22 Mar 2019, 11:27 Dafna Ron, <dron@redhat.com> wrote:

...
patch submitted: https://gerrit.ovirt.org/#/c/98773/

Thanks, Dafna

On Fri, Mar 22, 2019 at 9:04 AM Sandro Bonazzola <sbonazzo@redhat.com> wrote:

...
Il giorno ven 22 mar 2019 alle ore 09:34 Dafna Ron <dron@redhat.com> ha scritto:

...
Hi,

we are randomly failing on get_host_hooks test for at least 3 weeks. its not a specific branch or project and there are no commonalities that I can see, aside from not being able to communicate with the host.

this week its started happening at least once a day (this morning, 2 out of 3 failures were due to that test).

This test has been added by Yaniv Kaul over a year ago and he is no longer working on ovirt I think someone else should take ownership of this test and fix it. Please let me know if you are intending to investigate and either fix the failure or fix the test if not I will add a skip to the test,

Please add a skip to the test and if someone will step in maintaining this test it will be re-enabled.

...
Thanks, Dafna

--

SANDRO BONAZZOLA

MANAGER, SOFTWARE ENGINEERING, EMEA R&D RHV

Red Hat EMEA <https://www.redhat.com/>

sbonazzo@redhat.com <https://red.ht/sig>

--

SANDRO BONAZZOLA

MANAGER, SOFTWARE ENGINEERING, EMEA R&D RHV

Red Hat EMEA <https://www.redhat.com/>

sbonazzo@redhat.com <https://red.ht/sig>

Sandro Bonazzola

11:20 a.m.

Il giorno ven 22 mar 2019 alle ore 11:14 Dan Kenigsberg <danken@redhat.com> ha scritto:

...

On Fri, 22 Mar 2019, 12:00 Sandro Bonazzola, <sbonazzo@redhat.com> wrote:

...
Il giorno ven 22 mar 2019 alle ore 10:52 Dan Kenigsberg < danken@redhat.com> ha scritto:

...
Yes, I'm repeating myself. SKIPPING TESTS IS BAD

I agree. And having the suite failing on a broken test skipping all the following tests is even worse. This is why I would prefer the rest of the product being tested while someone take ownership of the broken test and fix it.

This is a good reason to rewrite OST with pytest, which continues on failure.

Patches are welcome :-)

...

And a good reason to ping mperina on IRC to debug this. And a good reason not to merge new code.

It doesn't convince me that we should ignore the failure without due debugging.

Debugging in indeed needed but not on production system blocking the rest of the CI. Maintainer of the test can debug it on own test environment.

...

...
...
We have a test suite in order to fix bugs, not in order to kill itself.

Host hooks are Infra. Infra is mperina, rnori and msobczik. Please consult with them before you shut our collective eyes.

Please point them to a failing job, and record the failing traceback.

On Fri, 22 Mar 2019, 11:27 Dafna Ron, <dron@redhat.com> wrote:

...
patch submitted: https://gerrit.ovirt.org/#/c/98773/

Thanks, Dafna

On Fri, Mar 22, 2019 at 9:04 AM Sandro Bonazzola <sbonazzo@redhat.com> wrote:

...
Il giorno ven 22 mar 2019 alle ore 09:34 Dafna Ron <dron@redhat.com> ha scritto:

...
Hi,

we are randomly failing on get_host_hooks test for at least 3 weeks. its not a specific branch or project and there are no commonalities that I can see, aside from not being able to communicate with the host.

this week its started happening at least once a day (this morning, 2 out of 3 failures were due to that test).

This test has been added by Yaniv Kaul over a year ago and he is no longer working on ovirt I think someone else should take ownership of this test and fix it. Please let me know if you are intending to investigate and either fix the failure or fix the test if not I will add a skip to the test,

Please add a skip to the test and if someone will step in maintaining this test it will be re-enabled.

...
Thanks, Dafna

--

SANDRO BONAZZOLA

MANAGER, SOFTWARE ENGINEERING, EMEA R&D RHV

Red Hat EMEA <https://www.redhat.com/>

sbonazzo@redhat.com <https://red.ht/sig>

--

SANDRO BONAZZOLA

MANAGER, SOFTWARE ENGINEERING, EMEA R&D RHV

Red Hat EMEA <https://www.redhat.com/>

sbonazzo@redhat.com <https://red.ht/sig>

-- SANDRO BONAZZOLA MANAGER, SOFTWARE ENGINEERING, EMEA R&D RHV Red Hat EMEA <https://www.redhat.com/> sbonazzo@redhat.com <https://red.ht/sig>

Dan Kenigsberg

12:42 p.m.

On Fri, 22 Mar 2019, 12:21 Sandro Bonazzola, <sbonazzo@redhat.com> wrote:

...

Il giorno ven 22 mar 2019 alle ore 11:14 Dan Kenigsberg <danken@redhat.com> ha scritto:

...
On Fri, 22 Mar 2019, 12:00 Sandro Bonazzola, <sbonazzo@redhat.com> wrote:

...
Il giorno ven 22 mar 2019 alle ore 10:52 Dan Kenigsberg < danken@redhat.com> ha scritto:

...
Yes, I'm repeating myself. SKIPPING TESTS IS BAD

I agree. And having the suite failing on a broken test skipping all the following tests is even worse. This is why I would prefer the rest of the product being tested while someone take ownership of the broken test and fix it.

This is a good reason to rewrite OST with pytest, which continues on failure.

Patches are welcome :-)

This is not an empty gesture. The network suite came into being because of this issue (and others)

...

...
And a good reason to ping mperina on IRC to debug this. And a good reason not to merge new code.

It doesn't convince me that we should ignore the failure without due debugging.

Debugging in indeed needed but not on production system blocking the rest of the CI. Maintainer of the test can debug it on own test environment.

The product of this system are bugs. We found one. If you skip it, we all risk it being forgotten. Skipping should be rare, and happen only after the owner is found and admits that he is too busy/lazy to fix it now, and files a bug to fix it later.

...

...
...
...
We have a test suite in order to fix bugs, not in order to kill itself.

Host hooks are Infra. Infra is mperina, rnori and msobczik. Please consult with them before you shut our collective eyes.

Please point them to a failing job, and record the failing traceback.

On Fri, 22 Mar 2019, 11:27 Dafna Ron, <dron@redhat.com> wrote:

...
patch submitted: https://gerrit.ovirt.org/#/c/98773/

Thanks, Dafna

On Fri, Mar 22, 2019 at 9:04 AM Sandro Bonazzola <sbonazzo@redhat.com> wrote:

...
Il giorno ven 22 mar 2019 alle ore 09:34 Dafna Ron <dron@redhat.com> ha scritto:

> Hi, > > we are randomly failing on get_host_hooks test for at least 3 weeks. > its not a specific branch or project and there are no commonalities > that I can see, aside from not being able to communicate with the host. > > this week its started happening at least once a day (this morning, 2 > out of 3 failures were due to that test). > > This test has been added by Yaniv Kaul over a year ago and he is no > longer working on ovirt I think someone else should take ownership of this > test and fix it. > Please let me know if you are intending to investigate and either > fix the failure or fix the test if not I will add a skip to the test, >

Please add a skip to the test and if someone will step in maintaining this test it will be re-enabled.

> > Thanks, > Dafna > > >

--

SANDRO BONAZZOLA

MANAGER, SOFTWARE ENGINEERING, EMEA R&D RHV

Red Hat EMEA <https://www.redhat.com/>

sbonazzo@redhat.com <https://red.ht/sig>

--

SANDRO BONAZZOLA

MANAGER, SOFTWARE ENGINEERING, EMEA R&D RHV

Red Hat EMEA <https://www.redhat.com/>

sbonazzo@redhat.com <https://red.ht/sig>

--

SANDRO BONAZZOLA

MANAGER, SOFTWARE ENGINEERING, EMEA R&D RHV

Red Hat EMEA <https://www.redhat.com/>

sbonazzo@redhat.com <https://red.ht/sig>

Sandro Bonazzola

12:57 p.m.

Il giorno ven 22 mar 2019 alle ore 12:42 Dan Kenigsberg <danken@redhat.com> ha scritto:

...

On Fri, 22 Mar 2019, 12:21 Sandro Bonazzola, <sbonazzo@redhat.com> wrote:

...
Il giorno ven 22 mar 2019 alle ore 11:14 Dan Kenigsberg < danken@redhat.com> ha scritto:

...
On Fri, 22 Mar 2019, 12:00 Sandro Bonazzola, <sbonazzo@redhat.com> wrote:

...
Il giorno ven 22 mar 2019 alle ore 10:52 Dan Kenigsberg < danken@redhat.com> ha scritto:

...
Yes, I'm repeating myself. SKIPPING TESTS IS BAD

I agree. And having the suite failing on a broken test skipping all the following tests is even worse. This is why I would prefer the rest of the product being tested while someone take ownership of the broken test and fix it.

This is a good reason to rewrite OST with pytest, which continues on failure.

Patches are welcome :-)

This is not an empty gesture. The network suite came into being because of this issue (and others)

...
...
And a good reason to ping mperina on IRC to debug this. And a good reason not to merge new code.

It doesn't convince me that we should ignore the failure without due debugging.

Debugging in indeed needed but not on production system blocking the rest of the CI. Maintainer of the test can debug it on own test environment.

The product of this system are bugs. We found one. If you skip it, we all risk it being forgotten. Skipping should be rare, and happen only after the owner is found and admits that he is too busy/lazy to fix it now, and files a bug to fix it later.

We didn't found a bug in the product we are testing, we found a bug in the test that still need to be identified. According to Dafna: "we are randomly failing on get_host_hooks test for at least 3 weeks. its not a specific branch or project and there are no commonalities that I can see," If it was a bug in the product I would have totally agreed with you, it couldn't have been ignored. I'm not saying to ignore this as well. Being a bug in the test itself I would rather prefer take a non reliable test off for further investigation on a development environment and ensure the rest of the tests are being executed in production environment finding bugs on the product if there are.

...

...
...
...
...
We have a test suite in order to fix bugs, not in order to kill itself.

Host hooks are Infra. Infra is mperina, rnori and msobczik. Please consult with them before you shut our collective eyes.

Please point them to a failing job, and record the failing traceback.

On Fri, 22 Mar 2019, 11:27 Dafna Ron, <dron@redhat.com> wrote:

...
patch submitted: https://gerrit.ovirt.org/#/c/98773/

Thanks, Dafna

On Fri, Mar 22, 2019 at 9:04 AM Sandro Bonazzola <sbonazzo@redhat.com> wrote:

> > > Il giorno ven 22 mar 2019 alle ore 09:34 Dafna Ron <dron@redhat.com> > ha scritto: > >> Hi, >> >> we are randomly failing on get_host_hooks test for at least 3 >> weeks. >> its not a specific branch or project and there are no commonalities >> that I can see, aside from not being able to communicate with the host. >> >> this week its started happening at least once a day (this morning, >> 2 out of 3 failures were due to that test). >> >> This test has been added by Yaniv Kaul over a year ago and he is no >> longer working on ovirt I think someone else should take ownership of this >> test and fix it. >> Please let me know if you are intending to investigate and either >> fix the failure or fix the test if not I will add a skip to the test, >> > > Please add a skip to the test and if someone will step in > maintaining this test it will be re-enabled. > > > >> >> Thanks, >> Dafna >> >> >> > > -- > > SANDRO BONAZZOLA > > MANAGER, SOFTWARE ENGINEERING, EMEA R&D RHV > > Red Hat EMEA <https://www.redhat.com/> > > sbonazzo@redhat.com > <https://red.ht/sig> >

--

SANDRO BONAZZOLA

MANAGER, SOFTWARE ENGINEERING, EMEA R&D RHV

Red Hat EMEA <https://www.redhat.com/>

sbonazzo@redhat.com <https://red.ht/sig>

--

SANDRO BONAZZOLA

MANAGER, SOFTWARE ENGINEERING, EMEA R&D RHV

Red Hat EMEA <https://www.redhat.com/>

sbonazzo@redhat.com <https://red.ht/sig>

-- SANDRO BONAZZOLA MANAGER, SOFTWARE ENGINEERING, EMEA R&D RHV Red Hat EMEA <https://www.redhat.com/> sbonazzo@redhat.com <https://red.ht/sig>

Dan Kenigsberg

4:08 p.m.

On Fri, Mar 22, 2019 at 1:57 PM Sandro Bonazzola <sbonazzo@redhat.com> wrote:

...

Il giorno ven 22 mar 2019 alle ore 12:42 Dan Kenigsberg <danken@redhat.com> ha scritto:

...
On Fri, 22 Mar 2019, 12:21 Sandro Bonazzola, <sbonazzo@redhat.com> wrote:

...
Il giorno ven 22 mar 2019 alle ore 11:14 Dan Kenigsberg < danken@redhat.com> ha scritto:

...
On Fri, 22 Mar 2019, 12:00 Sandro Bonazzola, <sbonazzo@redhat.com> wrote:

...
Il giorno ven 22 mar 2019 alle ore 10:52 Dan Kenigsberg < danken@redhat.com> ha scritto:

...
Yes, I'm repeating myself. SKIPPING TESTS IS BAD

I agree. And having the suite failing on a broken test skipping all the following tests is even worse. This is why I would prefer the rest of the product being tested while someone take ownership of the broken test and fix it.

This is a good reason to rewrite OST with pytest, which continues on failure.

Patches are welcome :-)

This is not an empty gesture. The network suite came into being because of this issue (and others)

...
...
And a good reason to ping mperina on IRC to debug this. And a good reason not to merge new code.

It doesn't convince me that we should ignore the failure without due debugging.

Debugging in indeed needed but not on production system blocking the rest of the CI. Maintainer of the test can debug it on own test environment.

The product of this system are bugs. We found one. If you skip it, we all risk it being forgotten. Skipping should be rare, and happen only after the owner is found and admits that he is too busy/lazy to fix it now, and files a bug to fix it later.

We didn't found a bug in the product we are testing, we found a bug in the test that still need to be identified. According to Dafna: "we are randomly failing on get_host_hooks test for at least 3 weeks. its not a specific branch or project and there are no commonalities that I can see," If it was a bug in the product I would have totally agreed with you, it couldn't have been ignored. I'm not saying to ignore this as well. Being a bug in the test itself

I have no idea if this is the case. NullPointerException smells like something coming deep from Engine's data model

...

I would rather prefer take a non reliable test off for further investigation on a development environment and ensure the rest of the tests are being executed in production environment finding bugs on the product if there are.

Skip is still on the table as an option. The infra team may request us to use it. But we should first put the pressure on them to fix it properly. mperina and msobczyk are now aware of the issue; they should decide if they fix it now or asynchronously.

Dafna Ron

26 Mar 26 Mar

2:57 p.m.

So another failure today. http://jenkins.ovirt.org/job/ovirt-4.3_change-queue-tester/405/ any updates? On Fri, Mar 22, 2019 at 3:08 PM Dan Kenigsberg <danken@redhat.com> wrote:

...

On Fri, Mar 22, 2019 at 1:57 PM Sandro Bonazzola <sbonazzo@redhat.com> wrote:

...
Il giorno ven 22 mar 2019 alle ore 12:42 Dan Kenigsberg < danken@redhat.com> ha scritto:

...
On Fri, 22 Mar 2019, 12:21 Sandro Bonazzola, <sbonazzo@redhat.com> wrote:

...
Il giorno ven 22 mar 2019 alle ore 11:14 Dan Kenigsberg < danken@redhat.com> ha scritto:

...
On Fri, 22 Mar 2019, 12:00 Sandro Bonazzola, <sbonazzo@redhat.com> wrote:

...
Il giorno ven 22 mar 2019 alle ore 10:52 Dan Kenigsberg < danken@redhat.com> ha scritto:

> Yes, I'm repeating myself. > SKIPPING TESTS IS BAD >

I agree. And having the suite failing on a broken test skipping all the following tests is even worse. This is why I would prefer the rest of the product being tested while someone take ownership of the broken test and fix it.

This is a good reason to rewrite OST with pytest, which continues on failure.

Patches are welcome :-)

This is not an empty gesture. The network suite came into being because of this issue (and others)

...
...
And a good reason to ping mperina on IRC to debug this. And a good reason not to merge new code.

It doesn't convince me that we should ignore the failure without due debugging.

Debugging in indeed needed but not on production system blocking the rest of the CI. Maintainer of the test can debug it on own test environment.

The product of this system are bugs. We found one. If you skip it, we all risk it being forgotten. Skipping should be rare, and happen only after the owner is found and admits that he is too busy/lazy to fix it now, and files a bug to fix it later.

We didn't found a bug in the product we are testing, we found a bug in the test that still need to be identified. According to Dafna: "we are randomly failing on get_host_hooks test for at least 3 weeks. its not a specific branch or project and there are no commonalities that I can see," If it was a bug in the product I would have totally agreed with you, it couldn't have been ignored. I'm not saying to ignore this as well. Being a bug in the test itself

I have no idea if this is the case. NullPointerException smells like something coming deep from Engine's data model

...
I would rather prefer take a non reliable test off for further investigation on a development environment and ensure the rest of the tests are being executed in production environment finding bugs on the product if there are.

Skip is still on the table as an option. The infra team may request us to use it. But we should first put the pressure on them to fix it properly. mperina and msobczyk are now aware of the issue; they should decide if they fix it now or asynchronously.

Dafna Ron

22 Mar 22 Mar

11 a.m.

Hi Dan, I could not agree more that skipping a test is bad. This has been reported several times and all managers and developers are on the mailing list so if they choose to ignore it and not take ownership, and the test is effecting all other projects, we will have no choice. Martin is on the mail and can take ownership at any point. build is: http://jenkins.ovirt.org/job/ovirt-master_change-queue-tester/13508/ Thanks, Dafna On Fri, Mar 22, 2019 at 9:52 AM Dan Kenigsberg <danken@redhat.com> wrote:

...

Yes, I'm repeating myself. SKIPPING TESTS IS BAD

We have a test suite in order to fix bugs, not in order to kill itself.

Host hooks are Infra. Infra is mperina, rnori and msobczik. Please consult with them before you shut our collective eyes.

Please point them to a failing job, and record the failing traceback.

On Fri, 22 Mar 2019, 11:27 Dafna Ron, <dron@redhat.com> wrote:

...
patch submitted: https://gerrit.ovirt.org/#/c/98773/

Thanks, Dafna

On Fri, Mar 22, 2019 at 9:04 AM Sandro Bonazzola <sbonazzo@redhat.com> wrote:

...
Il giorno ven 22 mar 2019 alle ore 09:34 Dafna Ron <dron@redhat.com> ha scritto:

...
Hi,

we are randomly failing on get_host_hooks test for at least 3 weeks. its not a specific branch or project and there are no commonalities that I can see, aside from not being able to communicate with the host.

this week its started happening at least once a day (this morning, 2 out of 3 failures were due to that test).

This test has been added by Yaniv Kaul over a year ago and he is no longer working on ovirt I think someone else should take ownership of this test and fix it. Please let me know if you are intending to investigate and either fix the failure or fix the test if not I will add a skip to the test,

Please add a skip to the test and if someone will step in maintaining this test it will be re-enabled.

...
Thanks, Dafna

--

SANDRO BONAZZOLA

MANAGER, SOFTWARE ENGINEERING, EMEA R&D RHV

Red Hat EMEA <https://www.redhat.com/>

sbonazzo@redhat.com <https://red.ht/sig>

Martin Perina

11:18 a.m.

On Fri, Mar 22, 2019 at 11:00 AM Dafna Ron <dron@redhat.com> wrote:

...

Hi Dan,

I could not agree more that skipping a test is bad. This has been reported several times and all managers and developers are on the mailing list so if they choose to ignore it and not take ownership, and the test is effecting all other projects, we will have no choice. Martin is on the mail and can take ownership at any point.

build is: http://jenkins.ovirt.org/job/ovirt-master_change-queue-tester/13508/

Marcin, could you please take a look?

...

Thanks, Dafna

On Fri, Mar 22, 2019 at 9:52 AM Dan Kenigsberg <danken@redhat.com> wrote:

...
Yes, I'm repeating myself. SKIPPING TESTS IS BAD

We have a test suite in order to fix bugs, not in order to kill itself.

Host hooks are Infra. Infra is mperina, rnori and msobczik. Please consult with them before you shut our collective eyes.

Please point them to a failing job, and record the failing traceback.

On Fri, 22 Mar 2019, 11:27 Dafna Ron, <dron@redhat.com> wrote:

...
patch submitted: https://gerrit.ovirt.org/#/c/98773/

Thanks, Dafna

On Fri, Mar 22, 2019 at 9:04 AM Sandro Bonazzola <sbonazzo@redhat.com> wrote:

...
Il giorno ven 22 mar 2019 alle ore 09:34 Dafna Ron <dron@redhat.com> ha scritto:

...
Hi,

we are randomly failing on get_host_hooks test for at least 3 weeks. its not a specific branch or project and there are no commonalities that I can see, aside from not being able to communicate with the host.

this week its started happening at least once a day (this morning, 2 out of 3 failures were due to that test).

This test has been added by Yaniv Kaul over a year ago and he is no longer working on ovirt I think someone else should take ownership of this test and fix it. Please let me know if you are intending to investigate and either fix the failure or fix the test if not I will add a skip to the test,

Please add a skip to the test and if someone will step in maintaining this test it will be re-enabled.

...
Thanks, Dafna

--

SANDRO BONAZZOLA

MANAGER, SOFTWARE ENGINEERING, EMEA R&D RHV

Red Hat EMEA <https://www.redhat.com/>

sbonazzo@redhat.com <https://red.ht/sig>

-- Martin Perina Manager, Software Engineering Red Hat Czech s.r.o.

Marcin Sobczyk

11:31 a.m.

New subject: [NOTICE] Failing randomly on get_host_hooks with a NullPointerException

Sure, I'm on it. On 3/22/19 11:18 AM, Martin Perina wrote:

...

On Fri, Mar 22, 2019 at 11:00 AM Dafna Ron <dron@redhat.com <mailto:dron@redhat.com>> wrote:

Hi Dan,

I could not agree more that skipping a test is bad. This has been reported several times and all managers and developers are on the mailing list so if they choose to ignore it and not take ownership, and the test is effecting all other projects, we will have no choice. Martin is on the mail and can take ownership at any point.

build is: http://jenkins.ovirt.org/job/ovirt-master_change-queue-tester/13508/

Marcin, could you please take a look?

Thanks, Dafna

On Fri, Mar 22, 2019 at 9:52 AM Dan Kenigsberg <danken@redhat.com <mailto:danken@redhat.com>> wrote:

Yes, I'm repeating myself. SKIPPING TESTS IS BAD

We have a test suite in order to fix bugs, not in order to kill itself.

Host hooks are Infra. Infra is mperina, rnori and msobczik. Please consult with them before you shut our collective eyes.

Please point them to a failing job, and record the failing traceback.

On Fri, 22 Mar 2019, 11:27 Dafna Ron, <dron@redhat.com <mailto:dron@redhat.com>> wrote:

patch submitted: https://gerrit.ovirt.org/#/c/98773/

Thanks, Dafna

On Fri, Mar 22, 2019 at 9:04 AM Sandro Bonazzola <sbonazzo@redhat.com <mailto:sbonazzo@redhat.com>> wrote:

Il giorno ven 22 mar 2019 alle ore 09:34 Dafna Ron <dron@redhat.com <mailto:dron@redhat.com>> ha scritto:

Hi,

we are randomly failing on get_host_hooks test for at least 3 weeks. its not a specific branch or project and there are no commonalities that I can see, aside from not being able to communicate with the host.

this week its started happening at least once a day (this morning, 2 out of 3 failures were due to that test).

This test has been added by Yaniv Kaul over a year ago and he is no longer working on ovirt I think someone else should take ownership of this test and fix it. Please let me know if you are intending to investigate and either fix the failure or fix the test if not I will add a skip to the test,

Please add a skip to the test and if someone will step in maintaining this test it will be re-enabled.

Thanks, Dafna

--

SANDRO BONAZZOLA

MANAGER, SOFTWARE ENGINEERING, EMEA R&D RHV

Red Hat EMEA <https://www.redhat.com/>

sbonazzo@redhat.com <mailto:sbonazzo@redhat.com>

<https://red.ht/sig>

-- Martin Perina Manager, Software Engineering Red Hat Czech s.r.o.

2568

Age (days ago)

2572

Last active (days ago)

List overview

Download

13 comments

5 participants

participants (5)

Dafna Ron
Dan Kenigsberg
Marcin Sobczyk
Martin Perina
Sandro Bonazzola

[NOTICE] Failing randomly on get_host_hooks with a NullPointerException

tags

participants (5)