[NOTICE] Failing randomly on get_host_hooks with a NullPointerException

Hi, we are randomly failing on get_host_hooks test for at least 3 weeks. its not a specific branch or project and there are no commonalities that I can see, aside from not being able to communicate with the host. this week its started happening at least once a day (this morning, 2 out of 3 failures were due to that test). This test has been added by Yaniv Kaul over a year ago and he is no longer working on ovirt I think someone else should take ownership of this test and fix it. Please let me know if you are intending to investigate and either fix the failure or fix the test if not I will add a skip to the test, Thanks, Dafna

Il giorno ven 22 mar 2019 alle ore 09:34 Dafna Ron <dron@redhat.com> ha scritto:
Hi,
we are randomly failing on get_host_hooks test for at least 3 weeks. its not a specific branch or project and there are no commonalities that I can see, aside from not being able to communicate with the host.
this week its started happening at least once a day (this morning, 2 out of 3 failures were due to that test).
This test has been added by Yaniv Kaul over a year ago and he is no longer working on ovirt I think someone else should take ownership of this test and fix it. Please let me know if you are intending to investigate and either fix the failure or fix the test if not I will add a skip to the test,
Please add a skip to the test and if someone will step in maintaining this test it will be re-enabled.
Thanks, Dafna
-- SANDRO BONAZZOLA MANAGER, SOFTWARE ENGINEERING, EMEA R&D RHV Red Hat EMEA <https://www.redhat.com/> sbonazzo@redhat.com <https://red.ht/sig>

patch submitted: https://gerrit.ovirt.org/#/c/98773/ Thanks, Dafna On Fri, Mar 22, 2019 at 9:04 AM Sandro Bonazzola <sbonazzo@redhat.com> wrote:
Il giorno ven 22 mar 2019 alle ore 09:34 Dafna Ron <dron@redhat.com> ha scritto:
Hi,
we are randomly failing on get_host_hooks test for at least 3 weeks. its not a specific branch or project and there are no commonalities that I can see, aside from not being able to communicate with the host.
this week its started happening at least once a day (this morning, 2 out of 3 failures were due to that test).
This test has been added by Yaniv Kaul over a year ago and he is no longer working on ovirt I think someone else should take ownership of this test and fix it. Please let me know if you are intending to investigate and either fix the failure or fix the test if not I will add a skip to the test,
Please add a skip to the test and if someone will step in maintaining this test it will be re-enabled.
Thanks, Dafna
--
SANDRO BONAZZOLA
MANAGER, SOFTWARE ENGINEERING, EMEA R&D RHV
Red Hat EMEA <https://www.redhat.com/>
sbonazzo@redhat.com <https://red.ht/sig>

Yes, I'm repeating myself. SKIPPING TESTS IS BAD We have a test suite in order to fix bugs, not in order to kill itself. Host hooks are Infra. Infra is mperina, rnori and msobczik. Please consult with them before you shut our collective eyes. Please point them to a failing job, and record the failing traceback. On Fri, 22 Mar 2019, 11:27 Dafna Ron, <dron@redhat.com> wrote:
patch submitted: https://gerrit.ovirt.org/#/c/98773/
Thanks, Dafna
On Fri, Mar 22, 2019 at 9:04 AM Sandro Bonazzola <sbonazzo@redhat.com> wrote:
Il giorno ven 22 mar 2019 alle ore 09:34 Dafna Ron <dron@redhat.com> ha scritto:
Hi,
we are randomly failing on get_host_hooks test for at least 3 weeks. its not a specific branch or project and there are no commonalities that I can see, aside from not being able to communicate with the host.
this week its started happening at least once a day (this morning, 2 out of 3 failures were due to that test).
This test has been added by Yaniv Kaul over a year ago and he is no longer working on ovirt I think someone else should take ownership of this test and fix it. Please let me know if you are intending to investigate and either fix the failure or fix the test if not I will add a skip to the test,
Please add a skip to the test and if someone will step in maintaining this test it will be re-enabled.
Thanks, Dafna
--
SANDRO BONAZZOLA
MANAGER, SOFTWARE ENGINEERING, EMEA R&D RHV
Red Hat EMEA <https://www.redhat.com/>
sbonazzo@redhat.com <https://red.ht/sig>

Il giorno ven 22 mar 2019 alle ore 10:52 Dan Kenigsberg <danken@redhat.com> ha scritto:
Yes, I'm repeating myself. SKIPPING TESTS IS BAD
I agree. And having the suite failing on a broken test skipping all the following tests is even worse. This is why I would prefer the rest of the product being tested while someone take ownership of the broken test and fix it.
We have a test suite in order to fix bugs, not in order to kill itself.
Host hooks are Infra. Infra is mperina, rnori and msobczik. Please consult with them before you shut our collective eyes.
Please point them to a failing job, and record the failing traceback.
On Fri, 22 Mar 2019, 11:27 Dafna Ron, <dron@redhat.com> wrote:
patch submitted: https://gerrit.ovirt.org/#/c/98773/
Thanks, Dafna
On Fri, Mar 22, 2019 at 9:04 AM Sandro Bonazzola <sbonazzo@redhat.com> wrote:
Il giorno ven 22 mar 2019 alle ore 09:34 Dafna Ron <dron@redhat.com> ha scritto:
Hi,
we are randomly failing on get_host_hooks test for at least 3 weeks. its not a specific branch or project and there are no commonalities that I can see, aside from not being able to communicate with the host.
this week its started happening at least once a day (this morning, 2 out of 3 failures were due to that test).
This test has been added by Yaniv Kaul over a year ago and he is no longer working on ovirt I think someone else should take ownership of this test and fix it. Please let me know if you are intending to investigate and either fix the failure or fix the test if not I will add a skip to the test,
Please add a skip to the test and if someone will step in maintaining this test it will be re-enabled.
Thanks, Dafna
--
SANDRO BONAZZOLA
MANAGER, SOFTWARE ENGINEERING, EMEA R&D RHV
Red Hat EMEA <https://www.redhat.com/>
sbonazzo@redhat.com <https://red.ht/sig>
-- SANDRO BONAZZOLA MANAGER, SOFTWARE ENGINEERING, EMEA R&D RHV Red Hat EMEA <https://www.redhat.com/> sbonazzo@redhat.com <https://red.ht/sig>

On Fri, 22 Mar 2019, 12:00 Sandro Bonazzola, <sbonazzo@redhat.com> wrote:
Il giorno ven 22 mar 2019 alle ore 10:52 Dan Kenigsberg <danken@redhat.com> ha scritto:
Yes, I'm repeating myself. SKIPPING TESTS IS BAD
I agree. And having the suite failing on a broken test skipping all the following tests is even worse. This is why I would prefer the rest of the product being tested while someone take ownership of the broken test and fix it.
This is a good reason to rewrite OST with pytest, which continues on failure. And a good reason to ping mperina on IRC to debug this. And a good reason not to merge new code. It doesn't convince me that we should ignore the failure without due debugging.
We have a test suite in order to fix bugs, not in order to kill itself.
Host hooks are Infra. Infra is mperina, rnori and msobczik. Please consult with them before you shut our collective eyes.
Please point them to a failing job, and record the failing traceback.
On Fri, 22 Mar 2019, 11:27 Dafna Ron, <dron@redhat.com> wrote:
patch submitted: https://gerrit.ovirt.org/#/c/98773/
Thanks, Dafna
On Fri, Mar 22, 2019 at 9:04 AM Sandro Bonazzola <sbonazzo@redhat.com> wrote:
Il giorno ven 22 mar 2019 alle ore 09:34 Dafna Ron <dron@redhat.com> ha scritto:
Hi,
we are randomly failing on get_host_hooks test for at least 3 weeks. its not a specific branch or project and there are no commonalities that I can see, aside from not being able to communicate with the host.
this week its started happening at least once a day (this morning, 2 out of 3 failures were due to that test).
This test has been added by Yaniv Kaul over a year ago and he is no longer working on ovirt I think someone else should take ownership of this test and fix it. Please let me know if you are intending to investigate and either fix the failure or fix the test if not I will add a skip to the test,
Please add a skip to the test and if someone will step in maintaining this test it will be re-enabled.
Thanks, Dafna
--
SANDRO BONAZZOLA
MANAGER, SOFTWARE ENGINEERING, EMEA R&D RHV
Red Hat EMEA <https://www.redhat.com/>
sbonazzo@redhat.com <https://red.ht/sig>
--
SANDRO BONAZZOLA
MANAGER, SOFTWARE ENGINEERING, EMEA R&D RHV
Red Hat EMEA <https://www.redhat.com/>
sbonazzo@redhat.com <https://red.ht/sig>

Il giorno ven 22 mar 2019 alle ore 11:14 Dan Kenigsberg <danken@redhat.com> ha scritto:
On Fri, 22 Mar 2019, 12:00 Sandro Bonazzola, <sbonazzo@redhat.com> wrote:
Il giorno ven 22 mar 2019 alle ore 10:52 Dan Kenigsberg < danken@redhat.com> ha scritto:
Yes, I'm repeating myself. SKIPPING TESTS IS BAD
I agree. And having the suite failing on a broken test skipping all the following tests is even worse. This is why I would prefer the rest of the product being tested while someone take ownership of the broken test and fix it.
This is a good reason to rewrite OST with pytest, which continues on failure.
Patches are welcome :-)
And a good reason to ping mperina on IRC to debug this. And a good reason not to merge new code.
It doesn't convince me that we should ignore the failure without due debugging.
Debugging in indeed needed but not on production system blocking the rest of the CI. Maintainer of the test can debug it on own test environment.
We have a test suite in order to fix bugs, not in order to kill itself.
Host hooks are Infra. Infra is mperina, rnori and msobczik. Please consult with them before you shut our collective eyes.
Please point them to a failing job, and record the failing traceback.
On Fri, 22 Mar 2019, 11:27 Dafna Ron, <dron@redhat.com> wrote:
patch submitted: https://gerrit.ovirt.org/#/c/98773/
Thanks, Dafna
On Fri, Mar 22, 2019 at 9:04 AM Sandro Bonazzola <sbonazzo@redhat.com> wrote:
Il giorno ven 22 mar 2019 alle ore 09:34 Dafna Ron <dron@redhat.com> ha scritto:
Hi,
we are randomly failing on get_host_hooks test for at least 3 weeks. its not a specific branch or project and there are no commonalities that I can see, aside from not being able to communicate with the host.
this week its started happening at least once a day (this morning, 2 out of 3 failures were due to that test).
This test has been added by Yaniv Kaul over a year ago and he is no longer working on ovirt I think someone else should take ownership of this test and fix it. Please let me know if you are intending to investigate and either fix the failure or fix the test if not I will add a skip to the test,
Please add a skip to the test and if someone will step in maintaining this test it will be re-enabled.
Thanks, Dafna
--
SANDRO BONAZZOLA
MANAGER, SOFTWARE ENGINEERING, EMEA R&D RHV
Red Hat EMEA <https://www.redhat.com/>
sbonazzo@redhat.com <https://red.ht/sig>
--
SANDRO BONAZZOLA
MANAGER, SOFTWARE ENGINEERING, EMEA R&D RHV
Red Hat EMEA <https://www.redhat.com/>
sbonazzo@redhat.com <https://red.ht/sig>
-- SANDRO BONAZZOLA MANAGER, SOFTWARE ENGINEERING, EMEA R&D RHV Red Hat EMEA <https://www.redhat.com/> sbonazzo@redhat.com <https://red.ht/sig>

On Fri, 22 Mar 2019, 12:21 Sandro Bonazzola, <sbonazzo@redhat.com> wrote:
Il giorno ven 22 mar 2019 alle ore 11:14 Dan Kenigsberg <danken@redhat.com> ha scritto:
On Fri, 22 Mar 2019, 12:00 Sandro Bonazzola, <sbonazzo@redhat.com> wrote:
Il giorno ven 22 mar 2019 alle ore 10:52 Dan Kenigsberg < danken@redhat.com> ha scritto:
Yes, I'm repeating myself. SKIPPING TESTS IS BAD
I agree. And having the suite failing on a broken test skipping all the following tests is even worse. This is why I would prefer the rest of the product being tested while someone take ownership of the broken test and fix it.
This is a good reason to rewrite OST with pytest, which continues on failure.
Patches are welcome :-)
This is not an empty gesture. The network suite came into being because of this issue (and others)
And a good reason to ping mperina on IRC to debug this. And a good reason not to merge new code.
It doesn't convince me that we should ignore the failure without due debugging.
Debugging in indeed needed but not on production system blocking the rest of the CI. Maintainer of the test can debug it on own test environment.
The product of this system are bugs. We found one. If you skip it, we all risk it being forgotten. Skipping should be rare, and happen only after the owner is found and admits that he is too busy/lazy to fix it now, and files a bug to fix it later.
We have a test suite in order to fix bugs, not in order to kill itself.
Host hooks are Infra. Infra is mperina, rnori and msobczik. Please consult with them before you shut our collective eyes.
Please point them to a failing job, and record the failing traceback.
On Fri, 22 Mar 2019, 11:27 Dafna Ron, <dron@redhat.com> wrote:
patch submitted: https://gerrit.ovirt.org/#/c/98773/
Thanks, Dafna
On Fri, Mar 22, 2019 at 9:04 AM Sandro Bonazzola <sbonazzo@redhat.com> wrote:
Il giorno ven 22 mar 2019 alle ore 09:34 Dafna Ron <dron@redhat.com> ha scritto:
> Hi, > > we are randomly failing on get_host_hooks test for at least 3 weeks. > its not a specific branch or project and there are no commonalities > that I can see, aside from not being able to communicate with the host. > > this week its started happening at least once a day (this morning, 2 > out of 3 failures were due to that test). > > This test has been added by Yaniv Kaul over a year ago and he is no > longer working on ovirt I think someone else should take ownership of this > test and fix it. > Please let me know if you are intending to investigate and either > fix the failure or fix the test if not I will add a skip to the test, >
Please add a skip to the test and if someone will step in maintaining this test it will be re-enabled.
> > Thanks, > Dafna > > >
--
SANDRO BONAZZOLA
MANAGER, SOFTWARE ENGINEERING, EMEA R&D RHV
Red Hat EMEA <https://www.redhat.com/>
sbonazzo@redhat.com <https://red.ht/sig>
--
SANDRO BONAZZOLA
MANAGER, SOFTWARE ENGINEERING, EMEA R&D RHV
Red Hat EMEA <https://www.redhat.com/>
sbonazzo@redhat.com <https://red.ht/sig>
--
SANDRO BONAZZOLA
MANAGER, SOFTWARE ENGINEERING, EMEA R&D RHV
Red Hat EMEA <https://www.redhat.com/>
sbonazzo@redhat.com <https://red.ht/sig>

Il giorno ven 22 mar 2019 alle ore 12:42 Dan Kenigsberg <danken@redhat.com> ha scritto:
On Fri, 22 Mar 2019, 12:21 Sandro Bonazzola, <sbonazzo@redhat.com> wrote:
Il giorno ven 22 mar 2019 alle ore 11:14 Dan Kenigsberg < danken@redhat.com> ha scritto:
On Fri, 22 Mar 2019, 12:00 Sandro Bonazzola, <sbonazzo@redhat.com> wrote:
Il giorno ven 22 mar 2019 alle ore 10:52 Dan Kenigsberg < danken@redhat.com> ha scritto:
Yes, I'm repeating myself. SKIPPING TESTS IS BAD
I agree. And having the suite failing on a broken test skipping all the following tests is even worse. This is why I would prefer the rest of the product being tested while someone take ownership of the broken test and fix it.
This is a good reason to rewrite OST with pytest, which continues on failure.
Patches are welcome :-)
This is not an empty gesture. The network suite came into being because of this issue (and others)
And a good reason to ping mperina on IRC to debug this. And a good reason not to merge new code.
It doesn't convince me that we should ignore the failure without due debugging.
Debugging in indeed needed but not on production system blocking the rest of the CI. Maintainer of the test can debug it on own test environment.
The product of this system are bugs. We found one. If you skip it, we all risk it being forgotten. Skipping should be rare, and happen only after the owner is found and admits that he is too busy/lazy to fix it now, and files a bug to fix it later.
We didn't found a bug in the product we are testing, we found a bug in the test that still need to be identified. According to Dafna: "we are randomly failing on get_host_hooks test for at least 3 weeks. its not a specific branch or project and there are no commonalities that I can see," If it was a bug in the product I would have totally agreed with you, it couldn't have been ignored. I'm not saying to ignore this as well. Being a bug in the test itself I would rather prefer take a non reliable test off for further investigation on a development environment and ensure the rest of the tests are being executed in production environment finding bugs on the product if there are.
We have a test suite in order to fix bugs, not in order to kill itself.
Host hooks are Infra. Infra is mperina, rnori and msobczik. Please consult with them before you shut our collective eyes.
Please point them to a failing job, and record the failing traceback.
On Fri, 22 Mar 2019, 11:27 Dafna Ron, <dron@redhat.com> wrote:
patch submitted: https://gerrit.ovirt.org/#/c/98773/
Thanks, Dafna
On Fri, Mar 22, 2019 at 9:04 AM Sandro Bonazzola <sbonazzo@redhat.com> wrote:
> > > Il giorno ven 22 mar 2019 alle ore 09:34 Dafna Ron <dron@redhat.com> > ha scritto: > >> Hi, >> >> we are randomly failing on get_host_hooks test for at least 3 >> weeks. >> its not a specific branch or project and there are no commonalities >> that I can see, aside from not being able to communicate with the host. >> >> this week its started happening at least once a day (this morning, >> 2 out of 3 failures were due to that test). >> >> This test has been added by Yaniv Kaul over a year ago and he is no >> longer working on ovirt I think someone else should take ownership of this >> test and fix it. >> Please let me know if you are intending to investigate and either >> fix the failure or fix the test if not I will add a skip to the test, >> > > Please add a skip to the test and if someone will step in > maintaining this test it will be re-enabled. > > > >> >> Thanks, >> Dafna >> >> >> > > -- > > SANDRO BONAZZOLA > > MANAGER, SOFTWARE ENGINEERING, EMEA R&D RHV > > Red Hat EMEA <https://www.redhat.com/> > > sbonazzo@redhat.com > <https://red.ht/sig> >
--
SANDRO BONAZZOLA
MANAGER, SOFTWARE ENGINEERING, EMEA R&D RHV
Red Hat EMEA <https://www.redhat.com/>
sbonazzo@redhat.com <https://red.ht/sig>
--
SANDRO BONAZZOLA
MANAGER, SOFTWARE ENGINEERING, EMEA R&D RHV
Red Hat EMEA <https://www.redhat.com/>
sbonazzo@redhat.com <https://red.ht/sig>
-- SANDRO BONAZZOLA MANAGER, SOFTWARE ENGINEERING, EMEA R&D RHV Red Hat EMEA <https://www.redhat.com/> sbonazzo@redhat.com <https://red.ht/sig>

On Fri, Mar 22, 2019 at 1:57 PM Sandro Bonazzola <sbonazzo@redhat.com> wrote:
Il giorno ven 22 mar 2019 alle ore 12:42 Dan Kenigsberg <danken@redhat.com> ha scritto:
On Fri, 22 Mar 2019, 12:21 Sandro Bonazzola, <sbonazzo@redhat.com> wrote:
Il giorno ven 22 mar 2019 alle ore 11:14 Dan Kenigsberg < danken@redhat.com> ha scritto:
On Fri, 22 Mar 2019, 12:00 Sandro Bonazzola, <sbonazzo@redhat.com> wrote:
Il giorno ven 22 mar 2019 alle ore 10:52 Dan Kenigsberg < danken@redhat.com> ha scritto:
Yes, I'm repeating myself. SKIPPING TESTS IS BAD
I agree. And having the suite failing on a broken test skipping all the following tests is even worse. This is why I would prefer the rest of the product being tested while someone take ownership of the broken test and fix it.
This is a good reason to rewrite OST with pytest, which continues on failure.
Patches are welcome :-)
This is not an empty gesture. The network suite came into being because of this issue (and others)
And a good reason to ping mperina on IRC to debug this. And a good reason not to merge new code.
It doesn't convince me that we should ignore the failure without due debugging.
Debugging in indeed needed but not on production system blocking the rest of the CI. Maintainer of the test can debug it on own test environment.
The product of this system are bugs. We found one. If you skip it, we all risk it being forgotten. Skipping should be rare, and happen only after the owner is found and admits that he is too busy/lazy to fix it now, and files a bug to fix it later.
We didn't found a bug in the product we are testing, we found a bug in the test that still need to be identified. According to Dafna: "we are randomly failing on get_host_hooks test for at least 3 weeks. its not a specific branch or project and there are no commonalities that I can see," If it was a bug in the product I would have totally agreed with you, it couldn't have been ignored. I'm not saying to ignore this as well. Being a bug in the test itself
I have no idea if this is the case. NullPointerException smells like something coming deep from Engine's data model
I would rather prefer take a non reliable test off for further investigation on a development environment and ensure the rest of the tests are being executed in production environment finding bugs on the product if there are.
Skip is still on the table as an option. The infra team may request us to use it. But we should first put the pressure on them to fix it properly. mperina and msobczyk are now aware of the issue; they should decide if they fix it now or asynchronously.

So another failure today. http://jenkins.ovirt.org/job/ovirt-4.3_change-queue-tester/405/ any updates? On Fri, Mar 22, 2019 at 3:08 PM Dan Kenigsberg <danken@redhat.com> wrote:
On Fri, Mar 22, 2019 at 1:57 PM Sandro Bonazzola <sbonazzo@redhat.com> wrote:
Il giorno ven 22 mar 2019 alle ore 12:42 Dan Kenigsberg < danken@redhat.com> ha scritto:
On Fri, 22 Mar 2019, 12:21 Sandro Bonazzola, <sbonazzo@redhat.com> wrote:
Il giorno ven 22 mar 2019 alle ore 11:14 Dan Kenigsberg < danken@redhat.com> ha scritto:
On Fri, 22 Mar 2019, 12:00 Sandro Bonazzola, <sbonazzo@redhat.com> wrote:
Il giorno ven 22 mar 2019 alle ore 10:52 Dan Kenigsberg < danken@redhat.com> ha scritto:
> Yes, I'm repeating myself. > SKIPPING TESTS IS BAD >
I agree. And having the suite failing on a broken test skipping all the following tests is even worse. This is why I would prefer the rest of the product being tested while someone take ownership of the broken test and fix it.
This is a good reason to rewrite OST with pytest, which continues on failure.
Patches are welcome :-)
This is not an empty gesture. The network suite came into being because of this issue (and others)
And a good reason to ping mperina on IRC to debug this. And a good reason not to merge new code.
It doesn't convince me that we should ignore the failure without due debugging.
Debugging in indeed needed but not on production system blocking the rest of the CI. Maintainer of the test can debug it on own test environment.
The product of this system are bugs. We found one. If you skip it, we all risk it being forgotten. Skipping should be rare, and happen only after the owner is found and admits that he is too busy/lazy to fix it now, and files a bug to fix it later.
We didn't found a bug in the product we are testing, we found a bug in the test that still need to be identified. According to Dafna: "we are randomly failing on get_host_hooks test for at least 3 weeks. its not a specific branch or project and there are no commonalities that I can see," If it was a bug in the product I would have totally agreed with you, it couldn't have been ignored. I'm not saying to ignore this as well. Being a bug in the test itself
I have no idea if this is the case. NullPointerException smells like something coming deep from Engine's data model
I would rather prefer take a non reliable test off for further investigation on a development environment and ensure the rest of the tests are being executed in production environment finding bugs on the product if there are.
Skip is still on the table as an option. The infra team may request us to use it. But we should first put the pressure on them to fix it properly. mperina and msobczyk are now aware of the issue; they should decide if they fix it now or asynchronously.

Hi Dan, I could not agree more that skipping a test is bad. This has been reported several times and all managers and developers are on the mailing list so if they choose to ignore it and not take ownership, and the test is effecting all other projects, we will have no choice. Martin is on the mail and can take ownership at any point. build is: http://jenkins.ovirt.org/job/ovirt-master_change-queue-tester/13508/ Thanks, Dafna On Fri, Mar 22, 2019 at 9:52 AM Dan Kenigsberg <danken@redhat.com> wrote:
Yes, I'm repeating myself. SKIPPING TESTS IS BAD
We have a test suite in order to fix bugs, not in order to kill itself.
Host hooks are Infra. Infra is mperina, rnori and msobczik. Please consult with them before you shut our collective eyes.
Please point them to a failing job, and record the failing traceback.
On Fri, 22 Mar 2019, 11:27 Dafna Ron, <dron@redhat.com> wrote:
patch submitted: https://gerrit.ovirt.org/#/c/98773/
Thanks, Dafna
On Fri, Mar 22, 2019 at 9:04 AM Sandro Bonazzola <sbonazzo@redhat.com> wrote:
Il giorno ven 22 mar 2019 alle ore 09:34 Dafna Ron <dron@redhat.com> ha scritto:
Hi,
we are randomly failing on get_host_hooks test for at least 3 weeks. its not a specific branch or project and there are no commonalities that I can see, aside from not being able to communicate with the host.
this week its started happening at least once a day (this morning, 2 out of 3 failures were due to that test).
This test has been added by Yaniv Kaul over a year ago and he is no longer working on ovirt I think someone else should take ownership of this test and fix it. Please let me know if you are intending to investigate and either fix the failure or fix the test if not I will add a skip to the test,
Please add a skip to the test and if someone will step in maintaining this test it will be re-enabled.
Thanks, Dafna
--
SANDRO BONAZZOLA
MANAGER, SOFTWARE ENGINEERING, EMEA R&D RHV
Red Hat EMEA <https://www.redhat.com/>
sbonazzo@redhat.com <https://red.ht/sig>

On Fri, Mar 22, 2019 at 11:00 AM Dafna Ron <dron@redhat.com> wrote:
Hi Dan,
I could not agree more that skipping a test is bad. This has been reported several times and all managers and developers are on the mailing list so if they choose to ignore it and not take ownership, and the test is effecting all other projects, we will have no choice. Martin is on the mail and can take ownership at any point.
build is: http://jenkins.ovirt.org/job/ovirt-master_change-queue-tester/13508/
Marcin, could you please take a look?
Thanks, Dafna
On Fri, Mar 22, 2019 at 9:52 AM Dan Kenigsberg <danken@redhat.com> wrote:
Yes, I'm repeating myself. SKIPPING TESTS IS BAD
We have a test suite in order to fix bugs, not in order to kill itself.
Host hooks are Infra. Infra is mperina, rnori and msobczik. Please consult with them before you shut our collective eyes.
Please point them to a failing job, and record the failing traceback.
On Fri, 22 Mar 2019, 11:27 Dafna Ron, <dron@redhat.com> wrote:
patch submitted: https://gerrit.ovirt.org/#/c/98773/
Thanks, Dafna
On Fri, Mar 22, 2019 at 9:04 AM Sandro Bonazzola <sbonazzo@redhat.com> wrote:
Il giorno ven 22 mar 2019 alle ore 09:34 Dafna Ron <dron@redhat.com> ha scritto:
Hi,
we are randomly failing on get_host_hooks test for at least 3 weeks. its not a specific branch or project and there are no commonalities that I can see, aside from not being able to communicate with the host.
this week its started happening at least once a day (this morning, 2 out of 3 failures were due to that test).
This test has been added by Yaniv Kaul over a year ago and he is no longer working on ovirt I think someone else should take ownership of this test and fix it. Please let me know if you are intending to investigate and either fix the failure or fix the test if not I will add a skip to the test,
Please add a skip to the test and if someone will step in maintaining this test it will be re-enabled.
Thanks, Dafna
--
SANDRO BONAZZOLA
MANAGER, SOFTWARE ENGINEERING, EMEA R&D RHV
Red Hat EMEA <https://www.redhat.com/>
sbonazzo@redhat.com <https://red.ht/sig>
-- Martin Perina Manager, Software Engineering Red Hat Czech s.r.o.

Sure, I'm on it. On 3/22/19 11:18 AM, Martin Perina wrote:
On Fri, Mar 22, 2019 at 11:00 AM Dafna Ron <dron@redhat.com <mailto:dron@redhat.com>> wrote:
Hi Dan,
I could not agree more that skipping a test is bad. This has been reported several times and all managers and developers are on the mailing list so if they choose to ignore it and not take ownership, and the test is effecting all other projects, we will have no choice. Martin is on the mail and can take ownership at any point.
build is: http://jenkins.ovirt.org/job/ovirt-master_change-queue-tester/13508/
Marcin, could you please take a look?
Thanks, Dafna
On Fri, Mar 22, 2019 at 9:52 AM Dan Kenigsberg <danken@redhat.com <mailto:danken@redhat.com>> wrote:
Yes, I'm repeating myself. SKIPPING TESTS IS BAD
We have a test suite in order to fix bugs, not in order to kill itself.
Host hooks are Infra. Infra is mperina, rnori and msobczik. Please consult with them before you shut our collective eyes.
Please point them to a failing job, and record the failing traceback.
On Fri, 22 Mar 2019, 11:27 Dafna Ron, <dron@redhat.com <mailto:dron@redhat.com>> wrote:
patch submitted: https://gerrit.ovirt.org/#/c/98773/
Thanks, Dafna
On Fri, Mar 22, 2019 at 9:04 AM Sandro Bonazzola <sbonazzo@redhat.com <mailto:sbonazzo@redhat.com>> wrote:
Il giorno ven 22 mar 2019 alle ore 09:34 Dafna Ron <dron@redhat.com <mailto:dron@redhat.com>> ha scritto:
Hi,
we are randomly failing on get_host_hooks test for at least 3 weeks. its not a specific branch or project and there are no commonalities that I can see, aside from not being able to communicate with the host.
this week its started happening at least once a day (this morning, 2 out of 3 failures were due to that test).
This test has been added by Yaniv Kaul over a year ago and he is no longer working on ovirt I think someone else should take ownership of this test and fix it. Please let me know if you are intending to investigate and either fix the failure or fix the test if not I will add a skip to the test,
Please add a skip to the test and if someone will step in maintaining this test it will be re-enabled.
Thanks, Dafna
--
SANDRO BONAZZOLA
MANAGER, SOFTWARE ENGINEERING, EMEA R&D RHV
Red Hat EMEA <https://www.redhat.com/>
sbonazzo@redhat.com <mailto:sbonazzo@redhat.com>
-- Martin Perina Manager, Software Engineering Red Hat Czech s.r.o.
participants (5)
-
Dafna Ron
-
Dan Kenigsberg
-
Marcin Sobczyk
-
Martin Perina
-
Sandro Bonazzola