I run it now locally using the extra sources as it runs in the CQ and it
didn't fail for me.
I will continue to investigate tomorrow,
Marcin, did you see this issue also in check_patch or only in CQ?
Regards,
Galit
On Mon, Mar 23, 2020 at 4:29 PM Galit Rosenthal <grosenth(a)redhat.com> wrote:
I will look at it.
On Mon, Mar 23, 2020 at 4:18 PM Martin Perina <mperina(a)redhat.com> wrote:
>
>
> On Mon, Mar 23, 2020 at 3:16 PM Marcin Sobczyk <msobczyk(a)redhat.com>
> wrote:
>
>>
>>
>> On 3/23/20 3:10 PM, Marcin Sobczyk wrote:
>> >
>> >
>> > On 3/23/20 2:53 PM, Nir Soffer wrote:
>> >> On Mon, Mar 23, 2020 at 3:26 PM Marcin Sobczyk
<msobczyk(a)redhat.com>
>> >> wrote:
>> >>>
>> >>>
>> >>> On 3/23/20 2:17 PM, Nir Soffer wrote:
>> >>>> On Mon, Mar 23, 2020 at 1:25 PM Marcin Sobczyk
>> >>>> <msobczyk(a)redhat.com> wrote:
>> >>>>>
>> >>>>> On 3/21/20 1:18 AM, Nir Soffer wrote:
>> >>>>>
>> >>>>> On Fri, Mar 20, 2020 at 9:35 PM Nir Soffer
<nsoffer(a)redhat.com>
>> >>>>> wrote:
>> >>>>>> Looks like infrastructure issue setting up storage on
engine host.
>> >>>>>>
>> >>>>>> Here are 2 failing builds with unrelated changes:
>> >>>>>>
https://jenkins.ovirt.org/job/ovirt-system-tests_manual/6677/
>> >>>>>>
https://jenkins.ovirt.org/job/ovirt-system-tests_manual/6678/
>> >>>>> Rebuilding still fails in setup_storage:
>> >>>>>
>> >>>>>
>>
https://jenkins.ovirt.org/job/ovirt-system-tests_manual/6679/testReport/
>> >>>>>
>> >>>>>
>>
https://jenkins.ovirt.org/job/ovirt-system-tests_manual/6680/testReport/
>> >>>>>
>> >>>>>
>> >>>>>> Is this a known issue?
>> >>>>>>
>> >>>>>> Error Message
>> >>>>>>
>> >>>>>> AssertionError: setup_storage.sh failed. Exit code is 1
assert 1
>> >>>>>> == 0 -1 +0
>> >>>>>>
>> >>>>>> Stacktrace
>> >>>>>>
>> >>>>>> prefix = <ovirtlago.prefix.OvirtPrefix object at
0x7f6fd2b998d0>
>> >>>>>>
>> >>>>>> @pytest.mark.run(order=14)
>> >>>>>> def test_configure_storage(prefix):
>> >>>>>> engine = prefix.virt_env.engine_vm()
>> >>>>>> result = engine.ssh(
>> >>>>>> [
>> >>>>>> '/tmp/setup_storage.sh',
>> >>>>>> ],
>> >>>>>> )
>> >>>>>>> assert result.code == 0,
'setup_storage.sh failed. Exit
>> >>>>>>> code is %s' % result.code
>> >>>>>> E AssertionError: setup_storage.sh failed. Exit
code is 1
>> >>>>>> E assert 1 == 0
>> >>>>>> E -1
>> >>>>>> E +0
>> >>>>>>
>> >>>>>>
>> >>>>>> The pytest traceback is nice, but in this case it is
does not
>> >>>>>> show any useful info.
>> >>>>>>
>> >>>>>> Since we run a script using ssh, the error message
should include
>> >>>>>> the process stdout and stderr
>> >>>>>> which probably can explain the failure.
>> >>>>> I posted
https://gerrit.ovirt.org/#/c/107830/ to improve
logging
>> >>>>> during storage setup.
>> >>>>> Unfortunately AFAICS it didn't fail, so I guess
we'll have to
>> >>>>> merge it and wait for a failed job to get some helpful
logs.
>> >>>> Thanks.
>> >>>>
>> >>>> It still fails for me with current code:
>> >>>>
>>
https://jenkins.ovirt.org/job/ovirt-system-tests_manual/6689/testReport/
>> >>>>
>> >>>>
>> >>>> Same when using current vdsm master.
>> >>> Updated the patch according to your suggestions and currently
trying
>> >>> out
>> >>> OST for the 4th time -
>> >>> all previous runs succeeded. I guess I'm out of luck :)
>> >> It succeeds on your local OST setup but fail on Jenkins?
>> > No, I mean jenkins - both check-patch runs didn't fail on this script.
>> > I also tried running OST manually twice and same thing happened.
>> > Anyway - the patch has been merged now so if any failure occurs in CQ
>> > we should know what's going on.
>> Ok, finally caught a failure in CQ [1]:
>>
>> [2020-03-23T14:14:09.836Z] if result.code != 0:
>> [2020-03-23T14:14:09.836Z] msg = (
>> [2020-03-23T14:14:09.836Z] 'setup_storage.sh failed with
>> exit code: {}.\n'
>> [2020-03-23T14:14:09.836Z] 'stdout:\n{}'
>> [2020-03-23T14:14:09.836Z] 'stderr:\n{}'
>> [2020-03-23T14:14:09.836Z] ).format(result.code, result.out,
>> result.err)
>> [2020-03-23T14:14:09.836Z] > raise RuntimeError(msg)
>> [2020-03-23T14:14:09.836Z] E RuntimeError: setup_storage.sh
>> failed with exit code: 1.
>> [2020-03-23T14:14:09.836Z] E stdout:
>> [2020-03-23T14:14:09.836Z] E Reposync & Extra Sources
>> Content 0.0 B/s | 0 B 00:00
>> [2020-03-23T14:14:09.836Z] E stderr:
>> [2020-03-23T14:14:09.836Z] E + set -xe
>> [2020-03-23T14:14:09.836Z] E +
>> MAIN_NFS_DEV=disk/by-id/scsi-0QEMU_QEMU_HARDDISK_2
>> [2020-03-23T14:14:09.836Z] E +
>> ISCSI_DEV=disk/by-id/scsi-0QEMU_QEMU_HARDDISK_3
>> [2020-03-23T14:14:09.836Z] E + NUM_LUNS=5
>> [2020-03-23T14:14:09.836Z] E ++ uname -r
>> [2020-03-23T14:14:09.836Z] E ++ awk -F. '{print $(NF-1)}'
>> [2020-03-23T14:14:09.836Z] E + DIST=el8_1
>> [2020-03-23T14:14:09.836Z] E + main
>> [2020-03-23T14:14:09.836Z] E ++ hostname
>> [2020-03-23T14:14:09.836Z] E + [[
>> lago-basic-suite-master-engine == *\i\p\v\6* ]]
>> [2020-03-23T14:14:09.836Z] E + install_deps
>> [2020-03-23T14:14:09.836Z] E + systemctl disable --now
>> kdump.service
>> [2020-03-23T14:14:09.836Z] E Removed
>> /etc/systemd/system/multi-user.target.wants/kdump.service.
>> [2020-03-23T14:14:09.836Z] E + yum install --nogpgcheck -y
>> nfs-utils rpcbind lvm2 targetcli sg3_utils iscsi-initiator-utils lsscsi
>> policycoreutils-python-utils
>> [2020-03-23T14:14:09.836Z] E Failed to download metadata for
>> repo 'alocalsync'
>> [2020-03-23T14:14:09.836Z] E Error: Failed to download
>> metadata for repo 'alocalsync'
>>
>>
>> [1]
>>
>>
https://jenkins.ovirt.org/blue/organizations/jenkins/ovirt-master_change-...
>
>
> Galit, could you please take a look?
>
>>
>>
>> >
>> >>
>> >>>>>> Also I wonder why this code is called as a test
>> >>>>>> (test_configure_storage). This looks like setup
>> >>>>>> step so it should run as a fixture.
>> >>>>> That's true, but the pytest porting effort was about
providing a
>> >>>>> bare minimum to move away from nose.
>> >>>>> Organizing the tests into proper setup/fixtures is a huge
task and
>> >>>>> will be probably implemented
>> >>>>> incrementally in the nearest future.
>> >>>> Understood
>> >>>>
>> >
>>
>>
>
> --
> Martin Perina
> Manager, Software Engineering
> Red Hat Czech s.r.o.
>
--
GALIT ROSENTHAL
SOFTWARE ENGINEER
Red Hat
<
https://www.redhat.com/>
galit(a)redhat.com T: 972-9-7692230
<
https://red.ht/sig>