[JIRA] (OVIRT-1339) Re: Jenkins check-merged failures

Milan Zamazal (oVirt JIRA) jira at ovirt-jira.atlassian.net
Tue Apr 25 11:50:03 UTC 2017


    [ https://ovirt-jira.atlassian.net/browse/OVIRT-1339?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=30031#comment-30031 ] 

Milan Zamazal commented on OVIRT-1339:
--------------------------------------

Nadav Goldin <ngoldin at redhat.com> writes:

> Hi Milan,
> (Adding infra-support to open a ticket)
>
> For the first job, the automation/deploy.sh script failed, which means
> vdsm failed to install inside the VM created by Lago. I couldn't
> figure out why as the 'deploy.sh' script was missing the bash '-x'
> flag. The /var/log/messages doesn't show any VDSM logs, so I assume it
> failed before. Anyways, now that[1] is merged - it should be easier to
> debug this next time.
>
> For the second job - this is due to Lago internal reposerver still
> being up from a previous run on the slave. It seems that this[2] vdsm
> check-merged job on April 05 caused it, when it timed-out without
> terminating properly. This is quite rare I should say, we can keep
> this ticket to check if it happens again.
>
> Either way - I think both failures are unrelated, best would be(if
> still relevant - as check-merged probably ran a few times since) to
> re-trigger and see if it replicates.

Hi Nadav,

thank you for explanation.  I don't think the failures replicate;
hopefully their causes will be identified and fixed if their occur
again.

> [1] https://gerrit.ovirt.org/#/c/75348/2
> [2] http://jenkins.ovirt.org/job/vdsm_master_check-merged-el7-x86_64/1492/consoleFull
>
> On Fri, Apr 7, 2017 at 10:32 AM, Milan Zamazal <mzamazal at redhat.com> wrote:
>> Hi,
>>
>> a series of 4 my Vdsm patches was merged yesterday and Jenkins has
>> failed on two of them in check-merged.  See
>> http://jenkins.ovirt.org/job/vdsm_master_check-merged-el7-x86_64/1504/
>> and
>> http://jenkins.ovirt.org/job/vdsm_master_check-merged-el7-x86_64/1506/.
>>
>> The corresponding errors were:
>>
>> 16:20:09 + lago ovirt deploy
>> 16:20:09 current session does not belong to lago group.
>> 16:20:09 @ Deploy oVirt environment:
>> 16:20:09 # ovirt-role metadata entry will be soon deprecated, instead you
>> should use the vm-provider entry in the domain definition and set it no one
>> of: ovirt-node, ovirt-engine, ovirt-host
>> 16:20:09   # Deploy environment:
>> 16:20:09     * [Thread-2] Deploy VM vdsm_functional_tests_host-el7:
>> 16:20:23       - STDERR
>> 16:20:23
>> 16:20:23
>> 16:20:23 Exiting on user cancel
>> 16:20:23
>> 16:20:23     * [Thread-2] Deploy VM vdsm_functional_tests_host-el7: ERROR (in 0:00:13)
>> 16:20:23 Error while running thread
>> 16:20:23 Traceback (most recent call last):
>> 16:20:23   File "/usr/lib/python2.7/site-packages/lago/utils.py", line 57, in _ret_via_queue
>> 16:20:23     queue.put({'return': func()})
>> 16:20:23   File "/usr/lib/python2.7/site-packages/lago/prefix.py", line 1339, in _deploy_host
>> 16:20:23     host.name(),
>> 16:20:23 RuntimeError:
>> /home/jenkins/workspace/vdsm_master_check-merged-el7-x86_64/vdsm/automation/vdsm_functional/default/scripts/_home_jenkins_workspace_vdsm_master_check-merged-el7-x86_64_vdsm_automation_deploy.sh
>> failed with status 1 on vdsm_functional_tests_host-el7
>> 16:20:23   # Deploy environment: ERROR (in 0:00:13)
>> 16:20:23 @ Deploy oVirt environment: ERROR (in 0:00:14)
>> 16:20:23 Error occured, aborting
>>
>> and
>>
>> 16:21:32 + lago ovirt deploy
>> 16:21:33 current session does not belong to lago group.
>> 16:21:33 @ Deploy oVirt environment:
>> 16:21:33 # ovirt-role metadata entry will be soon deprecated, instead you
>> should use the vm-provider entry in the domain definition and set it no one
>> of: ovirt-node, ovirt-engine, ovirt-host
>> 16:21:33 @ Deploy oVirt environment: ERROR (in 0:00:00)
>> 16:21:33 Error occured, aborting
>> 16:21:33 Traceback (most recent call last):
>> 16:21:33   File "/usr/lib/python2.7/site-packages/ovirtlago/cmd.py", line 303, in do_run
>> 16:21:33     self.cli_plugins[args.ovirtverb].do_run(args)
>> 16:21:33   File "/usr/lib/python2.7/site-packages/lago/plugins/cli.py", line 184, in do_run
>> 16:21:33     self._do_run(**vars(args))
>> 16:21:33   File "/usr/lib/python2.7/site-packages/lago/utils.py", line 495, in wrapper
>> 16:21:33     return func(*args, **kwargs)
>> 16:21:33   File "/usr/lib/python2.7/site-packages/lago/utils.py", line 506, in wrapper
>> 16:21:33     return func(*args, prefix=prefix, **kwargs)
>> 16:21:33   File "/usr/lib/python2.7/site-packages/ovirtlago/cmd.py", line 164, in do_deploy
>> 16:21:33     prefix.deploy()
>> 16:21:33   File "/usr/lib/python2.7/site-packages/lago/log_utils.py", line 633, in wrapper
>> 16:21:33     return func(*args, **kwargs)
>> 16:21:33   File "/usr/lib/python2.7/site-packages/ovirtlago/reposetup.py", line 110, in wrapper
>> 16:21:33     with utils.repo_server_context(args[0]):
>> 16:21:33   File "/usr/lib64/python2.7/contextlib.py", line 17, in __enter__
>> 16:21:33     return self.gen.next()
>> 16:21:33   File "/usr/lib/python2.7/site-packages/ovirtlago/utils.py", line 97, in repo_server_context
>> 16:21:33     root_dir=prefix.paths.internal_repo(),
>> 16:21:33   File "/usr/lib/python2.7/site-packages/ovirtlago/utils.py", line 73, in _create_http_server
>> 16:21:33     generate_request_handler(root_dir),
>> 16:21:33   File "/usr/lib64/python2.7/SocketServer.py", line 419, in __init__
>> 16:21:33     self.server_bind()
>> 16:21:33   File "/usr/lib64/python2.7/BaseHTTPServer.py", line 108, in server_bind
>> 16:21:33     SocketServer.TCPServer.server_bind(self)
>> 16:21:33   File "/usr/lib64/python2.7/SocketServer.py", line 430, in server_bind
>> 16:21:33     self.socket.bind(self.server_address)
>> 16:21:33   File "/usr/lib64/python2.7/socket.py", line 224, in meth
>> 16:21:33     return getattr(self._sock,name)(*args)
>> 16:21:33 error: [Errno 98] Address already in use
>>
>> Do you know what's wrong?
>>
>> Thanks,
>> Milan
>> _______________________________________________
>> Infra mailing list
>> Infra at ovirt.org
>> http://lists.ovirt.org/mailman/listinfo/infra

> Re: Jenkins check-merged failures
> ---------------------------------
>
>                 Key: OVIRT-1339
>                 URL: https://ovirt-jira.atlassian.net/browse/OVIRT-1339
>             Project: oVirt - virtualization made easy
>          Issue Type: By-EMAIL
>            Reporter: Nadav Goldin
>            Assignee: infra
>
> Hi Milan,
> (Adding infra-support to open a ticket)
> For the first job, the automation/deploy.sh script failed, which means
> vdsm failed to install inside the VM created by Lago. I couldn't
> figure out why as the 'deploy.sh' script was missing the bash '-x'
> flag. The /var/log/messages doesn't show any VDSM logs, so I assume it
> failed before. Anyways, now that[1] is merged - it should be easier to
> debug this next time.
> For the second job - this is due to Lago internal reposerver still
> being up from a previous run on the slave. It seems that this[2] vdsm
> check-merged job on April 05 caused it, when it timed-out without
> terminating properly. This is quite rare I should say, we can keep
> this ticket to check if it happens again.
> Either way - I think both failures are unrelated, best would be(if
> still relevant - as check-merged probably ran a few times since) to
> re-trigger and see if it replicates.
> [1] https://gerrit.ovirt.org/#/c/75348/2
> [2] http://jenkins.ovirt.org/job/vdsm_master_check-merged-el7-x86_64/1492/consoleFull
> On Fri, Apr 7, 2017 at 10:32 AM, Milan Zamazal <mzamazal at redhat.com> wrote:
> > Hi,
> >
> > a series of 4 my Vdsm patches was merged yesterday and Jenkins has
> > failed on two of them in check-merged.  See
> > http://jenkins.ovirt.org/job/vdsm_master_check-merged-el7-x86_64/1504/
> > and
> > http://jenkins.ovirt.org/job/vdsm_master_check-merged-el7-x86_64/1506/.
> >
> > The corresponding errors were:
> >
> > 16:20:09 + lago ovirt deploy
> > 16:20:09 current session does not belong to lago group.
> > 16:20:09 @ Deploy oVirt environment:
> > 16:20:09   # ovirt-role metadata entry will be soon deprecated, instead you should use the vm-provider entry in the domain definition and set it no one of: ovirt-node, ovirt-engine, ovirt-host
> > 16:20:09   # Deploy environment:
> > 16:20:09     * [Thread-2] Deploy VM vdsm_functional_tests_host-el7:
> > 16:20:23       - STDERR
> > 16:20:23
> > 16:20:23
> > 16:20:23 Exiting on user cancel
> > 16:20:23
> > 16:20:23     * [Thread-2] Deploy VM vdsm_functional_tests_host-el7: ERROR (in 0:00:13)
> > 16:20:23 Error while running thread
> > 16:20:23 Traceback (most recent call last):
> > 16:20:23   File "/usr/lib/python2.7/site-packages/lago/utils.py", line 57, in _ret_via_queue
> > 16:20:23     queue.put({'return': func()})
> > 16:20:23   File "/usr/lib/python2.7/site-packages/lago/prefix.py", line 1339, in _deploy_host
> > 16:20:23     host.name(),
> > 16:20:23 RuntimeError: /home/jenkins/workspace/vdsm_master_check-merged-el7-x86_64/vdsm/automation/vdsm_functional/default/scripts/_home_jenkins_workspace_vdsm_master_check-merged-el7-x86_64_vdsm_automation_deploy.sh failed with status 1 on vdsm_functional_tests_host-el7
> > 16:20:23   # Deploy environment: ERROR (in 0:00:13)
> > 16:20:23 @ Deploy oVirt environment: ERROR (in 0:00:14)
> > 16:20:23 Error occured, aborting
> >
> > and
> >
> > 16:21:32 + lago ovirt deploy
> > 16:21:33 current session does not belong to lago group.
> > 16:21:33 @ Deploy oVirt environment:
> > 16:21:33   # ovirt-role metadata entry will be soon deprecated, instead you should use the vm-provider entry in the domain definition and set it no one of: ovirt-node, ovirt-engine, ovirt-host
> > 16:21:33 @ Deploy oVirt environment: ERROR (in 0:00:00)
> > 16:21:33 Error occured, aborting
> > 16:21:33 Traceback (most recent call last):
> > 16:21:33   File "/usr/lib/python2.7/site-packages/ovirtlago/cmd.py", line 303, in do_run
> > 16:21:33     self.cli_plugins[args.ovirtverb].do_run(args)
> > 16:21:33   File "/usr/lib/python2.7/site-packages/lago/plugins/cli.py", line 184, in do_run
> > 16:21:33     self._do_run(**vars(args))
> > 16:21:33   File "/usr/lib/python2.7/site-packages/lago/utils.py", line 495, in wrapper
> > 16:21:33     return func(*args, **kwargs)
> > 16:21:33   File "/usr/lib/python2.7/site-packages/lago/utils.py", line 506, in wrapper
> > 16:21:33     return func(*args, prefix=prefix, **kwargs)
> > 16:21:33   File "/usr/lib/python2.7/site-packages/ovirtlago/cmd.py", line 164, in do_deploy
> > 16:21:33     prefix.deploy()
> > 16:21:33   File "/usr/lib/python2.7/site-packages/lago/log_utils.py", line 633, in wrapper
> > 16:21:33     return func(*args, **kwargs)
> > 16:21:33   File "/usr/lib/python2.7/site-packages/ovirtlago/reposetup.py", line 110, in wrapper
> > 16:21:33     with utils.repo_server_context(args[0]):
> > 16:21:33   File "/usr/lib64/python2.7/contextlib.py", line 17, in __enter__
> > 16:21:33     return self.gen.next()
> > 16:21:33   File "/usr/lib/python2.7/site-packages/ovirtlago/utils.py", line 97, in repo_server_context
> > 16:21:33     root_dir=prefix.paths.internal_repo(),
> > 16:21:33   File "/usr/lib/python2.7/site-packages/ovirtlago/utils.py", line 73, in _create_http_server
> > 16:21:33     generate_request_handler(root_dir),
> > 16:21:33   File "/usr/lib64/python2.7/SocketServer.py", line 419, in __init__
> > 16:21:33     self.server_bind()
> > 16:21:33   File "/usr/lib64/python2.7/BaseHTTPServer.py", line 108, in server_bind
> > 16:21:33     SocketServer.TCPServer.server_bind(self)
> > 16:21:33   File "/usr/lib64/python2.7/SocketServer.py", line 430, in server_bind
> > 16:21:33     self.socket.bind(self.server_address)
> > 16:21:33   File "/usr/lib64/python2.7/socket.py", line 224, in meth
> > 16:21:33     return getattr(self._sock,name)(*args)
> > 16:21:33 error: [Errno 98] Address already in use
> >
> > Do you know what's wrong?
> >
> > Thanks,
> > Milan
> > _______________________________________________
> > Infra mailing list
> > Infra at ovirt.org
> > http://lists.ovirt.org/mailman/listinfo/infra



--
This message was sent by Atlassian JIRA
(v1000.910.0#100040)


More information about the Infra mailing list