<div dir="ltr">Barak - can you prioritize fixing this one?<div>It blocks us from adding additional functionality into the ovirt-ansible-roles repo.</div></div><div class="gmail_extra"><br><div class="gmail_quote">On Thu, Aug 31, 2017 at 11:54 AM, Martin Perina <span dir="ltr"><<a href="mailto:mperina@redhat.com" target="_blank">mperina@redhat.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir="ltr"><div class="gmail_default" style="font-family:arial,helvetica,sans-serif">So with ovirt-ansible-roles-1.1.0 (which is the last offically relased version) everything runs fine and host is added properly (tested several times on CentOS 7.3)<br><br></div><div class="gmail_default" style="font-family:arial,helvetica,sans-serif">But we need to fix executing builds from github otherwise we cannot continue working with ovirt-ansible-roles in github:<br><br></div><div class="gmail_default" style="font-family:arial,helvetica,sans-serif">1. If you add comment 'ci build please' to github PR, then build will be executed and the result will be a repo with RPM to be used in OST (but this build will not be passed to queue to be added to tested repo)<br><br></div><div class="gmail_default" style="font-family:arial,helvetica,sans-serif">2. When PR is merged, then either automatically or by some other comment/action (available to maintainers only) build will be executed and if build is OK, it can be queued to be added to tested repo<br><br></div><div class="gmail_default" style="font-family:arial,helvetica,sans-serif">Without above we just cannot continue working on ovirt-ansible-roles on github.<br><br></div><div class="gmail_default" style="font-family:arial,helvetica,sans-serif">Thanks<span class="HOEnZb"><font color="#888888"><br><br></font></span></div><span class="HOEnZb"><font color="#888888"><div class="gmail_default" style="font-family:arial,helvetica,sans-serif">Martin<br><br></div><div class="gmail_default" style="font-family:arial,helvetica,sans-serif"><br></div></font></span></div><div class="gmail_extra"><br><div class="gmail_quote"><span class="">On Thu, Aug 31, 2017 at 8:55 AM, Barak Korren <span dir="ltr"><<a href="mailto:bkorren@redhat.com" target="_blank">bkorren@redhat.com</a>></span> wrote:<br></span><div><div class="h5"><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">On 30 August 2017 at 22:20, Martin Perina <<a href="mailto:mperina@redhat.com" target="_blank">mperina@redhat.com</a>> wrote:<br>
><br>
>><br>
>> So we're back in square one.<br>
>> Another possible culprit may be ansible: Vdsm is stopped two seconds<br>
>> after it logs to the host.<br>
>><br>
>> Aug 30 11:26:24 lago-basic-suite-master-host-0 systemd: Starting<br>
>> Session 10 of user root.<br>
>> Aug 30 11:26:25 lago-basic-suite-master-host-0 python: ansible-setup<br>
>> Invoked with filter=* gather_subset=['all']<br>
>> fact_path=/etc/ansible/facts.d gather_timeout=10<br>
>> Aug 30 11:26:25 lago-basic-suite-master-host-0 python: ansible-command<br>
>> Invoked with warn=True executable=None _uses_shell=False<br>
>> _raw_params=bash -c "rpm -qi vdsm | grep -oE<br>
>> 'Version\\s+:\\s+[0-9\\.]+' | awk '{print $3}'" removes=None<br>
>> creates=None chdir=None<br>
>> Aug 30 11:26:26 lago-basic-suite-master-host-0 python: ansible-systemd<br>
>> Invoked with no_block=False name=libvirt-guests enabled=True<br>
>> daemon_reload=False state=started user=False masked=None<br>
>> Aug 30 11:26:26 lago-basic-suite-master-host-0 systemd: Reloading.<br>
>> Aug 30 11:26:26 lago-basic-suite-master-host-0 systemd: Cannot add<br>
>> dependency job for unit lvm2-lvmetad.socket, ignoring: Unit is masked.<br>
>> Aug 30 11:26:26 lago-basic-suite-master-host-0 systemd: Stopped MOM<br>
>> instance configured for VDSM purposes.<br>
>> Aug 30 11:26:26 lago-basic-suite-master-host-0 systemd: Stopping<br>
>> Virtual Desktop Server Manager...<br>
>><br>
>><br>
>> could it be that it triggers a systemd-reload that makes systemd croak<br>
>> on the vdsm-mom cycle?<br>
><br>
><br>
> We are not restarting VDSM within ovirt-host-deploy Ansible role, the VDSM<br>
> restart is performed in host-deploy part same as in previous versions.<br>
><br>
> Within ovirt-host-deploy-firewalld we only enable and restart firewalld<br>
> service.<br>
><br>
<br>
comparing a successful add-host flow [1] to a failed one [2] we notice<br>
that in the failed add host ansible logs in twice (session 10 and<br>
session 11). Could it be somehow related? Notice that Session 11 uses<br>
the OLD way (awk+grep based) to find vdsm's version.<br>
<br>
Aug 30 05:55:53 lago-basic-suite-master-host-0 systemd-logind: New<br>
session 10 of user root.<br>
Aug 30 05:55:53 lago-basic-suite-master-host-0 systemd: Starting<br>
Session 10 of user root.<br>
Aug 30 05:55:53 lago-basic-suite-master-host-0 python: ansible-setup<br>
Invoked with filter=* gather_subset=['all']<br>
fact_path=/etc/ansible/facts.d gather_timeout=10<br>
Aug 30 05:55:54 lago-basic-suite-master-host-0 python: ansible-command<br>
Invoked with warn=True executable=None _uses_shell=False<br>
_raw_params=bash -c "rpm -qi vdsm | grep -oE<br>
'Version\\s+:\\s+[0-9\\.]+' | awk '{print $3}'" removes=None<br>
creates=None chdir=None<br>
Aug 30 05:55:54 lago-basic-suite-master-host-0 python: ansible-systemd<br>
Invoked with no_block=False name=libvirt-guests enabled=True<br>
daemon_reload=False state=started user=False masked=None<br>
Aug 30 05:55:54 lago-basic-suite-master-host-0 systemd: Reloading.<br>
Aug 30 05:55:55 lago-basic-suite-master-host-0 systemd: Cannot add<br>
dependency job for unit lvm2-lvmetad.socket, ignoring: Unit is masked.<br>
Aug 30 05:55:55 lago-basic-suite-master-host-0 systemd: Stopped MOM<br>
instance configured for VDSM purposes.<br>
Aug 30 05:55:55 lago-basic-suite-master-host-0 systemd: Stopping<br>
Virtual Desktop Server Manager...<br>
Aug 30 05:55:55 lago-basic-suite-master-host-0 systemd: Starting<br>
Suspend Active Libvirt Guests...<br>
Aug 30 05:55:55 lago-basic-suite-master-host-0 systemd: Started<br>
Suspend Active Libvirt Guests.<br>
Aug 30 05:55:55 lago-basic-suite-master-host-0 journal: libvirt<br>
version: 2.0.0, package: 10.el7_3.9 (CentOS BuildSystem<br>
<<a href="http://bugs.centos.org" rel="noreferrer" target="_blank">http://bugs.centos.org</a>>, 2017-05-25-20:52:28, <a href="http://c1bm.rdu2.centos.org" rel="noreferrer" target="_blank">c1bm.rdu2.centos.org</a>)<br>
Aug 30 05:55:55 lago-basic-suite-master-host-0 journal: hostname:<br>
lago-basic-suite-master-host-0<wbr>.lago.local<br>
Aug 30 05:55:55 lago-basic-suite-master-host-0 vdsmd_init_common.sh:<br>
vdsm: Running run_final_hooks<br>
Aug 30 05:55:55 lago-basic-suite-master-host-0 journal: End of file<br>
while reading data: Input/output error<br>
Aug 30 05:55:55 lago-basic-suite-master-host-0 systemd: Stopped<br>
Virtual Desktop Server Manager.<br>
Aug 30 05:55:55 lago-basic-suite-master-host-0 python: ansible-command<br>
Invoked with warn=True executable=None _uses_shell=False<br>
_raw_params=bash -c "rpm -q vdsm --qf '%{VERSION}'" removes=None<br>
creates=None chdir=None<br>
Aug 30 05:55:55 lago-basic-suite-master-host-0 python: ansible-systemd<br>
Invoked with no_block=False name=iptables enabled=False<br>
daemon_reload=False state=stopped user=False masked=None<br>
<br>
[1]: <a href="http://jenkins.ovirt.org/job/ovirt-master_change-queue-tester/2197/artifact/exported-artifacts/basic-suit-master-el7/test_logs/basic-suite-master/post-002_bootstrap.py/lago-basic-suite-master-host-0/_var_log/messages/*view*/" rel="noreferrer" target="_blank">http://jenkins.ovirt.org/job/o<wbr>virt-master_change-queue-teste<wbr>r/2197/artifact/exported-artif<wbr>acts/basic-suit-master-el7/<wbr>test_logs/basic-suite-master/<wbr>post-002_bootstrap.py/lago-<wbr>basic-suite-master-host-0/_<wbr>var_log/messages/*view*/</a><br>
<br>
<br>
[2]: <a href="http://jenkins.ovirt.org/job/ovirt-master_change-queue-tester/2151/artifact/exported-artifacts/basic-suit-master-el7/test_logs/basic-suite-master/post-002_bootstrap.py/lago-basic-suite-master-host-0/_var_log/messages/*view*/" rel="noreferrer" target="_blank">http://jenkins.ovirt.org/job/o<wbr>virt-master_change-queue-teste<wbr>r/2151/artifact/exported-artif<wbr>acts/basic-suit-master-el7/<wbr>test_logs/basic-suite-master/<wbr>post-002_bootstrap.py/lago-<wbr>basic-suite-master-host-0/_<wbr>var_log/messages/*view*/</a><br>
<span class="m_175589461823944653HOEnZb"><font color="#888888"><br>
--<br>
Barak Korren<br>
RHV DevOps team , RHCE, RHCi<br>
Red Hat EMEA<br>
<a href="http://redhat.com" rel="noreferrer" target="_blank">redhat.com</a> | TRIED. TESTED. TRUSTED. | <a href="http://redhat.com/trusted" rel="noreferrer" target="_blank">redhat.com/trusted</a><br>
</font></span></blockquote></div></div></div><br></div>
<br>______________________________<wbr>_________________<br>
Devel mailing list<br>
<a href="mailto:Devel@ovirt.org">Devel@ovirt.org</a><br>
<a href="http://lists.ovirt.org/mailman/listinfo/devel" rel="noreferrer" target="_blank">http://lists.ovirt.org/<wbr>mailman/listinfo/devel</a><br></blockquote></div><br></div>