<div dir="ltr">Barak - can you prioritize fixing this one?<div>It blocks us from adding additional functionality into the ovirt-ansible-roles repo.</div></div><div class="gmail_extra"><br><div class="gmail_quote">On Thu, Aug 31, 2017 at 11:54 AM, Martin Perina <span dir="ltr">&lt;<a href="mailto:mperina@redhat.com" target="_blank">mperina@redhat.com</a>&gt;</span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir="ltr"><div class="gmail_default" style="font-family:arial,helvetica,sans-serif">So with ovirt-ansible-roles-1.1.0 (which is the last offically relased version) everything runs fine and host is added properly (tested several times on CentOS 7.3)<br><br></div><div class="gmail_default" style="font-family:arial,helvetica,sans-serif">But we need to fix executing builds from github otherwise we cannot continue working with ovirt-ansible-roles in github:<br><br></div><div class="gmail_default" style="font-family:arial,helvetica,sans-serif">1. If you add comment &#39;ci build please&#39; to github PR, then build will be executed and the result will be a repo with RPM to be used in OST (but this build will not be passed to queue to be added to tested repo)<br><br></div><div class="gmail_default" style="font-family:arial,helvetica,sans-serif">2. When PR is merged, then either automatically or by some other comment/action (available to maintainers only) build will be executed and if build is OK, it can be queued to be added to tested repo<br><br></div><div class="gmail_default" style="font-family:arial,helvetica,sans-serif">Without above we just cannot continue working on ovirt-ansible-roles on github.<br><br></div><div class="gmail_default" style="font-family:arial,helvetica,sans-serif">Thanks<span class="HOEnZb"><font color="#888888"><br><br></font></span></div><span class="HOEnZb"><font color="#888888"><div class="gmail_default" style="font-family:arial,helvetica,sans-serif">Martin<br><br></div><div class="gmail_default" style="font-family:arial,helvetica,sans-serif"><br></div></font></span></div><div class="gmail_extra"><br><div class="gmail_quote"><span class="">On Thu, Aug 31, 2017 at 8:55 AM, Barak Korren <span dir="ltr">&lt;<a href="mailto:bkorren@redhat.com" target="_blank">bkorren@redhat.com</a>&gt;</span> wrote:<br></span><div><div class="h5"><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">On 30 August 2017 at 22:20, Martin Perina &lt;<a href="mailto:mperina@redhat.com" target="_blank">mperina@redhat.com</a>&gt; wrote:<br>
&gt;<br>
&gt;&gt;<br>
&gt;&gt; So we&#39;re back in square one.<br>
&gt;&gt; Another possible culprit may be ansible: Vdsm is stopped two seconds<br>
&gt;&gt; after it logs to the host.<br>
&gt;&gt;<br>
&gt;&gt; Aug 30 11:26:24 lago-basic-suite-master-host-0 systemd: Starting<br>
&gt;&gt; Session 10 of user root.<br>
&gt;&gt; Aug 30 11:26:25 lago-basic-suite-master-host-0 python: ansible-setup<br>
&gt;&gt; Invoked with filter=* gather_subset=[&#39;all&#39;]<br>
&gt;&gt; fact_path=/etc/ansible/facts.d gather_timeout=10<br>
&gt;&gt; Aug 30 11:26:25 lago-basic-suite-master-host-0 python: ansible-command<br>
&gt;&gt; Invoked with warn=True executable=None _uses_shell=False<br>
&gt;&gt; _raw_params=bash -c &quot;rpm -qi vdsm | grep -oE<br>
&gt;&gt; &#39;Version\\s+:\\s+[0-9\\.]+&#39; | awk &#39;{print $3}&#39;&quot; removes=None<br>
&gt;&gt; creates=None chdir=None<br>
&gt;&gt; Aug 30 11:26:26 lago-basic-suite-master-host-0 python: ansible-systemd<br>
&gt;&gt; Invoked with no_block=False name=libvirt-guests enabled=True<br>
&gt;&gt; daemon_reload=False state=started user=False masked=None<br>
&gt;&gt; Aug 30 11:26:26 lago-basic-suite-master-host-0 systemd: Reloading.<br>
&gt;&gt; Aug 30 11:26:26 lago-basic-suite-master-host-0 systemd: Cannot add<br>
&gt;&gt; dependency job for unit lvm2-lvmetad.socket, ignoring: Unit is masked.<br>
&gt;&gt; Aug 30 11:26:26 lago-basic-suite-master-host-0 systemd: Stopped MOM<br>
&gt;&gt; instance configured for VDSM purposes.<br>
&gt;&gt; Aug 30 11:26:26 lago-basic-suite-master-host-0 systemd: Stopping<br>
&gt;&gt; Virtual Desktop Server Manager...<br>
&gt;&gt;<br>
&gt;&gt;<br>
&gt;&gt; could it be that it triggers a systemd-reload that makes systemd croak<br>
&gt;&gt; on the vdsm-mom cycle?<br>
&gt;<br>
&gt;<br>
&gt; We are not restarting VDSM within ovirt-host-deploy Ansible role, the VDSM<br>
&gt; restart is performed in host-deploy part same as in previous versions.<br>
&gt;<br>
&gt; Within ovirt-host-deploy-firewalld we only enable and restart firewalld<br>
&gt; service.<br>
&gt;<br>
<br>
comparing a successful add-host flow [1] to a failed one [2] we notice<br>
that in the failed add host ansible logs in twice (session 10 and<br>
session 11). Could it be somehow related? Notice that Session 11 uses<br>
the OLD way (awk+grep based) to find vdsm&#39;s version.<br>
<br>
Aug 30 05:55:53 lago-basic-suite-master-host-0 systemd-logind: New<br>
session 10 of user root.<br>
Aug 30 05:55:53 lago-basic-suite-master-host-0 systemd: Starting<br>
Session 10 of user root.<br>
Aug 30 05:55:53 lago-basic-suite-master-host-0 python: ansible-setup<br>
Invoked with filter=* gather_subset=[&#39;all&#39;]<br>
fact_path=/etc/ansible/facts.d gather_timeout=10<br>
Aug 30 05:55:54 lago-basic-suite-master-host-0 python: ansible-command<br>
Invoked with warn=True executable=None _uses_shell=False<br>
_raw_params=bash -c &quot;rpm -qi vdsm | grep -oE<br>
&#39;Version\\s+:\\s+[0-9\\.]+&#39; | awk &#39;{print $3}&#39;&quot; removes=None<br>
creates=None chdir=None<br>
Aug 30 05:55:54 lago-basic-suite-master-host-0 python: ansible-systemd<br>
Invoked with no_block=False name=libvirt-guests enabled=True<br>
daemon_reload=False state=started user=False masked=None<br>
Aug 30 05:55:54 lago-basic-suite-master-host-0 systemd: Reloading.<br>
Aug 30 05:55:55 lago-basic-suite-master-host-0 systemd: Cannot add<br>
dependency job for unit lvm2-lvmetad.socket, ignoring: Unit is masked.<br>
Aug 30 05:55:55 lago-basic-suite-master-host-0 systemd: Stopped MOM<br>
instance configured for VDSM purposes.<br>
Aug 30 05:55:55 lago-basic-suite-master-host-0 systemd: Stopping<br>
Virtual Desktop Server Manager...<br>
Aug 30 05:55:55 lago-basic-suite-master-host-0 systemd: Starting<br>
Suspend Active Libvirt Guests...<br>
Aug 30 05:55:55 lago-basic-suite-master-host-0 systemd: Started<br>
Suspend Active Libvirt Guests.<br>
Aug 30 05:55:55 lago-basic-suite-master-host-0 journal: libvirt<br>
version: 2.0.0, package: 10.el7_3.9 (CentOS BuildSystem<br>
&lt;<a href="http://bugs.centos.org" rel="noreferrer" target="_blank">http://bugs.centos.org</a>&gt;, 2017-05-25-20:52:28, <a href="http://c1bm.rdu2.centos.org" rel="noreferrer" target="_blank">c1bm.rdu2.centos.org</a>)<br>
Aug 30 05:55:55 lago-basic-suite-master-host-0 journal: hostname:<br>
lago-basic-suite-master-host-0<wbr>.lago.local<br>
Aug 30 05:55:55 lago-basic-suite-master-host-0 vdsmd_init_common.sh:<br>
vdsm: Running run_final_hooks<br>
Aug 30 05:55:55 lago-basic-suite-master-host-0 journal: End of file<br>
while reading data: Input/output error<br>
Aug 30 05:55:55 lago-basic-suite-master-host-0 systemd: Stopped<br>
Virtual Desktop Server Manager.<br>
Aug 30 05:55:55 lago-basic-suite-master-host-0 python: ansible-command<br>
Invoked with warn=True executable=None _uses_shell=False<br>
_raw_params=bash -c &quot;rpm -q vdsm --qf &#39;%{VERSION}&#39;&quot; removes=None<br>
creates=None chdir=None<br>
Aug 30 05:55:55 lago-basic-suite-master-host-0 python: ansible-systemd<br>
Invoked with no_block=False name=iptables enabled=False<br>
daemon_reload=False state=stopped user=False masked=None<br>
<br>
[1]: <a href="http://jenkins.ovirt.org/job/ovirt-master_change-queue-tester/2197/artifact/exported-artifacts/basic-suit-master-el7/test_logs/basic-suite-master/post-002_bootstrap.py/lago-basic-suite-master-host-0/_var_log/messages/*view*/" rel="noreferrer" target="_blank">http://jenkins.ovirt.org/job/o<wbr>virt-master_change-queue-teste<wbr>r/2197/artifact/exported-artif<wbr>acts/basic-suit-master-el7/<wbr>test_logs/basic-suite-master/<wbr>post-002_bootstrap.py/lago-<wbr>basic-suite-master-host-0/_<wbr>var_log/messages/*view*/</a><br>
<br>
<br>
[2]: <a href="http://jenkins.ovirt.org/job/ovirt-master_change-queue-tester/2151/artifact/exported-artifacts/basic-suit-master-el7/test_logs/basic-suite-master/post-002_bootstrap.py/lago-basic-suite-master-host-0/_var_log/messages/*view*/" rel="noreferrer" target="_blank">http://jenkins.ovirt.org/job/o<wbr>virt-master_change-queue-teste<wbr>r/2151/artifact/exported-artif<wbr>acts/basic-suit-master-el7/<wbr>test_logs/basic-suite-master/<wbr>post-002_bootstrap.py/lago-<wbr>basic-suite-master-host-0/_<wbr>var_log/messages/*view*/</a><br>
<span class="m_175589461823944653HOEnZb"><font color="#888888"><br>
--<br>
Barak Korren<br>
RHV DevOps team , RHCE, RHCi<br>
Red Hat EMEA<br>
<a href="http://redhat.com" rel="noreferrer" target="_blank">redhat.com</a> | TRIED. TESTED. TRUSTED. | <a href="http://redhat.com/trusted" rel="noreferrer" target="_blank">redhat.com/trusted</a><br>
</font></span></blockquote></div></div></div><br></div>
<br>______________________________<wbr>_________________<br>
Devel mailing list<br>
<a href="mailto:Devel@ovirt.org">Devel@ovirt.org</a><br>
<a href="http://lists.ovirt.org/mailman/listinfo/devel" rel="noreferrer" target="_blank">http://lists.ovirt.org/<wbr>mailman/listinfo/devel</a><br></blockquote></div><br></div>