<div dir="ltr"><br><br><div class="gmail_quote"><div dir="ltr">On Sun, Mar 18, 2018 at 4:21 PM Yedidyah Bar David &lt;<a href="mailto:didi@redhat.com">didi@redhat.com</a>&gt; wrote:<br></div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">On Sun, Mar 18, 2018 at 2:48 PM, Yedidyah Bar David &lt;<a href="mailto:didi@redhat.com" target="_blank">didi@redhat.com</a>&gt; wrote:<br>
&gt; On Sun, Mar 18, 2018 at 1:45 PM, Yedidyah Bar David &lt;<a href="mailto:didi@redhat.com" target="_blank">didi@redhat.com</a>&gt; wrote:<br>
&gt;&gt; On Sun, Mar 18, 2018 at 11:20 AM,  &lt;<a href="mailto:jenkins@jenkins.phx.ovirt.org" target="_blank">jenkins@jenkins.phx.ovirt.org</a>&gt; wrote:<br>
&gt;&gt;&gt; Project: <a href="http://jenkins.ovirt.org/job/ovirt-system-tests_he-basic-suite-4.1/" rel="noreferrer" target="_blank">http://jenkins.ovirt.org/job/ovirt-system-tests_he-basic-suite-4.1/</a><br>
&gt;&gt;&gt; Build: <a href="http://jenkins.ovirt.org/job/ovirt-system-tests_he-basic-suite-4.1/223/" rel="noreferrer" target="_blank">http://jenkins.ovirt.org/job/ovirt-system-tests_he-basic-suite-4.1/223/</a><br>
&gt;&gt;&gt; Build Number: 223<br>
&gt;&gt;&gt; Build Status:  Still Failing<br>
&gt;&gt;&gt; Triggered By: Started by timer<br>
&gt;&gt;<br>
&gt;&gt; It was broken by:<br>
&gt;&gt;<br>
&gt;&gt; [1] <a href="https://gerrit.ovirt.org/88483" rel="noreferrer" target="_blank">https://gerrit.ovirt.org/88483</a><br>
&gt;&gt;<br>
&gt;&gt; It should be fixed by:<br>
&gt;&gt;<br>
&gt;&gt; [2] <a href="https://bugzilla.redhat.com/show_bug.cgi?id=1554283" rel="noreferrer" target="_blank">https://bugzilla.redhat.com/show_bug.cgi?id=1554283</a><br>
&gt;&gt;<br>
&gt;&gt; [2] is on modified, no idea about its status other than that.<br>
&gt;&gt;<br>
&gt;&gt; I didn&#39;t intend to merge [1] before [2] is fixed, not sure if<br>
&gt;&gt; Sandro didn&#39;t notice my comment there, or thought that [2] is fixed.<br>
&gt;&gt;<br>
&gt;&gt; If it&#39;s annoying we can revert [1] and re-merge when [2] is fixed.<br>
&gt;<br>
&gt; Talked with Gal, and pushed this, which should hopefully fix:<br>
&gt;<br>
&gt; <a href="https://gerrit.ovirt.org/89136" rel="noreferrer" target="_blank">https://gerrit.ovirt.org/89136</a><br>
<br>
It indeed seems to fix [1][2]:<br>
<br>
13:13:24   # he_get_shared_config:<br>
13:13:26   # he_get_shared_config: Success (in 0:00:01)<br>
13:13:26   # sleep:<br>
13:15:26   # sleep: Success (in 0:02:00)<br>
13:15:26   # add_he_hosts:<br>
13:16:18   # add_he_hosts: Success (in 0:00:52)<br>
13:16:18   # he_check_ha_agent:<br>
13:16:19   # he_check_ha_agent: Success (in 0:00:00)<br>
<br>
But later fails:<br>
<br>
13:16:20   # add_secondary_storage_domains:<br>
13:19:30 Error while running thread<br>
13:19:30 Traceback (most recent call last):<br>
13:19:30   File &quot;/usr/lib/python2.7/site-packages/lago/utils.py&quot;, line<br>
58, in _ret_via_queue<br>
13:19:30     queue.put({&#39;return&#39;: func()})<br>
13:19:30   File<br>
&quot;/home/jenkins/workspace/ovirt-system-tests_master_check-patch-el7-x86_64/ovirt-system-tests/he-basic-suite-4.1/test-scenarios/002_bootstrap.py&quot;,<br>
line 491, in add_nfs_storage_domain<br>
13:19:30     add_generic_nfs_storage_domain(prefix, SD_NFS_NAME,<br>
SD_NFS_HOST_NAME, SD_NFS_PATH)<br>
13:19:30   File<br>
&quot;/home/jenkins/workspace/ovirt-system-tests_master_check-patch-el7-x86_64/ovirt-system-tests/he-basic-suite-4.1/test-scenarios/002_bootstrap.py&quot;,<br>
line 496, in add_generic_nfs_storage_domain<br>
13:19:30     add_generic_nfs_storage_domain_4(prefix, sd_nfs_name,<br>
nfs_host_name, mount_path, sd_format, sd_type, nfs_version)<br>
13:19:30   File<br>
&quot;/home/jenkins/workspace/ovirt-system-tests_master_check-patch-el7-x86_64/ovirt-system-tests/he-basic-suite-4.1/test-scenarios/002_bootstrap.py&quot;,<br>
line 552, in add_generic_nfs_storage_domain_4<br>
13:19:30     _add_storage_domain_4(api, p)<br>
13:19:30   File<br>
&quot;/home/jenkins/workspace/ovirt-system-tests_master_check-patch-el7-x86_64/ovirt-system-tests/he-basic-suite-4.1/test-scenarios/002_bootstrap.py&quot;,<br>
line 466, in _add_storage_domain_4<br>
13:19:30     id=<a href="http://sd.id" rel="noreferrer" target="_blank">sd.id</a>,<br>
13:19:30   File<br>
&quot;/usr/lib64/python2.7/site-packages/ovirtsdk4/services.py&quot;, line 2219,<br>
in add<br>
13:19:30     return self._internal_add(storage_domain, headers, query, wait)<br>
13:19:30   File<br>
&quot;/usr/lib64/python2.7/site-packages/ovirtsdk4/service.py&quot;, line 223,<br>
in _internal_add<br>
13:19:30     return future.wait() if wait else future<br>
13:19:30   File<br>
&quot;/usr/lib64/python2.7/site-packages/ovirtsdk4/service.py&quot;, line 53, in<br>
wait<br>
13:19:30     return self._code(response)<br>
13:19:30   File<br>
&quot;/usr/lib64/python2.7/site-packages/ovirtsdk4/service.py&quot;, line 220,<br>
in callback<br>
13:19:30     self._check_fault(response)<br>
13:19:30   File<br>
&quot;/usr/lib64/python2.7/site-packages/ovirtsdk4/service.py&quot;, line 123,<br>
in _check_fault<br>
13:19:30     self._raise_error(response, body)<br>
13:19:30   File<br>
&quot;/usr/lib64/python2.7/site-packages/ovirtsdk4/service.py&quot;, line 109,<br>
in _raise_error<br>
13:19:30     raise error<br>
13:19:30 Error: Fault reason is &quot;Operation Failed&quot;. Fault detail is<br>
&quot;[Storage domain cannot be reached. Please ensure it is accessible<br>
from the host(s).]&quot;. HTTP response code is 400.<br>
13:19:30 Error while running thread<br>
<br>
I&#39;ll retrigger now, but perhaps someone from storage wants to check.<br>
vdsm log has[3]:<br>
<br>
2018-03-18 08:59:52,918-0400 INFO  (jsonrpc/3) [jsonrpc.JsonRpcServer]<br>
RPC call Host.getHardwareInfo succeeded in 0.00 seconds (__init__:539)<br>
2018-03-18 08:59:52,924-0400 INFO  (jsonrpc/4) [vdsm.api] START<br>
prepareImage(sdUUID=u&#39;424e809d-b7ad-4ed5-b6a1-5426d373a5d2&#39;,<br>
spUUID=u&#39;8509c64f-cdd7-4713-9e09-a79d90ba26ed&#39;,<br>
imgUUID=u&#39;718090ba-b36e-45cc-bcd6-597c16a766b9&#39;,<br>
leafUUID=u&#39;23ee843b-20e0-4afe-98dc-e165334ac710&#39;, allowIllegal=False)<br>
from=::1,36020, task_id=92aeabf2-084a-4c98-a6e5-03efae38e7b3 (api:46)<br>
2018-03-18 08:59:52,928-0400 INFO  (jsonrpc/4) [vdsm.api] FINISH<br>
prepareImage error=Volume does not exist:<br>
(u&#39;23ee843b-20e0-4afe-98dc-e165334ac710&#39;,) from=::1,36020,<br>
task_id=92aeabf2-084a-4c98-a6e5-03efae38e7b3 (api:50)<br>
2018-03-18 08:59:52,928-0400 ERROR (jsonrpc/4)<br>
[storage.TaskManager.Task]<br>
(Task=&#39;92aeabf2-084a-4c98-a6e5-03efae38e7b3&#39;) Unexpected error<br>
(task:872)<br>
Traceback (most recent call last):<br>
  File &quot;/usr/share/vdsm/storage/task.py&quot;, line 879, in _run<br>
    return fn(*args, **kargs)<br>
  File &quot;&lt;string&gt;&quot;, line 2, in prepareImage<br>
  File &quot;/usr/lib/python2.7/site-packages/vdsm/common/api.py&quot;, line 48, in method<br>
    ret = func(*args, **kwargs)<br>
  File &quot;/usr/share/vdsm/storage/hsm.py&quot;, line 3137, in prepareImage<br>
    raise se.VolumeDoesNotExist(leafUUID)<br>
VolumeDoesNotExist: Volume does not exist:<br>
(u&#39;23ee843b-20e0-4afe-98dc-e165334ac710&#39;,)<br></blockquote><div><br></div><div>Immediately after this failed prepare, we see:</div><div> </div>2018-03-18 08:59:52,941-0400 INFO (jsonrpc/7) [vdsm.api] START createVolume(sdUUID=u&#39;424e809d-b7ad-4ed5-b6a1-5426d373a5d2&#39;, spUUID=u&#39;8509c64f-cdd7-4713-9e09-a79d90ba26ed&#39;, imgUUID=u&#39;718090ba-b36e-45cc-bcd6-597c16a766b9&#39;, size=u&#39;1048576&#39;, volFormat=5, preallocate=1, diskType=2, volUUID=u&#39;23ee843b-20e0-4afe-98dc-e165334ac710&#39;, desc=u&#39;hosted-engine.lockspace&#39;, srcImgUUID=u&#39;00000000-0000-0000-0000-000000000000&#39;, srcVolUUID=u&#39;00000000-0000-0000-0000-000000000000&#39;, initialSize=None) from=::1,36020, task_id=c4c73314-ee10-419a-8265-b8b87763e99f (api:46) </div><div class="gmail_quote"><br></div><div class="gmail_quote">So that volume really did not exit when it was prepared, which means it is a client</div><div class="gmail_quote">error, not vdsm error.</div><div class="gmail_quote"><br></div><div class="gmail_quote">The caller (hosted engine agent?) should use Volume.getInfo to check if a volume</div><div class="gmail_quote">exits. Non existing volume is considered expected error in this call and will be</div><div class="gmail_quote">logged as with INFO log level.</div><div class="gmail_quote"><br></div><div class="gmail_quote">See <a href="https://github.com/oVirt/vdsm/blob/f286cf7825578d8997ca6abc0801a823dee247ef/lib/vdsm/storage/hsm.py#L3075">https://github.com/oVirt/vdsm/blob/f286cf7825578d8997ca6abc0801a823dee247ef/lib/vdsm/storage/hsm.py#L3075</a><br>    <br></div><div class="gmail_quote"><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
[1] <a href="http://jenkins.ovirt.org/job/ovirt-system-tests_master_check-patch-el7-x86_64/4537/" rel="noreferrer" target="_blank">http://jenkins.ovirt.org/job/ovirt-system-tests_master_check-patch-el7-x86_64/4537/</a><br>
[2] <a href="http://jenkins.ovirt.org/job/ovirt-system-tests_master_check-patch-el7-x86_64/4537/consoleFull" rel="noreferrer" target="_blank">http://jenkins.ovirt.org/job/ovirt-system-tests_master_check-patch-el7-x86_64/4537/consoleFull</a><br>
[3] <a href="http://jenkins.ovirt.org/job/ovirt-system-tests_master_check-patch-el7-x86_64/4537/artifact/exported-artifacts/he-basic-suite-4.1__logs/test_logs/he-basic-suite-4.1/post-002_bootstrap.py/lago-he-basic-suite-4-1-host0/_var_log/vdsm/vdsm.log" rel="noreferrer" target="_blank">http://jenkins.ovirt.org/job/ovirt-system-tests_master_check-patch-el7-x86_64/4537/artifact/exported-artifacts/he-basic-suite-4.1__logs/test_logs/he-basic-suite-4.1/post-002_bootstrap.py/lago-he-basic-suite-4-1-host0/_var_log/vdsm/vdsm.log</a><br>
<br>
&gt;<br>
&gt;&gt;<br>
&gt;&gt;&gt;<br>
&gt;&gt;&gt; -------------------------------------<br>
&gt;&gt;&gt; Changes Since Last Success:<br>
&gt;&gt;&gt; -------------------------------------<br>
&gt;&gt;&gt; Changes for Build #218<br>
&gt;&gt;&gt; [Martin Sivak] Cleanup of test code - next() replaced with any()<br>
&gt;&gt;&gt;<br>
&gt;&gt;&gt;<br>
&gt;&gt;&gt; Changes for Build #219<br>
&gt;&gt;&gt; [Martin Sivak] Cleanup of test code - next() replaced with any()<br>
&gt;&gt;&gt;<br>
&gt;&gt;&gt;<br>
&gt;&gt;&gt; Changes for Build #220<br>
&gt;&gt;&gt; [Martin Sivak] Cleanup of test code - next() replaced with any()<br>
&gt;&gt;&gt;<br>
&gt;&gt;&gt;<br>
&gt;&gt;&gt; Changes for Build #221<br>
&gt;&gt;&gt; [Martin Sivak] Cleanup of test code - next() replaced with any()<br>
&gt;&gt;&gt;<br>
&gt;&gt;&gt;<br>
&gt;&gt;&gt; Changes for Build #222<br>
&gt;&gt;&gt; [Martin Sivak] Cleanup of test code - next() replaced with any()<br>
&gt;&gt;&gt;<br>
&gt;&gt;&gt;<br>
&gt;&gt;&gt; Changes for Build #223<br>
&gt;&gt;&gt; [Martin Sivak] Cleanup of test code - next() replaced with any()<br>
&gt;&gt;&gt;<br>
&gt;&gt;&gt;<br>
&gt;&gt;&gt;<br>
&gt;&gt;&gt;<br>
&gt;&gt;&gt; -----------------<br>
&gt;&gt;&gt; Failed Tests:<br>
&gt;&gt;&gt; -----------------<br>
&gt;&gt;&gt; 1 tests failed.<br>
&gt;&gt;&gt; FAILED:  002_bootstrap.he_check_ha_agent<br>
&gt;&gt;&gt;<br>
&gt;&gt;&gt; Error Message:<br>
&gt;&gt;&gt; False is not true<br>
&gt;&gt;&gt; -------------------- &gt;&gt; begin captured logging &lt;&lt; --------------------<br>
&gt;&gt;&gt; lago.ssh: DEBUG: start task:cb75d7f3-e333-47f1-b3e9-76ff9c0b4b21:Get ssh client for lago-he-basic-suite-4-1-host0:<br>
&gt;&gt;&gt; lago.ssh: DEBUG: end task:cb75d7f3-e333-47f1-b3e9-76ff9c0b4b21:Get ssh client for lago-he-basic-suite-4-1-host0:<br>
&gt;&gt;&gt; lago.ssh: DEBUG: Running 7e67a956 on lago-he-basic-suite-4-1-host0: grep Permission denied /var/log/ovirt-hosted-engine-ha/agent.log<br>
&gt;&gt;&gt; lago.ssh: DEBUG: Command 7e67a956 on lago-he-basic-suite-4-1-host0 returned with 2<br>
&gt;&gt;&gt; lago.ssh: DEBUG: Command 7e67a956 on lago-he-basic-suite-4-1-host0 output:<br>
&gt;&gt;&gt;  /var/log/ovirt-hosted-engine-ha/agent.log:IOError: [Errno 13] Permission denied: &#39;/var/run/ovirt-hosted-engine-ha/vm.conf&#39;<br>
&gt;&gt;&gt; /var/log/ovirt-hosted-engine-ha/agent.log:IOError: [Errno 13] Permission denied: &#39;/var/run/ovirt-hosted-engine-ha/vm.conf&#39;<br>
&gt;&gt;&gt; /var/log/ovirt-hosted-engine-ha/agent.log:IOError: [Errno 13] Permission denied: &#39;/var/run/ovirt-hosted-engine-ha/vm.conf&#39;<br>
&gt;&gt;&gt; /var/log/ovirt-hosted-engine-ha/agent.log:IOError: [Errno 13] Permission denied: &#39;/var/run/ovirt-hosted-engine-ha/vm.conf&#39;<br>
&gt;&gt;&gt; /var/log/ovirt-hosted-engine-ha/agent.log:IOError: [Errno 13] Permission denied: &#39;/var/run/ovirt-hosted-engine-ha/vm.conf&#39;<br>
&gt;&gt;&gt; /var/log/ovirt-hosted-engine-ha/agent.log:IOError: [Errno 13] Permission denied: &#39;/var/run/ovirt-hosted-engine-ha/vm.conf&#39;<br>
&gt;&gt;&gt; /var/log/ovirt-hosted-engine-ha/agent.log:IOError: [Errno 13] Permission denied: &#39;/var/run/ovirt-hosted-engine-ha/vm.conf&#39;<br>
&gt;&gt;&gt; /var/log/ovirt-hosted-engine-ha/agent.log:IOError: [Errno 13] Permission denied: &#39;/var/run/ovirt-hosted-engine-ha/vm.conf&#39;<br>
&gt;&gt;&gt; /var/log/ovirt-hosted-engine-ha/agent.log:IOError: [Errno 13] Permission denied: &#39;/var/run/ovirt-hosted-engine-ha/vm.conf&#39;<br>
&gt;&gt;&gt;<br>
&gt;&gt;&gt; lago.ssh: DEBUG: Command 7e67a956 on lago-he-basic-suite-4-1-host0  errors:<br>
&gt;&gt;&gt;  grep: denied: No such file or directory<br>
&gt;&gt;&gt;<br>
&gt;&gt;&gt; --------------------- &gt;&gt; end captured logging &lt;&lt; ---------------------<br>
&gt;&gt;&gt;<br>
&gt;&gt;&gt; Stack Trace:<br>
&gt;&gt;&gt;   File &quot;/usr/lib64/python2.7/unittest/case.py&quot;, line 369, in run<br>
&gt;&gt;&gt;     testMethod()<br>
&gt;&gt;&gt;   File &quot;/usr/lib/python2.7/site-packages/nose/case.py&quot;, line 197, in runTest<br>
&gt;&gt;&gt;     self.test(*self.arg)<br>
&gt;&gt;&gt;   File &quot;/usr/lib/python2.7/site-packages/ovirtlago/testlib.py&quot;, line 129, in wrapped_test<br>
&gt;&gt;&gt;     test()<br>
&gt;&gt;&gt;   File &quot;/usr/lib/python2.7/site-packages/ovirtlago/testlib.py&quot;, line 59, in wrapper<br>
&gt;&gt;&gt;     return func(get_test_prefix(), *args, **kwargs)<br>
&gt;&gt;&gt;   File &quot;/home/jenkins/workspace/ovirt-system-tests_he-basic-suite-4.1/ovirt-system-tests/he-basic-suite-4.1/test-scenarios/002_bootstrap.py&quot;, line 1036, in he_check_ha_agent<br>
&gt;&gt;&gt;     &#39;Permission denied&#39; not in result.out<br>
&gt;&gt;&gt;   File &quot;/usr/lib64/python2.7/unittest/case.py&quot;, line 462, in assertTrue<br>
&gt;&gt;&gt;     raise self.failureException(msg)<br>
&gt;&gt;&gt; &quot;False is not true\n-------------------- &gt;&gt; begin captured logging &lt;&lt; --------------------\nlago.ssh: DEBUG: start task:cb75d7f3-e333-47f1-b3e9-76ff9c0b4b21:Get ssh client for lago-he-basic-suite-4-1-host0:\nlago.ssh: DEBUG: end task:cb75d7f3-e333-47f1-b3e9-76ff9c0b4b21:Get ssh client for lago-he-basic-suite-4-1-host0:\nlago.ssh: DEBUG: Running 7e67a956 on lago-he-basic-suite-4-1-host0: grep Permission denied /var/log/ovirt-hosted-engine-ha/agent.log\nlago.ssh: DEBUG: Command 7e67a956 on lago-he-basic-suite-4-1-host0 returned with 2\nlago.ssh: DEBUG: Command 7e67a956 on lago-he-basic-suite-4-1-host0 output:\n /var/log/ovirt-hosted-engine-ha/agent.log:IOError: [Errno 13] Permission denied: &#39;/var/run/ovirt-hosted-engine-ha/vm.conf&#39;\n/var/log/ovirt-hosted-engine-ha/agent.log:IOError: [Errno 13] Permission denied: &#39;/var/run/ovirt-hosted-engine-ha/vm.conf&#39;\n/var/log/ovirt-hosted-engine-ha/agent.log:IOError: [Errno 13] Permission denied: &#39;/var/run/ovirt-hosted-engine-ha/vm.conf<br>
 &#39;\n/var/log/ovirt-hosted-engine-ha/agent.log:IOError: [Errno 13] Permission denied: &#39;/var/run/ovirt-hosted-engine-ha/vm.conf&#39;\n/var/log/ovirt-hosted-engine-ha/agent.log:IOError: [Errno 13] Permission denied: &#39;/var/run/ovirt-hosted-engine-ha/vm.conf&#39;\n/var/log/ovirt-hosted-engine-ha/agent.log:IOError: [Errno 13] Permission denied: &#39;/var/run/ovirt-hosted-engine-ha/vm.conf&#39;\n/var/log/ovirt-hosted-engine-ha/agent.log:IOError: [Errno 13] Permission denied: &#39;/var/run/ovirt-hosted-engine-ha/vm.conf&#39;\n/var/log/ovirt-hosted-engine-ha/agent.log:IOError: [Errno 13] Permission denied: &#39;/var/run/ovirt-hosted-engine-ha/vm.conf&#39;\n/var/log/ovirt-hosted-engine-ha/agent.log:IOError: [Errno 13] Permission denied: &#39;/var/run/ovirt-hosted-engine-ha/vm.conf&#39;\n\nlago.ssh: DEBUG: Command 7e67a956 on lago-he-basic-suite-4-1-host0  errors:\n grep: denied: No such file or directory\n\n--------------------- &gt;&gt; end captured logging &lt;&lt; ---------------------&quot;<br>
&gt;&gt;<br>
&gt;&gt;<br>
&gt;&gt;<br>
&gt;&gt; --<br>
&gt;&gt; Didi<br>
&gt;<br>
&gt;<br>
&gt;<br>
&gt; --<br>
&gt; Didi<br>
<br>
<br>
<br>
--<br>
Didi<br>
_______________________________________________<br>
Infra mailing list<br>
<a href="mailto:Infra@ovirt.org" target="_blank">Infra@ovirt.org</a><br>
<a href="http://lists.ovirt.org/mailman/listinfo/infra" rel="noreferrer" target="_blank">http://lists.ovirt.org/mailman/listinfo/infra</a><br>
</blockquote></div></div>