<div dir="ltr">Hi Nir,<br><div><div class="gmail_extra"><br><div class="gmail_quote">On 16 February 2017 at 13:55, Nir Soffer <span dir="ltr">&lt;<a href="mailto:nsoffer@redhat.com" target="_blank">nsoffer@redhat.com</a>&gt;</span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><span class="">On Mon, Feb 13, 2017 at 3:35 PM, Doug Ingham &lt;<a href="mailto:dougti@gmail.com">dougti@gmail.com</a>&gt; wrote:<br>
&gt; Hi Sahina,<br>
&gt;<br>
&gt; On 13 February 2017 at 05:45, Sahina Bose &lt;<a href="mailto:sabose@redhat.com">sabose@redhat.com</a>&gt; wrote:<br>
&gt;&gt;<br>
&gt;&gt; Any errors in the gluster mount logs for this gluster volume?<br>
&gt;&gt;<br>
&gt;&gt; How about &quot;gluster vol heal &lt;volname&gt; info&quot; - does it list any entries to<br>
&gt;&gt; heal?<br>
&gt;<br>
&gt;<br>
&gt; After more investigating, I found out that there is a sanlock daemon that<br>
&gt; runs with VDSM, independently of the HE, so I&#39;d basically have to bring the<br>
&gt; volume down &amp; wait for the leases to expire/delete them* before I can import<br>
&gt; the domain.<br>
&gt;<br>
&gt; *I understand removing /dom_md/leases/ should do the job?<br>
<br>
</span>No, the issue is probably dom_md/ids accessed by sanlock, but removing files<br>
accessed by sanlock will not help, an open file will remain open until sanlock<br>
close the file.<br></blockquote><div><br></div><div>I&#39;m testing this with volume snapshots at the moment, so there are no processes accessing the new volume.<br></div><div> <br><br></div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
Did you try to reboot the host before installing it again? If you did and you<br>
still have these issues, you probably need to remove the previous installation<br>
properly before installing again.<br>
<br>
Adding Simone to help with uninstalling and reinstalling hosted engine.<br></blockquote><div><br></div><div>The Hosted-Engine database had been corrupted and the restore wasn&#39;t running correctly, so I installed a new engine on a new server - no restores or old data. The aim is to import the old storage domain into the new Engine &amp; then import the VMs into the new storage domain.<br></div><div>My only problem with this is that there appear to be some file based leases somewhere that, unless I manage to locate &amp; delete them, force me to wait for the leases to timeout before I can import the old storage domain.<br></div><div>To minimise downtime, I&#39;m trying to avoid having to wait for the leases to timeout.<br><br></div><div>Regards,<br></div><div> Doug<br></div><div> </div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
<span class="HOEnZb"><font color="#888888"><br>
Nir<br>
</font></span><div class="HOEnZb"><div class="h5"><br>
&gt;<br>
&gt;<br>
&gt;&gt;<br>
&gt;&gt;<br>
&gt;&gt; On Thu, Feb 9, 2017 at 11:57 PM, Doug Ingham &lt;<a href="mailto:dougti@gmail.com">dougti@gmail.com</a>&gt; wrote:<br>
&gt;&gt;&gt;<br>
&gt;&gt;&gt; Some interesting output from the vdsm log...<br>
&gt;&gt;&gt;<br>
&gt;&gt;&gt;<br>
&gt;&gt;&gt; 2017-02-09 15:16:24,051 INFO  (jsonrpc/1) [storage.StorageDomain]<br>
&gt;&gt;&gt; Resource namespace 01_img_60455567-ad30-42e3-<wbr>a9df-62fe86c7fd25 already<br>
&gt;&gt;&gt; registered (sd:731)<br>
&gt;&gt;&gt; 2017-02-09 15:16:24,051 INFO  (jsonrpc/1) [storage.StorageDomain]<br>
&gt;&gt;&gt; Resource namespace 02_vol_60455567-ad30-42e3-<wbr>a9df-62fe86c7fd25 already<br>
&gt;&gt;&gt; registered (sd:740)<br>
&gt;&gt;&gt; 2017-02-09 15:16:24,052 INFO  (jsonrpc/1) [storage.SANLock] Acquiring<br>
&gt;&gt;&gt; Lease(name=&#39;SDM&#39;,<br>
&gt;&gt;&gt; path=u&#39;/rhev/data-center/mnt/<wbr>glusterSD/localhost:data2/<wbr>60455567-ad30-42e3-a9df-6<br>
&gt;&gt;&gt; 2fe86c7fd25/dom_md/leases&#39;, offset=1048576) for host id 1<br>
&gt;&gt;&gt; (clusterlock:343)<br>
&gt;&gt;&gt; 2017-02-09 15:16:24,057 INFO  (jsonrpc/1) [storage.SANLock] Releasing<br>
&gt;&gt;&gt; host id for domain 60455567-ad30-42e3-a9df-<wbr>62fe86c7fd25 (id: 1)<br>
&gt;&gt;&gt; (clusterlock:305)<br>
&gt;&gt;&gt; 2017-02-09 15:16:25,149 INFO  (jsonrpc/3) [jsonrpc.JsonRpcServer] RPC<br>
&gt;&gt;&gt; call GlusterHost.list succeeded in 0.17 seconds (__init__:515)<br>
&gt;&gt;&gt; 2017-02-09 15:16:25,264 INFO  (Reactor thread)<br>
&gt;&gt;&gt; [ProtocolDetector.<wbr>AcceptorImpl] Accepted connection from<br>
&gt;&gt;&gt; ::ffff:<a href="http://127.0.0.1:55060" rel="noreferrer" target="_blank">127.0.0.1:55060</a> (protocoldetector:72)<br>
&gt;&gt;&gt; 2017-02-09 15:16:25,270 INFO  (Reactor thread)<br>
&gt;&gt;&gt; [ProtocolDetector.Detector] Detected protocol stomp from<br>
&gt;&gt;&gt; ::ffff:<a href="http://127.0.0.1:55060" rel="noreferrer" target="_blank">127.0.0.1:55060</a> (protocoldetector:127)<br>
&gt;&gt;&gt; 2017-02-09 15:16:25,271 INFO  (Reactor thread) [Broker.StompAdapter]<br>
&gt;&gt;&gt; Processing CONNECT request (stompreactor:102)<br>
&gt;&gt;&gt; 2017-02-09 15:16:25,271 INFO  (JsonRpc (StompReactor))<br>
&gt;&gt;&gt; [Broker.StompAdapter] Subscribe command received (stompreactor:129)<br>
&gt;&gt;&gt; 2017-02-09 15:16:25,416 INFO  (jsonrpc/5) [jsonrpc.JsonRpcServer] RPC<br>
&gt;&gt;&gt; call Host.getHardwareInfo succeeded in 0.01 seconds (__init__:515)<br>
&gt;&gt;&gt; 2017-02-09 15:16:25,419 INFO  (jsonrpc/6) [dispatcher] Run and protect:<br>
&gt;&gt;&gt; repoStats(options=None) (logUtils:49)<br>
&gt;&gt;&gt; 2017-02-09 15:16:25,419 INFO  (jsonrpc/6) [dispatcher] Run and protect:<br>
&gt;&gt;&gt; repoStats, Return response: {u&#39;e8d04da7-ad3d-4227-a45d-<wbr>b5a29b2f43e5&#39;:<br>
&gt;&gt;&gt; {&#39;code&#39;: 0, &#39;actual&#39;: True<br>
&gt;&gt;&gt; , &#39;version&#39;: 4, &#39;acquired&#39;: True, &#39;delay&#39;: &#39;0.000854128&#39;, &#39;lastCheck&#39;:<br>
&gt;&gt;&gt; &#39;5.1&#39;, &#39;valid&#39;: True}, u&#39;a77b8821-ff19-4d17-a3ce-<wbr>a6c3a69436d5&#39;: {&#39;code&#39;: 0,<br>
&gt;&gt;&gt; &#39;actual&#39;: True, &#39;vers<br>
&gt;&gt;&gt; ion&#39;: 4, &#39;acquired&#39;: True, &#39;delay&#39;: &#39;0.000966556&#39;, &#39;lastCheck&#39;: &#39;2.6&#39;,<br>
&gt;&gt;&gt; &#39;valid&#39;: True}} (logUtils:52)<br>
&gt;&gt;&gt; 2017-02-09 15:16:25,447 INFO  (jsonrpc/6) [jsonrpc.JsonRpcServer] RPC<br>
&gt;&gt;&gt; call Host.getStats succeeded in 0.03 seconds (__init__:515)<br>
&gt;&gt;&gt; 2017-02-09 15:16:25,450 ERROR (JsonRpc (StompReactor)) [vds.dispatcher]<br>
&gt;&gt;&gt; SSL error receiving from &lt;yajsonrpc.betterAsyncore.<wbr>Dispatcher connected<br>
&gt;&gt;&gt; (&#39;::ffff:127.0.0.1&#39;, 55060, 0, 0) at 0x7f69c0043cf8&gt;: unexpected eof<br>
&gt;&gt;&gt; (betterAsyncore:113)<br>
&gt;&gt;&gt; 2017-02-09 15:16:25,812 INFO  (jsonrpc/7) [jsonrpc.JsonRpcServer] RPC<br>
&gt;&gt;&gt; call GlusterVolume.list succeeded in 0.10 seconds (__init__:515)<br>
&gt;&gt;&gt; 2017-02-09 15:16:25,940 INFO  (Reactor thread)<br>
&gt;&gt;&gt; [ProtocolDetector.<wbr>AcceptorImpl] Accepted connection from<br>
&gt;&gt;&gt; ::ffff:<a href="http://127.0.0.1:55062" rel="noreferrer" target="_blank">127.0.0.1:55062</a> (protocoldetector:72)<br>
&gt;&gt;&gt; 2017-02-09 15:16:25,946 INFO  (Reactor thread)<br>
&gt;&gt;&gt; [ProtocolDetector.Detector] Detected protocol stomp from<br>
&gt;&gt;&gt; ::ffff:<a href="http://127.0.0.1:55062" rel="noreferrer" target="_blank">127.0.0.1:55062</a> (protocoldetector:127)<br>
&gt;&gt;&gt; 2017-02-09 15:16:25,947 INFO  (Reactor thread) [Broker.StompAdapter]<br>
&gt;&gt;&gt; Processing CONNECT request (stompreactor:102)<br>
&gt;&gt;&gt; 2017-02-09 15:16:25,947 INFO  (JsonRpc (StompReactor))<br>
&gt;&gt;&gt; [Broker.StompAdapter] Subscribe command received (stompreactor:129)<br>
&gt;&gt;&gt; 2017-02-09 15:16:26,058 ERROR (jsonrpc/1) [storage.TaskManager.Task]<br>
&gt;&gt;&gt; (Task=&#39;02cad901-5fe8-4f2d-<wbr>895b-14184f67feab&#39;) Unexpected error (task:870)<br>
&gt;&gt;&gt; Traceback (most recent call last):<br>
&gt;&gt;&gt;   File &quot;/usr/share/vdsm/storage/task.<wbr>py&quot;, line 877, in _run<br>
&gt;&gt;&gt;     return fn(*args, **kargs)<br>
&gt;&gt;&gt;   File &quot;/usr/lib/python2.7/site-<wbr>packages/vdsm/logUtils.py&quot;, line 50, in<br>
&gt;&gt;&gt; wrapper<br>
&gt;&gt;&gt;     res = f(*args, **kwargs)<br>
&gt;&gt;&gt;   File &quot;/usr/share/vdsm/storage/hsm.<wbr>py&quot;, line 812, in<br>
&gt;&gt;&gt; forcedDetachStorageDomain<br>
&gt;&gt;&gt;     self._<wbr>deatchStorageDomainFromOldPool<wbr>s(sdUUID)<br>
&gt;&gt;&gt;   File &quot;/usr/share/vdsm/storage/hsm.<wbr>py&quot;, line 790, in<br>
&gt;&gt;&gt; _<wbr>deatchStorageDomainFromOldPool<wbr>s<br>
&gt;&gt;&gt;     dom.acquireClusterLock(host_<wbr>id)<br>
&gt;&gt;&gt;   File &quot;/usr/share/vdsm/storage/sd.<wbr>py&quot;, line 810, in acquireClusterLock<br>
&gt;&gt;&gt;     self._manifest.<wbr>acquireDomainLock(hostID)<br>
&gt;&gt;&gt;   File &quot;/usr/share/vdsm/storage/sd.<wbr>py&quot;, line 499, in acquireDomainLock<br>
&gt;&gt;&gt;     self._domainLock.acquire(<wbr>hostID, self.getDomainLease())<br>
&gt;&gt;&gt;   File &quot;/usr/lib/python2.7/site-<wbr>packages/vdsm/storage/<wbr>clusterlock.py&quot;,<br>
&gt;&gt;&gt; line 362, in acquire<br>
&gt;&gt;&gt;     &quot;Cannot acquire %s&quot; % (lease,), str(e))<br>
&gt;&gt;&gt; AcquireLockFailure: Cannot obtain lock:<br>
&gt;&gt;&gt; u&quot;id=60455567-ad30-42e3-a9df-<wbr>62fe86c7fd25, rc=5, out=Cannot acquire<br>
&gt;&gt;&gt; Lease(name=&#39;SDM&#39;,<br>
&gt;&gt;&gt; path=u&#39;/rhev/data-center/mnt/<wbr>glusterSD/localhost:data2/<wbr>60455567-ad30-42e3-a9df-<wbr>62fe86c7fd25/dom_md/leases&#39;,<br>
&gt;&gt;&gt; offset=1048576), err=(5, &#39;Sanlock resource not acquired&#39;, &#39;Input/output<br>
&gt;&gt;&gt; error&#39;)&quot;<br>
&gt;&gt;&gt; 2017-02-09 15:16:26,058 INFO  (jsonrpc/1) [storage.TaskManager.Task]<br>
&gt;&gt;&gt; (Task=&#39;02cad901-5fe8-4f2d-<wbr>895b-14184f67feab&#39;) aborting: Task is aborted:<br>
&gt;&gt;&gt; &#39;Cannot obtain lock&#39; - code 651 (task:1175)<br>
&gt;&gt;&gt; 2017-02-09 15:16:26,059 ERROR (jsonrpc/1) [storage.Dispatcher] {&#39;status&#39;:<br>
&gt;&gt;&gt; {&#39;message&#39;: &#39;Cannot obtain lock: u&quot;id=60455567-ad30-42e3-a9df-<wbr>62fe86c7fd25,<br>
&gt;&gt;&gt; rc=5, out=Cannot acquire Lease(name=\&#39;SDM\&#39;,<br>
&gt;&gt;&gt; path=u\&#39;/rhev/data-center/mnt/<wbr>glusterSD/localhost:data2/<wbr>60455567-ad30-42e3-a9df-<wbr>62fe86c7fd25/dom_md/leases\&#39;,<br>
&gt;&gt;&gt; offset=1048576), err=(5, \&#39;Sanlock resource not acquired\&#39;, \&#39;Input/output<br>
&gt;&gt;&gt; error\&#39;)&quot;&#39;, &#39;code&#39;: 651}} (dispatcher:77)<br>
&gt;&gt;&gt; 2017-02-09 15:16:26,059 INFO  (jsonrpc/1) [jsonrpc.JsonRpcServer] RPC<br>
&gt;&gt;&gt; call StorageDomain.detach failed (error 651) in 23.04 seconds (__init__:515)<br>
&gt;&gt;&gt;<br>
&gt;&gt;&gt; --<br>
&gt;&gt;&gt; Doug<br>
&gt;&gt;&gt;<br>
&gt;&gt;&gt; ______________________________<wbr>_________________<br>
&gt;&gt;&gt; Users mailing list<br>
&gt;&gt;&gt; <a href="mailto:Users@ovirt.org">Users@ovirt.org</a><br>
&gt;&gt;&gt; <a href="http://lists.ovirt.org/mailman/listinfo/users" rel="noreferrer" target="_blank">http://lists.ovirt.org/<wbr>mailman/listinfo/users</a><br>
&gt;&gt;&gt;<br>
&gt;&gt;<br>
&gt;<br>
&gt;<br>
&gt;<br>
&gt; --<br>
&gt; Doug<br>
&gt;<br>
&gt; ______________________________<wbr>_________________<br>
&gt; Users mailing list<br>
&gt; <a href="mailto:Users@ovirt.org">Users@ovirt.org</a><br>
&gt; <a href="http://lists.ovirt.org/mailman/listinfo/users" rel="noreferrer" target="_blank">http://lists.ovirt.org/<wbr>mailman/listinfo/users</a><br>
&gt;<br>
</div></div></blockquote></div><br><br clear="all"><br>-- <br><div class="gmail_signature" data-smartmail="gmail_signature">Doug</div>
</div></div></div>