<div dir="ltr"><br><br><div class="gmail_quote"><div dir="ltr">On Thu, Jan 25, 2018 at 1:29 PM Nir Soffer <<a href="mailto:nsoffer@redhat.com">nsoffer@redhat.com</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><p dir="ltr">The error shows that the host was not connected to the pool. Do we check that the host is connected before trying to upload to the host?</p></blockquote><div>Yes, we filter out non-active hosts.</div><div> </div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
<br><div class="gmail_quote"><div dir="ltr">On Thu, Jan 25, 2018 at 10:33, Daniel Erez <<a href="mailto:derez@redhat.com" target="_blank">derez@redhat.com</a>> wrote:<br></div></div><div class="gmail_quote"><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir="ltr"><div class="gmail_quote"><div dir="ltr">On Wed, Jan 24, 2018 at 11:28 PM Dmitry Semenov <<a href="mailto:zend0@ya.ru" target="_blank">zend0@ya.ru</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">24.01.2018, 10:58, "Yaniv Kaul" <<a href="mailto:ykaul@redhat.com" target="_blank">ykaul@redhat.com</a>>:<br>
> On Tue, Jan 23, 2018 at 10:39 PM, Dmitry Semenov <<a href="mailto:zend0@ya.ru" target="_blank">zend0@ya.ru</a>> wrote:<br>
>> When uploading a disk image (via the web interface) in cluster01 to storage01 or storage02, everything works.<br>
>> When uploading a disk image (via the web interface) in cluster02 to storage03 or storage04, the problem occurs: the image isn't uploaded and the process stops at the stage "Paused by System" (at the same time, uploading directly through the API works without problems).<br>
>><br>
>> screenshot: <a href="https://yadi.sk/i/9WtkDlT23Riqxp" rel="noreferrer" target="_blank">https://yadi.sk/i/9WtkDlT23Riqxp</a><br>
>><br>
>> Logs are attached (engine.log): <a href="https://pastebin.com/54k5j7hC" rel="noreferrer" target="_blank">https://pastebin.com/54k5j7hC</a><br>
><br>
> Can you also share vdsm logs, at least from 01c04x09.unix.local ? It seems to have failed there.<br>
> Y.<br>
<br>
Here is a link to the vdsm.log file from 01c04x09.unix.local: <a href="https://pastebin.com/KiqYSYnP" rel="noreferrer" target="_blank">https://pastebin.com/KiqYSYnP</a></blockquote><div><br></div></div></div><div dir="ltr"><div class="gmail_quote"><div>According to the log [1], there was an error on <span style="white-space:pre-wrap">prepareImage invocation.</span></div><div><span style="white-space:pre-wrap">It seems the storage pool wasn't found in the vdsm cache, though according to the engine log [2]</span></div><div><span style="white-space:pre-wrap">the engine sends the correct pool id ('</span><span style="white-space:pre-wrap">dedc6e8b-30e3-42d1-86f4-a130110f31b1').</span></div><div><br></div><div><span style="white-space:pre-wrap">@Nir - what do you think? How could the pool be missing from the host? A cache/connection issue?</span></div><div><br></div><div>[1]</div><div><pre style="word-wrap:break-word;white-space:pre-wrap">2018-01-23 23:16:44,005+0300 INFO (jsonrpc/5) [vdsm.api] START prepareImage(sdUUID=u'629ab576-638a-4c6f-b9c4-8d5cda64f9b2', spUUID=u'dedc6e8b-30e3-42d1-86f4-a130110f31b1', imgUUID=u'636c67ca-baec-4b32-be36-c3fb0ba24e83', leafUUID=u'f03fc310-7a09-4e23-a196-7dde70a1f616', allowIllegal=True) from=::ffff:10.65.35.10,51672, flow_id=0c170481-1518-426a-90c9-9ad7be27b926, task_id=345c9223-4425-49c3-981a-5f6a7d2dbee0 (api:46)
2018-01-23 23:16:44,006+0300 INFO (jsonrpc/5) [vdsm.api] FINISH prepareImage error=Unknown pool id, pool not connected: (u'dedc6e8b-30e3-42d1-86f4-a130110f31b1',) from=::ffff:10.65.35.10,51672, flow_id=0c170481-1518-426a-90c9-9ad7be27b926, task_id=345c9223-4425-49c3-981a-5f6a7d2dbee0 (api:50)
2018-01-23 23:16:44,006+0300 ERROR (jsonrpc/5) [storage.TaskManager.Task] (Task='345c9223-4425-49c3-981a-5f6a7d2dbee0') Unexpected error (task:875)
Traceback (most recent call last):
  File "/usr/lib/python2.7/site-packages/vdsm/storage/task.py", line 882, in _run
    return fn(*args, **kargs)
  File "<string>", line 2, in prepareImage
  File "/usr/lib/python2.7/site-packages/vdsm/common/api.py", line 48, in method
    ret = func(*args, **kwargs)
  File "/usr/lib/python2.7/site-packages/vdsm/storage/hsm.py", line 3151, in prepareImage
    self.getPool(spUUID)
  File "/usr/lib/python2.7/site-packages/vdsm/storage/hsm.py", line 349, in getPool
    raise se.StoragePoolUnknown(spUUID)
StoragePoolUnknown: Unknown pool id, pool not connected: (u'dedc6e8b-30e3-42d1-86f4-a130110f31b1',)
2018-01-23 23:16:44,006+0300 INFO (jsonrpc/5) [storage.TaskManager.Task] (Task='345c9223-4425-49c3-981a-5f6a7d2dbee0') aborting: Task is aborted: "Unknown pool id, pool not connected: (u'dedc6e8b-30e3-42d1-86f4-a130110f31b1',)" - code 309 (task:1181)
2018-01-23 23:16:44,007+0300 ERROR (jsonrpc/5) [storage.Dispatcher] FINISH prepareImage error=Unknown pool id, pool not connected: (u'dedc6e8b-30e3-42d1-86f4-a130110f31b1',) (dispatcher:82)
2018-01-23 23:16:44,007+0300 INFO (jsonrpc/5) [jsonrpc.JsonRpcServer] RPC call Image.prepare failed (error 309) in 0.00 seconds (__init__:573)</pre></div><div>[2]</div><div><span style="white-space:pre-wrap">2018-01-23 23:16:38,408+03 INFO [org.ovirt.engine.core.vdsbroker.irsbroker.CreateImageVDSCommand] (default task-38) [0c170481-1518-426a-90c9-9ad7be27b926] START, CreateImageVDSCommand( CreateImageVDSCommandParameters:{storagePoolId='dedc6e8b-30e3-42d1-86f4-a130110f31b1', ignoreFailoverLimit='false', storageDomainId='629ab576-638a-4c6f-b9c4-8d5cda64f9b2', imageGroupId='636c67ca-baec-4b32-be36-c3fb0ba24e83', imageSizeInBytes='23622320128', volumeFormat='COW', newImageId='f03fc310-7a09-4e23-a196-7dde70a1f616', imageType='Sparse', newImageDescription='{"DiskAlias":"aaaaaaaaaaaaaaaaaaaa","DiskDescription":""}', imageInitialSizeInBytes='1068367872'}), log id: 7666c2e4</span> </div></div></div><div dir="ltr"><div class="gmail_quote"><div><br></div><div><br></div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><br>
<br>
And here is what I noticed: the server 01c04x09.unix.local is in cluster01, but I am uploading the image file to storage02 (cluster02).<br>
<br>
><br>
>> image size: ~1.3 GB<br>
>><br>
>> my scheme:<br>
>><br>
>> data_center_01<br>
>> cluster01<br>
>> host01 \<br>
>> host02 - storage01, storage02<br>
>> host03 /<br>
>><br>
>> cluster02<br>
>> host04 \<br>
>> host05 - storage03, storage04<br>
>> host06 /<br>
>><br>
>> HostedEngine in cluster01<br>
>> oVirt: Version 4.2.0.2-1.el7.centos<br>
>><br>
>> --<br>
>> Best regards,<br>
<br>
-- <br>
Best regards,<br>
</blockquote></div></div></blockquote></div></blockquote></div></div>
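<div>For anyone following a similar failure, the vdsm.log excerpt in [1] can be matched against the engine log in [2] by flow_id. The snippet below is only a minimal sketch: the regular expression and the helper name failed_vdsm_calls are hypothetical, and the line layout is assumed from the two [vdsm.api] lines quoted above, not taken from any vdsm documentation. It lists the API calls that finished with an error together with their flow_id.</div>

```python
import re

# Assumed layout, inferred only from the vdsm.log excerpt quoted in [1]:
#   TIMESTAMP INFO (thread) [vdsm.api] FINISH verb error=MESSAGE from=CLIENT, flow_id=ID, ...
FINISH_ERROR = re.compile(
    r"^(\S+ \S+) INFO \([^)]+\) \[vdsm\.api\] "
    r"FINISH (\w+) error=(.*?) from=\S+, flow_id=([0-9a-f-]+)"
)


def failed_vdsm_calls(lines):
    """Yield one dict per [vdsm.api] call that finished with an error.

    Hypothetical helper for illustration; not part of vdsm or the engine.
    """
    for line in lines:
        match = FINISH_ERROR.match(line)
        if match:
            timestamp, verb, error, flow_id = match.groups()
            yield {
                "timestamp": timestamp,
                "verb": verb,
                "error": error,
                "flow_id": flow_id,
            }


# The two [vdsm.api] lines from excerpt [1], copied verbatim.
SAMPLE = """\
2018-01-23 23:16:44,005+0300 INFO (jsonrpc/5) [vdsm.api] START prepareImage(sdUUID=u'629ab576-638a-4c6f-b9c4-8d5cda64f9b2', spUUID=u'dedc6e8b-30e3-42d1-86f4-a130110f31b1', imgUUID=u'636c67ca-baec-4b32-be36-c3fb0ba24e83', leafUUID=u'f03fc310-7a09-4e23-a196-7dde70a1f616', allowIllegal=True) from=::ffff:10.65.35.10,51672, flow_id=0c170481-1518-426a-90c9-9ad7be27b926, task_id=345c9223-4425-49c3-981a-5f6a7d2dbee0 (api:46)
2018-01-23 23:16:44,006+0300 INFO (jsonrpc/5) [vdsm.api] FINISH prepareImage error=Unknown pool id, pool not connected: (u'dedc6e8b-30e3-42d1-86f4-a130110f31b1',) from=::ffff:10.65.35.10,51672, flow_id=0c170481-1518-426a-90c9-9ad7be27b926, task_id=345c9223-4425-49c3-981a-5f6a7d2dbee0 (api:50)
"""

for call in failed_vdsm_calls(SAMPLE.splitlines()):
    print(call["flow_id"], call["verb"], call["error"])
```

<div>The flow_id printed for the failed prepareImage call is the same value that appears in square brackets in the engine log line in [2], which is how the two logs were matched in this thread. To scan a whole file instead of the sample, pass an open file object to failed_vdsm_calls.</div>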