<div dir="ltr">Hello everyone,<div><br></div><div>I'm about to start digging through the mailing list archives in search of a solution but thought I would post to the list as well. I'm running oVirt 3.6 on a 2 node CentOS7 cluster backed by fiber channel storage and with a separate engine VM running outside of the cluster (NOT hosted-engine).</div><div><br></div><div>When I try to move the SPM role from one node to the other I get the following in the web interface:</div><div><br></div><div><blockquote style="margin:0px 0px 0px 40px;border:none;padding:0px"><div><img src="cid:ii_159b34cabd239719" alt="Inline image 1" width="544" height="65"></div></blockquote></div><div><br></div><div>When I look into /var/log/ovirt-engine/engine.log I see the following:</div><div><br></div><blockquote style="margin:0px 0px 0px 40px;border:none;padding:0px"><div><div>2017-01-18 13:35:09,332 ERROR [org.ovirt.engine.core.vdsbroker.vdsbroker.HSMGetAllTasksStatusesVDSCommand] (default task-26) [6990cfca] Failed in 'HSMGetAllTasksStatusesVDS' method</div></div><div><div>2017-01-18 13:35:09,340 ERROR [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (default task-26) [6990cfca] Correlation ID: null, Call Stack: null, Custom Event ID: -1, Message: VDSM v6 command failed: Logical Volume extend failed</div></div></blockquote><div><div><br></div><div>When I look at the task list on the host currently holding the SPM role (in this case 'v6'), using: vdsClient -s 0 getAllTasks, I see a long list like this:</div><div><br></div></div><blockquote style="margin:0px 0px 0px 40px;border:none;padding:0px"><div><div><div>dc75d3e7-cea7-449b-9a04-76fd8ef0f82b :</div></div></div><div><div><div> verb = downloadImageFromStream</div></div></div><div><div><div> code = 554</div></div></div><div><div><div> state = recovered</div></div></div><div><div><div> tag = spm</div></div></div><div><div><div> result =</div></div></div><div><div><div> message = Logical Volume extend failed</div></div></div><div><div><div> id = dc75d3e7-cea7-449b-9a04-76fd8ef0f82b</div></div></div></blockquote><div><div><br></div><div>When I look at /var/log/vdsm/vdsm.log on the host in question (v6) I see messages like this:</div><div><br></div></div><blockquote style="margin:0px 0px 0px 40px;border:none;padding:0px"><div><div>'531dd533-22b1-47a0-aae8-76c1dd7d9a56': {'code': 554, 'tag': u'spm', 'state': 'recovered', 'verb': 'downloadImageFromStreaam', 'result': '', 'message': 'Logical Volume extend failed', 'id': '531dd533-22b1-47a0-aae8-76c1dd7d9a56'}</div></div></blockquote><div><div><br></div><div>As well as the error from the attempted extend of the logical volume:</div><div><br></div></div><blockquote style="margin:0 0 0 40px;border:none;padding:0px"><div><div><div>e980df5f-d068-4c84-8aa7-9ce792690562::ERROR::2017-01-18 13:24:50,710::task::866::Storage.TaskManager.Task::(_setError) Task=`e980df5f-d068-4c84-8aa7-9ce792690562`::Unexpected error</div></div></div><div><div><div>Traceback (most recent call last):</div></div></div><div><div><div> File "/usr/share/vdsm/storage/task.py", line 873, in _run</div></div></div><div><div><div> return fn(*args, **kargs)</div></div></div><div><div><div> File "/usr/share/vdsm/storage/task.py", line 332, in run</div></div></div><div><div><div> return self.cmd(*self.argslist, **self.argsdict)</div></div></div><div><div><div> File "/usr/share/vdsm/storage/securable.py", line 77, in wrapper</div></div></div><div><div><div> return method(self, *args, **kwargs)</div></div></div><div><div><div> File "/usr/share/vdsm/storage/sp.py", line 1776, in downloadImageFromStream</div></div></div><div><div><div> .copyToImage(methodArgs, sdUUID, imgUUID, volUUID)</div></div></div><div><div><div> File "/usr/share/vdsm/storage/image.py", line 1373, in copyToImage</div></div></div><div><div><div> / volume.BLOCK_SIZE)</div></div></div><div><div><div> File "/usr/share/vdsm/storage/blockVolume.py", line 310, in extend</div></div></div><div><div><div> lvm.extendLV(self.sdUUID, self.volUUID, sizemb)</div></div></div><div><div><div> File "/usr/share/vdsm/storage/lvm.py", line 1179, in extendLV</div></div></div><div><div><div> _resizeLV("lvextend", vgName, lvName, size)</div></div></div><div><div><div> File "/usr/share/vdsm/storage/lvm.py", line 1175, in _resizeLV</div></div></div><div><div><div> raise se.LogicalVolumeExtendError(vgName, lvName, "%sM" % (size, ))</div></div></div><div><div><div>LogicalVolumeExtendError:</div></div></div><div><div><div>Logical Volume extend failed: 'vgname=ae05947f-875c-4507-ad51-62b0d35ef567 lvname=caaef597-eddd-4c24-8df2-a61f35f744f8 newsize=1M'</div></div></div><div><div><div>e980df5f-d068-4c84-8aa7-9ce792690562::DEBUG::2017-01-18 13:24:50,711::task::885::Storage.TaskManager.Task::(_run) Task=`e980df5f-d068-4c84-8aa7-9ce792690562`::Task._run: e980df5f-d068-4c84-8aa7-9ce792690562 () {} failed - stopping task</div></div></div></blockquote><div><div><br></div><div>The logical volume in question is an OVF_STORE disk that lives on one of the fiber channel backed LUNs. If I run:</div><div><br></div></div><blockquote style="margin:0 0 0 40px;border:none;padding:0px"><div><div>vdsClient -s 0 ClearTask TASK-UUID-HERE</div><div><br></div></div></blockquote>for each task that appears in the:<blockquote style="margin:0 0 0 40px;border:none;padding:0px"><div><br></div><div>vdsClient -s 0 getAllTasks </div><div><br></div></blockquote>output then they disappear and I'm able to move the SPM role to the other host.<br><div><div><div><br></div><div>This problem then crops up again on the new host once the SPM role is moved. What's going on here? Does anyone have any insight as to how to prevent this task from re-appearing? Or why it's failing in the first place?</div><div><br></div><div>Beau</div><div><br></div><br><div class="gmail_signature"><div dir="ltr"><div><span style="font-family:arial,helvetica,sans-serif;font-size:x-small;background-color:rgb(255,255,255)"><font color="#999999"><b></b></font></span></div><div><br></div></div></div>
</div></div></div>