<div dir="ltr">I am running vdsm from packages as my interest is in developing for the engine and not vdsm.<div>I updated the vdsm package in an attempt to solve this, now I have:</div><div><div># rpm -q vdsm</div><div>vdsm-4.10.3-10.fc18.x86_64</div>
<div><br></div><div>I noticed that when the storage domain crashes I can't even do "df -h" (hangs)</div><div>I'm also getting some errors in /var/log/messages:</div><div><br></div><div><div>Mar 24 19:57:44 bufferoverflow vdsm SuperVdsmProxy WARNING Connect to svdsm failed [Errno 2] No such file or directory</div>
<div>Mar 24 19:57:45 bufferoverflow vdsm SuperVdsmProxy WARNING Connect to svdsm failed [Errno 2] No such file or directory</div><div>Mar 24 19:57:46 bufferoverflow vdsm SuperVdsmProxy WARNING Connect to svdsm failed [Errno 2] No such file or directory</div>
<div>Mar 24 19:57:47 bufferoverflow vdsm SuperVdsmProxy WARNING Connect to svdsm failed [Errno 2] No such file or directory</div><div>Mar 24 19:57:48 bufferoverflow vdsm SuperVdsmProxy WARNING Connect to svdsm failed [Errno 2] No such file or directory</div>
<div>Mar 24 19:57:49 bufferoverflow vdsm SuperVdsmProxy WARNING Connect to svdsm failed [Errno 2] No such file or directory</div><div>Mar 24 19:57:50 bufferoverflow vdsm SuperVdsmProxy WARNING Connect to svdsm failed [Errno 2] No such file or directory</div>
<div>Mar 24 19:57:51 bufferoverflow sanlock[1208]: 2013-03-24 19:57:51+0200 7412 [4759]: 1083422e close_task_aio 0 0x7ff3740008c0 busy</div><div>Mar 24 19:57:51 bufferoverflow sanlock[1208]: 2013-03-24 19:57:51+0200 7412 [4759]: 1083422e close_task_aio 1 0x7ff374000910 busy</div>
<div>Mar 24 19:57:51 bufferoverflow sanlock[1208]: 2013-03-24 19:57:51+0200 7412 [4759]: 1083422e close_task_aio 2 0x7ff374000960 busy</div><div>Mar 24 19:57:51 bufferoverflow sanlock[1208]: 2013-03-24 19:57:51+0200 7412 [4759]: 1083422e close_task_aio 3 0x7ff3740009b0 busy</div>
<div>Mar 24 19:57:51 bufferoverflow vdsm SuperVdsmProxy WARNING Connect to svdsm failed [Errno 2] No such file or directory</div><div>Mar 24 19:57:52 bufferoverflow vdsm SuperVdsmProxy WARNING Connect to svdsm failed [Errno 2] No such file or directory</div>
<div>Mar 24 19:57:53 bufferoverflow vdsm SuperVdsmProxy WARNING Connect to svdsm failed [Errno 2] No such file or directory</div><div>Mar 24 19:57:54 bufferoverflow vdsm SuperVdsmProxy WARNING Connect to svdsm failed [Errno 2] No such file or directory</div>
<div>Mar 24 19:57:55 bufferoverflow vdsm SuperVdsmProxy WARNING Connect to svdsm failed [Errno 2] No such file or directory</div><div>Mar 24 19:57:55 bufferoverflow vdsm Storage.Misc ERROR Panic: Couldn't connect to supervdsm</div>
<div>Mar 24 19:57:55 bufferoverflow respawn: slave '/usr/share/vdsm/vdsm' died, respawning slave</div><div>Mar 24 19:57:55 bufferoverflow vdsm fileUtils WARNING Dir /rhev/data-center/mnt already exists</div><div>
Mar 24 19:57:58 bufferoverflow vdsm vds WARNING Unable to load the json rpc server module. Please make sure it is installed.</div>
<div>Mar 24 19:57:58 bufferoverflow vdsm vm.Vm WARNING vmId=`4d3d81b3-d083-4569-acc2-8e631ed51843`::Unknown type found, device: '{'device': u'unix', 'alias': u'channel0', 'type': u'channel', 'address': {u'bus': u'0', u'controller': u'0', u'type': u'virtio-serial', u'port': u'1'}}' found</div>
<div>Mar 24 19:57:58 bufferoverflow vdsm vm.Vm WARNING vmId=`4d3d81b3-d083-4569-acc2-8e631ed51843`::Unknown type found, device: '{'device': u'unix', 'alias': u'channel1', 'type': u'channel', 'address': {u'bus': u'0', u'controller': u'0', u'type': u'virtio-serial', u'port': u'2'}}' found</div>
<div>Mar 24 19:57:58 bufferoverflow vdsm vm.Vm WARNING vmId=`4d3d81b3-d083-4569-acc2-8e631ed51843`::_readPauseCode unsupported by libvirt vm</div><div>Mar 24 19:57:58 bufferoverflow kernel: [ 7402.688177] ata1: hard resetting link</div>
<div>Mar 24 19:57:59 bufferoverflow kernel: [ 7402.994510] ata1: SATA link up 6.0 Gbps (SStatus 133 SControl 300)</div><div>Mar 24 19:57:59 bufferoverflow kernel: [ 7403.005510] ACPI Error: [DSSP] Namespace lookup failure, AE_NOT_FOUND (20120711/psargs-359)</div>
<div>Mar 24 19:57:59 bufferoverflow kernel: [ 7403.005517] ACPI Error: Method parse/execution failed [\_SB_.PCI0.SAT0.SPT0._GTF] (Node ffff880407c74d48), AE_NOT_FOUND (20120711/psparse-536)</div><div>Mar 24 19:57:59 bufferoverflow kernel: [ 7403.015485] ACPI Error: [DSSP] Namespace lookup failure, AE_NOT_FOUND (20120711/psargs-359)</div>
<div>Mar 24 19:57:59 bufferoverflow kernel: [ 7403.015493] ACPI Error: Method parse/execution failed [\_SB_.PCI0.SAT0.SPT0._GTF] (Node ffff880407c74d48), AE_NOT_FOUND (20120711/psparse-536)</div><div>Mar 24 19:57:59 bufferoverflow kernel: [ 7403.016061] ata1.00: configured for UDMA/133</div>
<div>Mar 24 19:57:59 bufferoverflow kernel: [ 7403.016066] ata1: EH complete</div><div>Mar 24 19:58:01 bufferoverflow sanlock[1208]: 2013-03-24 19:58:01+0200 7422 [4759]: 1083422e close_task_aio 0 0x7ff3740008c0 busy</div>
<div>Mar 24 19:58:01 bufferoverflow sanlock[1208]: 2013-03-24 19:58:01+0200 7422 [4759]: 1083422e close_task_aio 1 0x7ff374000910 busy</div><div>Mar 24 19:58:01 bufferoverflow sanlock[1208]: 2013-03-24 19:58:01+0200 7422 [4759]: 1083422e close_task_aio 2 0x7ff374000960 busy</div>
<div>Mar 24 19:58:01 bufferoverflow sanlock[1208]: 2013-03-24 19:58:01+0200 7422 [4759]: 1083422e close_task_aio 3 0x7ff3740009b0 busy</div><div>Mar 24 19:58:01 bufferoverflow kernel: [ 7405.714145] device-mapper: table: 253:0: multipath: error getting device</div>
<div>Mar 24 19:58:01 bufferoverflow kernel: [ 7405.714148] device-mapper: ioctl: error adding target to table</div><div>Mar 24 19:58:01 bufferoverflow kernel: [ 7405.715051] device-mapper: table: 253:0: multipath: error getting device</div>
<div>Mar 24 19:58:01 bufferoverflow kernel: [ 7405.715053] device-mapper: ioctl: error adding target to table</div></div><div><br></div><div>ata1 is a 500GB SSD. (only SATA device on the system except a DVD drive)</div><div>
<br></div><div>Yuval</div><div><br></div><br><div class="gmail_quote">On Sun, Mar 24, 2013 at 2:52 PM, Dan Kenigsberg <span dir="ltr"><<a href="mailto:danken@redhat.com" target="_blank">danken@redhat.com</a>></span> wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div class="im">On Fri, Mar 22, <a href="tel:2013" value="+9722013">2013</a> at 08:24:35PM +0200, Limor Gavish wrote:<br>
> Hello,<br>
><br>
> I am using Ovirt 3.2 on Fedora 18:<br>
> [wil@bufferoverflow ~]$ rpm -q vdsm<br>
> vdsm-4.10.3-7.fc18.x86_64<br>
><br>
> (the engine is built from sources).<br>
><br>
> I seem to have hit this bug:<br>
> <a href="https://bugzilla.redhat.com/show_bug.cgi?id=922515" target="_blank">https://bugzilla.redhat.com/show_bug.cgi?id=922515</a><br>
<br>
</div>This bug is only one part of the problem, but it's nasty enough that I<br>
have just suggested it as a fix to the ovirt-3.2 branch of vdsm:<br>
<a href="http://gerrit.ovirt.org/13303" target="_blank">http://gerrit.ovirt.org/13303</a><br>
<br>
Could you test if with it, vdsm relinquishes its spm role, and recovers<br>
as operational?<br>
<div class="HOEnZb"><div class="h5"><br>
><br>
> in the following configuration:<br>
> Single host (no migrations)<br>
> Created a VM, installed an OS inside (Fedora18)<br>
> stopped the VM.<br>
> created template from it.<br>
> Created an additional VM from the template using thin provision.<br>
> Started the second VM.<br>
><br>
> in addition to the errors in the logs the storage domains (both data and<br>
> ISO) crashed, i.e went to "unknown" and "inactive" states respectively.<br>
> (see the attached engine.log)<br>
><br>
> I attached the VDSM and engine logs.<br>
><br>
> is there a way to work around this problem?<br>
> It happens repeatedly.<br>
><br>
> Yuval Meir<br>
</div></div></blockquote></div><br></div></div>