[Users] Why VDSM daemon resets SATA link too often

Adam Tkáč vonsch at gmail.com
Fri Nov 2 09:42:49 UTC 2012


Hello all,

I'm experiencing strange issue on one of the nodes. There is only one
VM on the node but it generates _a lot_ of disk I/O.

Sometimes vdsm writes following error to the system log:

Nov  1 14:45:17 virtualizer respawn: slave '/usr/share/vdsm/vdsm'
died, respawning slave
Nov  1 14:45:20 virtualizer vdsm vds ERROR Unable to load the rest
server module. Please make sure it is installed.
Nov  1 14:45:20 virtualizer kernel: [105149.428697] ata1: hard resetting link
Nov  1 14:45:21 virtualizer kernel: [105149.758448] ata1: SATA link
down (SStatus 0 SControl 300)
Nov  1 14:45:21 virtualizer kernel: [105149.784996] ata1: EH complete
Nov  1 14:45:21 virtualizer vdsm vm.Vm WARNING
vmId=`6c6fc6f7-1b61-484b-9e44-0e9bb57ac1c9`::Unknown type found,
device: '{'device': u'unix', 'alias': u'channel0', 'type': u'channel',
'address': {u'bus': u'0', u'controller': u'0', u'type':
u'virtio-serial', u'port': u'1'}}' found
Nov  1 14:45:21 virtualizer kernel: [105150.159060] ata2: EH complete
Nov  1 14:45:21 virtualizer kernel: [105150.181694] ata3: hard resetting link
Nov  1 14:45:21 virtualizer kernel: [105150.510474] ata3: SATA link
down (SStatus 0 SControl 300)
<repeated for every ata* link...>
Nov  1 14:45:25 virtualizer vdsm Storage.LVM WARNING lvm vgs failed: 5
[] ['  Volume group "752f4ef6-2e3a-4cec-aad9-848a8b8b9e80" not found']
Nov  1 14:55:41 virtualizer vdsm vds ERROR connection to libvirt
broken. taking vdsm down.
Nov  1 14:55:41 virtualizer vdsm root ERROR client ('192.168.30.140', 34435)
Nov  1 14:55:42 virtualizer vdsm vm.Vm ERROR
vmId=`6c6fc6f7-1b61-484b-9e44-0e9bb57ac1c9`::Stats function failed:
<AdvancedStatsFunction _sampleNet at 0x24fc1e0>
Nov  1 14:55:42 virtualizer vdsm Storage.LVM WARNING lvm vgs failed: 5
[] ['  Volume group "035e2517-c12b-4c7f-b80e-695fec96757e" not found']
Nov  1 14:55:42 virtualizer vdsm Storage.LVM WARNING lvm vgs failed: 5
[] ['  Volume group "035e2517-c12b-4c7f-b80e-695fec96757e" not found']

The "752f4ef6-2e3a-4cec-aad9-848a8b8b9e80" is LOCALFS storage and VM
image is stored there.

After that VM becomes unresponsible and I have to kill qemu-kvm
process manually. The VM's image is stored on XFS system and after
SATA reset it takes some time before XFS responds. During this time
libvirtd daemon is not responsible and vdsm timeouts.

Is it possible to configure vdsm not to reset SATA link every time
when it starts? And if not, is it possible to increase timeout how
long vdsm waits for libvirtd response?

Thank you in advance.

Regards, Adam



More information about the Users mailing list