[ovirt-users] Vm suddenly paused with error "vm has paused due to unknown storage error"
Jasper Siero
jasper.siero at target-holding.nl
Tue Oct 27 09:06:28 UTC 2015
Hello Markus,
Thanks for your reply. I think this will not work in our case because the libvirtd/vm process is not crashed and after unpausing the vm runs without problems.
Is it possible to unpause your vm in this situation or is the process really crashed?
Jasper
________________________________________
Van: Markus Stockhausen [stockhausen at collogia.de]
Verzonden: maandag 26 oktober 2015 19:47
Aan: Jasper Siero; users at ovirt.org
Onderwerp: AW: [ovirt-users] Vm suddenly paused with error "vm has paused due to unknown storage error"
Hi Jasper,
from time to time we see a similar behaviour. All of a sudden a VM pauses due to
some IO error. But it takes 5 months to occur. Our /var/log/libvirt/qemu/<vm>.log gives
qemu-system-x86_64: block.c:2806: bdrv_error_action: Assertion `error >= 0' failed.
Currently we are waiting to capture the next crash. I do not know if your error
allows to enforce cores for in depth analysis. If yes you should activate
1) /usr/lib/systemd/system/libvirtd.service
LimitCORE=infinity
2) /etc/security/limits.conf
* soft core unlimited
Markus
________________________________________
Von: users-bounces at ovirt.org [users-bounces at ovirt.org]" im Auftrag von "Jasper Siero [jasper.siero at target-holding.nl]
Gesendet: Montag, 26. Oktober 2015 17:39
An: users at ovirt.org
Betreff: [ovirt-users] Vm suddenly paused with error "vm has paused due to unknown storage error"
Hi all,
Since we upgraded our Ovirt nodes to CentOS 7 a vm (not a specific one but never more then one) will sometimes pause suddenly with the error "VM ... has paused due to unknown storage error". It happens now two times in a month.
The Ovirt node uses san storage for the vm's running on it. When a specific vm is pausing with an error the other vm's keeps running without problems.
The vm runs without problems after unpausing it.
Versions:
CentOS Linux release 7.1.1503
vdsm-4.14.17-0
libvirt-daemon-1.2.8-16
vdsm.log:
VM Channels Listener::DEBUG::2015-10-25 07:43:54,382::vmChannels::95::vds::(_handle_timeouts) Timeout on fileno 78.
libvirtEventLoop::INFO::2015-10-25 07:43:56,177::vm::4602::vm.Vm::(_onIOError) vmId=`77f07ae0-cc3e-4ae2-90ec-7fba7b11deeb`::abnormal vm stop device virtio-disk0 error eother
libvirtEventLoop::DEBUG::2015-10-25 07:43:56,178::vm::5204::vm.Vm::(_onLibvirtLifecycleEvent) vmId=`77f07ae0-cc3e-4ae2-90ec-7fba7b11deeb`::event Suspended detail 2 opaque None
libvirtEventLoop::INFO::2015-10-25 07:43:56,178::vm::4602::vm.Vm::(_onIOError) vmId=`77f07ae0-cc3e-4ae2-90ec-7fba7b11deeb`::abnormal vm stop device virtio-disk0 error eother
...........
libvirtEventLoop::INFO::2015-10-25 07:43:56,180::vm::4602::vm.Vm::(_onIOError) vmId=`77f07ae0-cc3e-4ae2-90ec-7fba7b11deeb`::abnormal vm stop device virtio-disk0 error eother
specific error part in libvirt vm log:
block I/O error in device 'drive-virtio-disk0': Unknown error 32758 (32758)
...........
block I/O error in device 'drive-virtio-disk0': Unknown error 32758 (32758)
engine.log:
2015-10-25 07:44:48,945 INFO [org.ovirt.engine.core.vdsbroker.VdsUpdateRunTimeInfo] (DefaultQuartzScheduler_Worker-40) [a43dcc8] VM diataal-prod-cas1 77f07ae0-cc3e-4ae2-90ec-7fba7b11deeb moved from
Up --> Paused
2015-10-25 07:44:49,003 ERROR [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (DefaultQuartzScheduler_Worker-40) [a43dcc8] Correlation ID: null, Call Stack: null, Custom Event
ID: -1, Message: VM diataal-prod-cas1 has paused due to unknown storage error.
Has anyone experienced the same problem or knows a way to solve this?
Kind regards,
Jasper
_______________________________________________
Users mailing list
Users at ovirt.org
http://lists.ovirt.org/mailman/listinfo/users
More information about the Users
mailing list