Hello Markus,
Thanks for your reply. I don't think this will work in our case, because the libvirtd/VM
process has not crashed, and after unpausing, the VM runs without problems.
Can you unpause your VM in this situation, or has the process really crashed?
Jasper
________________________________________
From: Markus Stockhausen [stockhausen(a)collogia.de]
Sent: Monday, 26 October 2015 19:47
To: Jasper Siero; users(a)ovirt.org
Subject: Re: [ovirt-users] Vm suddenly paused with error "vm has paused due to
unknown storage error"
Hi Jasper,
From time to time we see similar behaviour: all of a sudden a VM pauses due to
some I/O error. However, it takes 5 months to occur. Our /var/log/libvirt/qemu/<vm>.log
shows:
qemu-system-x86_64: block.c:2806: bdrv_error_action: Assertion `error >= 0' failed.
We are currently waiting to capture the next crash. I do not know whether your error
lets you force core dumps for in-depth analysis. If it does, you should enable:
1) /usr/lib/systemd/system/libvirtd.service
LimitCORE=infinity
2) /etc/security/limits.conf
* soft core unlimited
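
After restarting the services, it may be worth checking that the process you care about
really runs with the new core limit. A minimal sketch in Python, assuming the standard
Linux /proc/<pid>/limits layout; the function names are illustrative only, and the pid of
the qemu process has to be looked up separately:

```python
from pathlib import Path


def core_limits(limits_text):
    """Return (soft, hard) core file size limits from /proc/<pid>/limits text.

    Returns None if the "Max core file size" row is not present.
    """
    for line in limits_text.splitlines():
        if line.startswith("Max core file size"):
            # Row layout: Max core file size <soft> <hard> <units>
            fields = line.split()
            return fields[4], fields[5]
    return None


def core_limits_for_pid(pid):
    """Read the limits of a running process (hypothetical helper, Linux only)."""
    return core_limits(Path(f"/proc/{pid}/limits").read_text())
```

If the soft limit is still 0, the process was most likely started before the change and
needs to be restarted for the new limit to apply.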
Markus
________________________________________
From: users-bounces(a)ovirt.org [users-bounces(a)ovirt.org] on behalf of
Jasper Siero [jasper.siero(a)target-holding.nl]
Sent: Monday, 26 October 2015 17:39
To: users(a)ovirt.org
Subject: [ovirt-users] Vm suddenly paused with error "vm has paused due to unknown
storage error"
Hi all,
Since we upgraded our oVirt nodes to CentOS 7, a VM (not a specific one, but never more than
one at a time) will sometimes pause suddenly with the error "VM ... has paused due to unknown
storage error". It now happens twice a month.
The oVirt node uses SAN storage for the VMs running on it. When a specific VM
pauses with this error, the other VMs keep running without problems.
The VM runs without problems after unpausing it.
Versions:
CentOS Linux release 7.1.1503
vdsm-4.14.17-0
libvirt-daemon-1.2.8-16
vdsm.log:
VM Channels Listener::DEBUG::2015-10-25 07:43:54,382::vmChannels::95::vds::(_handle_timeouts) Timeout on fileno 78.
libvirtEventLoop::INFO::2015-10-25 07:43:56,177::vm::4602::vm.Vm::(_onIOError) vmId=`77f07ae0-cc3e-4ae2-90ec-7fba7b11deeb`::abnormal vm stop device virtio-disk0 error eother
libvirtEventLoop::DEBUG::2015-10-25 07:43:56,178::vm::5204::vm.Vm::(_onLibvirtLifecycleEvent) vmId=`77f07ae0-cc3e-4ae2-90ec-7fba7b11deeb`::event Suspended detail 2 opaque None
libvirtEventLoop::INFO::2015-10-25 07:43:56,178::vm::4602::vm.Vm::(_onIOError) vmId=`77f07ae0-cc3e-4ae2-90ec-7fba7b11deeb`::abnormal vm stop device virtio-disk0 error eother
...........
libvirtEventLoop::INFO::2015-10-25 07:43:56,180::vm::4602::vm.Vm::(_onIOError) vmId=`77f07ae0-cc3e-4ae2-90ec-7fba7b11deeb`::abnormal vm stop device virtio-disk0 error eother
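
To see how often this happens and which VMs and disks are affected, the _onIOError entries
can be pulled out of vdsm.log. A minimal sketch in Python, assuming the entry format shown
in the excerpt above; the regular expression and the function name are illustrative, not
part of vdsm, and whitespace is matched loosely so that wrapped lines still parse:

```python
import re

# Matches one (_onIOError) entry; \s+ tolerates line wraps inside an entry.
IO_ERROR_RE = re.compile(
    r"::(?P<timestamp>\d{4}-\d{2}-\d{2}\s+\d{2}:\d{2}:\d{2},\d{3})"
    r"::vm::\d+::vm\.Vm::\(_onIOError\)\s+"
    r"vmId=`(?P<vm_id>[0-9a-f-]+)`::abnormal\s+vm\s+stop\s+device\s+"
    r"(?P<device>\S+)\s+error\s+(?P<error>\S+)"
)


def find_io_errors(log_text):
    """Return one dict (timestamp, vm_id, device, error) per I/O error entry."""
    return [m.groupdict() for m in IO_ERROR_RE.finditer(log_text)]


if __name__ == "__main__":
    # Assumed log location; adjust to where vdsm.log lives on the node.
    with open("/var/log/vdsm/vdsm.log") as log_file:
        for event in find_io_errors(log_file.read()):
            print(event["timestamp"], event["vm_id"], event["device"], event["error"])
```

Grouping the result by vm_id and device would show whether the pauses cluster on
particular disks or are spread across all VMs.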
Relevant error in the libvirt VM log:
block I/O error in device 'drive-virtio-disk0': Unknown error 32758 (32758)
...........
block I/O error in device 'drive-virtio-disk0': Unknown error 32758 (32758)
engine.log:
2015-10-25 07:44:48,945 INFO [org.ovirt.engine.core.vdsbroker.VdsUpdateRunTimeInfo] (DefaultQuartzScheduler_Worker-40) [a43dcc8] VM diataal-prod-cas1 77f07ae0-cc3e-4ae2-90ec-7fba7b11deeb moved from Up --> Paused
2015-10-25 07:44:49,003 ERROR [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (DefaultQuartzScheduler_Worker-40) [a43dcc8] Correlation ID: null, Call Stack: null, Custom Event ID: -1, Message: VM diataal-prod-cas1 has paused due to unknown storage error.
Has anyone experienced the same problem, or does anyone know a way to solve it?
Kind regards,
Jasper