[snip]
Looking at both "abnormal vm stop" and storage refresh events:
[snip]
I don't see any relation between refreshes and the abnormal vm stop events.
I think the key to understanding this is to enable more verbose logs
in gluster understand what was
the failure that caused the vm to stop.
Ah, ok. Thanks
It seems default gluster logs level are INFO and I can have them more verbose for a limited amount of time seeing if more information is provided.
Can I do it with VMs running and only doing sort of reload of the service or do I have to stop all to do it?
It would also help if we had detailed error logs in qemu log in
/var/log/libvirt/qemu/vm-name.log
I will find them. The system is not available to check right now
Did you enable libvirt logs? We may have more information about the error there.
You can enable logs by modifying these lines in /etc/libvirt/libvirtd.conf:
log_filters="1:qemu 1:libvirt 4:object 4:json 4:event 1:util"
log_outputs="1:file:/var/log/libvirt/libvirtd.log"
And restart libvirt.
Note that libvirt log may be huge, so you need to watch it and change
the log level or filter after you collect what you need.
To log only warning and errors use:
log_outputs="3:file:/var/log/libvirt/libvirtd.log"
Someone from gluster should help debugging this.
Nir
Ok, I could also try this way if enabling more verbose gluster logs is not sufficient.
Thanks for all detailed explanations, in the mean time.
Gianluca