[ovirt-users] Snapshot is "Locked" - Can't do anything with the VM
Ivo Rütsche
lists.ovirt.org at direct.ruetsche.com
Wed May 18 10:50:06 EDT 2016
Hi Nathanaël
I tried it today, and the snapshot was in the state "OK" after I executed
the script. I tried to remove the snapshot again, but nothing ever
happens. After I click "Delete", I have this situation:
Tasks(1):
- (Hourglass) Removing Snapshot Auto-generated for Live Storage
Migration of VM xxxx
-- (Completed) Validating
-- (Hourglass) Executing
The snapshot is in the status "Locked".
This status remains forever.
When I restart the engine, the snapshot is still in the status "Locked",
but the task no longer exists.
engine.log:
2016-05-18 16:33:07,532 INFO
[org.ovirt.engine.core.bll.RemoveSnapshotCommand] (default task-23)
[53281b57] Lock Acquired to object
'EngineLock:{exclusiveLocks='[2acfe3fa-be69-46da-9add-c748b2bd6034=<VM,
ACTION_TYPE_FAILED_OBJECT_LOCKED>]', sharedLocks='null'}'
2016-05-18 16:33:08,640 INFO
[org.ovirt.engine.core.bll.RemoveSnapshotCommand] (default task-23)
[53281b57] Running command: RemoveSnapshotCommand internal: false.
Entities affected : ID: 2acfe3fa-be69-46da-9add-c748b2bd6034 Type:
VMAction group MANIPULATE_VM_SNAPSHOTS with role type USER
2016-05-18 16:33:08,984 INFO
[org.ovirt.engine.core.bll.RemoveSnapshotCommand] (default task-23)
[53281b57] Lock freed to object
'EngineLock:{exclusiveLocks='[2acfe3fa-be69-46da-9add-c748b2bd6034=<VM,
ACTION_TYPE_FAILED_OBJECT_LOCKED>]', sharedLocks='null'}'
2016-05-18 16:33:10,079 INFO
[org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector]
(default task-23) [] Correlation ID: 53281b57, Job ID:
58b18e63-2b7f-480c-84ea-260508aa8798, Call Stack: null, Custom Event ID:
-1, Message: Snapshot 'Auto-generated for Live Storage Migration'
deletion for VM 'xxxx' was initiated by admin at internal.
....but nothing else....
On the node side, I can't find anything in the logs (or I am looking in
the wrong place).
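A minimal sketch for following one request through the engine log, assuming the engine writes to /var/log/ovirt-engine/engine.log (the usual default; adjust if the installation differs). The function name trace_correlation is hypothetical, and 53281b57 is the correlation ID from the excerpt above.

```shell
# Print every line of a log file that mentions a given correlation ID.
# Usage: trace_correlation ID [LOGFILE]
trace_correlation() {
    id=$1
    # Assumed default location of the engine log; pass another path to override.
    log=${2:-/var/log/ovirt-engine/engine.log}
    if [ -z "$id" ]; then
        echo "usage: trace_correlation ID [LOGFILE]" >&2
        return 2
    fi
    # -F matches the ID as a fixed string rather than a regular expression.
    grep -F -- "$id" "$log"
}
```

Calling `trace_correlation 53281b57` on the engine host would list everything logged for this delete request. The same function can be pointed at /var/log/vdsm/vdsm.log on a host, though whether the engine's correlation ID shows up there may depend on the version.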
When the VM is not running, I can remove the snapshot without any
problems. The VM images are on an NFS export from a Solaris (ZFS) filer.
Regards, Ivo
On 04.05.2016 13:40, Nathanaël Blanchet wrote:
> Unlock procedure
>
> * QUERY : -q
>
> PGPASSWORD=XXXXX /usr/share/ovirt-engine/setup/dbutils/unlock_entity.sh -q -t snapshot -u engine
> 296c010e-3c1d-4008-84b3-5cd39cff6aa1 | 525a4dda-dbbb-4872-a5f1-8ac2aed48392
>
> * REMOVE
>
> PGPASSWORD=XXXXX /usr/share/ovirt-engine/setup/dbutils/unlock_entity.sh -t snapshot -u engine 525a4dda-dbbb-4872-a5f1-8ac2aed48392
>
> Ref :
> http://lists.ovirt.org/pipermail/users/2015-November/035686.html
>
> On 03/05/2016 21:17, Ivo Rütsche wrote:
>>
>> I just did a full clone of the whole VM; the clone is running fine
>> and has the latest data on all disks.
>>
>> Regards, Ivo
>>
>>
>> On 03.05.2016 20:12, Markus Stockhausen wrote:
>>> Hi,
>>>
>>> we've seen a lot of trouble when deleting snapshots of multi-disk VMs.
>>> Endlessly running snapshot deletions in particular are one of our culprits.
>>> More here: https://bugzilla.redhat.com/show_bug.cgi?id=1319400
>>>
>>> Markus
>>> ________________________________________
>>> From: users-bounces at ovirt.org [users-bounces at ovirt.org] on
>>> behalf of Ivo Rütsche [lists.ovirt.org at direct.ruetsche.com]
>>> Sent: Tuesday, 3 May 2016 19:23
>>> To: users at ovirt.org
>>> Subject: [ovirt-users] Snapshot is "Locked" - Can't do anything with
>>> the VM
>>>
>>> Hi oVirt List
>>>
>>> We have the problem that two of our VMs have "Locked" snapshots and
>>> we can't go forward. Of the 10 VMs which we moved from one store to
>>> another, we had problems deleting the snapshots on 5 of them. But with
>>> a lot of luck, the snapshot was always on the dedicated swap disk, so
>>> we just had to remove the whole disk and create it again as new.
>>>
>>> Because we are replacing all the filers, we have to move the VMs
>>> around. The move itself runs fine, but deleting the snapshots
>>> sometimes doesn't work.
>>>
>>> I found a lot about our problem, but I can't find any solution which
>>> works in our environment.
>>>
>>> Our setup:
>>> - 3 Solaris filers with NFS exports to the hosts for the VM images
>>> - 8 hosts (oVirt 3.6 / CentOS 7.2.1511)
>>> - The engine host is located outside of the DC, connected via VPN
>>>
>>> At the moment, one of the problem VMs is in production; the other
>>> should go into production in the next days, so we are focusing on the
>>> 2nd VM, because we can play around with it. The other one is started
>>> up, but I have no idea what happens when I shut it down...
>>>
>>> This is today's status of the VM which is not in production at the moment:
>>> - VM Status: Down
>>> - Snapshots OK: 2
>>> - Snapshots Locked: 1
>>> - In the snapshot tab, everything is grayed out (Create, Preview,
>>> Commit, Undo etc.)
>>> - No active tasks are displayed
>>>
>>> Error message when I try to start the VM:
>>> "Cannot run VM. The VM is performing an operation on a Snapshot. Please
>>> wait for the operation to finish, and try again."
>>>
>>> Something about the history of this VM:
>>> 1. We started the disk move from filer1 to filer3, 4 drives in total;
>>> everything was fine
>>> 2. Deleting snapshots: I think one or two ran fine
>>> 3. Deleting the 2nd or 3rd snapshot ran for two days without an end
>>> 4. Rebooted the engine
>>> 5. No task anymore in the Tasks list
>>> 6. The status of the snapshot is "Locked", the disk of this snapshot
>>> becomes "Illegal" and red
>>> 7. Shut down the VM - no luck, I can't start it again --> "Cannot run
>>> VM. The VM is performing an operation on a Snapshot. Please wait for
>>> the operation to finish, and try again."
>>> 8. We moved the disk back from filer3 to filer1
>>> 9. The status of the snapshot is "Locked", the disk of this snapshot
>>> becomes "OK" and green
>>> ...but I can't delete the snapshot or start the VM :(
>>>
>>> Today I copied the image files to a dedicated KVM host, which is not
>>> involved in the oVirt environment, and merged the image manually:
>>> qemu-img convert -p -f qcow2 804ffcc3-446f-4c56-ba99-ab90c7951a6f -O raw Merged1
>>>
>>> I mounted the image and everything is fine; all the changes from the
>>> last two days are now in a base image. I tried to remove the whole
>>> disk in oVirt, but I still got the error message that I have to wait.
>>> So for me, the next step is to create a new VM and migrate everything
>>> manually. This will work, but it can't be the solution for the other
>>> VM, which is in production and has the same problem.
>>>
>>>
>>>
>>> Some more information:
>>>
>>> *****************
>>> [root at lin-ovirt7 images]# vdsm-tool dump-volume-chains d6e6656d-7aaf-4c5b-bb93-cd6134ed9654
>>>
>>> Images volume chains (base volume first)
>>>
>>> image: f3cfefd2-d467-41d5-a054-5b01f81a21b4
>>>
>>> - a018521a-bd3a-4322-ac79-52d15e95639e
>>> status: OK, voltype: INTERNAL, format: RAW, legality: LEGAL, type: SPARSE
>>>
>>> - 2e2f0b4a-b015-4b50-8d10-51726de460a9
>>> status: OK, voltype: INTERNAL, format: COW, legality: LEGAL, type: SPARSE
>>>
>>> - 804ffcc3-446f-4c56-ba99-ab90c7951a6f
>>> status: OK, voltype: LEAF, format: COW, legality: LEGAL, type: SPARSE
>>> *****************
>>> [root at lin-ovirt7 images]# ls -altrh f3cfefd2-d467-41d5-a054-5b01f81a21b4/
>>> total 34G
>>> -rw-rw----+ 1 vdsm kvm 1.0M 2. Mai 21:03 a018521a-bd3a-4322-ac79-52d15e95639e.lease
>>> -rw-rw----+ 1 vdsm kvm 1.0M 2. Mai 21:03 2e2f0b4a-b015-4b50-8d10-51726de460a9.lease
>>> -rw-r--r--+ 1 vdsm kvm 260 2. Mai 21:03 804ffcc3-446f-4c56-ba99-ab90c7951a6f.meta
>>> -rw-rw----+ 1 vdsm kvm 1.0M 2. Mai 21:03 804ffcc3-446f-4c56-ba99-ab90c7951a6f.lease
>>> -rw-rw----+ 1 vdsm kvm 300G 2. Mai 22:26 a018521a-bd3a-4322-ac79-52d15e95639e
>>> -rw-rw----+ 1 vdsm kvm 77M 2. Mai 22:27 2e2f0b4a-b015-4b50-8d10-51726de460a9
>>> -rw-rw----+ 1 vdsm kvm 3.5G 2. Mai 22:34 804ffcc3-446f-4c56-ba99-ab90c7951a6f
>>> -rw-r--r--+ 1 vdsm kvm 364 2. Mai 22:34 a018521a-bd3a-4322-ac79-52d15e95639e.meta
>>> -rw-r--r--+ 1 vdsm kvm 264 2. Mai 22:34 2e2f0b4a-b015-4b50-8d10-51726de460a9.meta
>>> drwxr-xr-x+ 2 vdsm kvm 11 2. Mai 22:34 .
>>> drwxr-xr-x+ 50 vdsm kvm 50 3. Mai 14:30 ..
>>> [root at lin-ovirt7 images]#
>>> *****************
>>>
>>> I attach some lines from the engine.log; maybe someone can help. If
>>> you need more information, everything is here.
>>>
>>> Regards, Ivo
>>>
>>>
>>>
>>
>
> --
> Nathanaël Blanchet
>
> Supervision réseau
> Pôle Infrastrutures Informatiques
> 227 avenue Professeur-Jean-Louis-Viala
> 34193 MONTPELLIER CEDEX 5
> Tél. 33 (0)4 67 54 84 55
> Fax 33 (0)4 67 54 84 14
> blanchet at abes.fr