Hi Lionel,

Sorry for the delay in replying.

If it's possible from your side, syncing the data and destroying the old disk sounds about right.

In addition, it seems you're hitting this bug: https://bugzilla.redhat.com/show_bug.cgi?id=1509629
It was fixed in version 4.1.9 and above.
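
A quick way to check whether a given engine version already carries that fix is sketched below. It is only an illustration: `parse_version` and `has_fix` are hypothetical helper names, and the sketch assumes the part after the dash (the "-1" in 4.1.7.6-1) is a package release suffix that can be ignored for the comparison.

```python
def parse_version(text):
    """Turn '4.1.7.6-1' into (4, 1, 7, 6); the '-1' suffix is assumed to be a release tag."""
    upstream = text.split("-", 1)[0]
    return tuple(int(part) for part in upstream.split("."))


def has_fix(installed, fixed_in="4.1.9"):
    """Return True when the installed version is at or above the fixed version."""
    # Tuples compare element by element, so (4, 1, 7, 6) sorts before (4, 1, 9).
    return parse_version(installed) >= parse_version(fixed_in)


if __name__ == "__main__":
    # The engine version reported in this thread.
    print(has_fix("4.1.7.6-1"))
```

For the 4.1.7.6-1 engine reported in this thread the check comes back negative, so the upgrade would still be needed.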


Regards,
Shani Leviim

On Mon, Feb 26, 2018 at 4:18 PM, Lionel Caignec <caignec@cines.fr> wrote:
OK, so I'll reply to myself.

Version is 4.1.7.6-1

I just manually deleted a previously created snapshot. But this is an I/O-intensive VM with big disks (2.5 TB and 5 TB).

As for the log, I cannot paste all of it on a public list for security reasons; I will send you the full log privately.
Here is an extract relevant to my error:
engine.log-20180210:2018-02-09 23:00:03,200+01 INFO  [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (default task-312) [44402a8c-3196-43f0-ba33-307ea78e6f49] EVENT_ID: USER_CREATE_SNAPSHOT(45), Correlation ID: 44402a8c-3196-43f0-ba33-307ea78e6f49, Job ID: 030cd310-fec9-4a89-8c3f-7888504fe973, Call Stack: null, Custom ID: null, Custom Event ID: -1, Message: Snapshot 'AUTO_7D_zz_nil_20180209_220002' creation for VM 'zz_nil' was initiated by snap_user@internal.
engine.log-20180210:2018-02-09 23:01:06,578+01 INFO  [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (DefaultQuartzScheduler5) [] EVENT_ID: USER_CREATE_SNAPSHOT_FINISHED_SUCCESS(68), Correlation ID: 44402a8c-3196-43f0-ba33-307ea78e6f49, Job ID: 030cd310-fec9-4a89-8c3f-7888504fe973, Call Stack: null, Custom ID: null, Custom Event ID: -1, Message: Snapshot 'AUTO_7D_zz_nil_20180209_220002' creation for VM 'zz_nil' has been completed.
engine.log-20180220:2018-02-19 17:01:23,800+01 INFO  [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (default task-113) [] EVENT_ID: USER_REMOVE_SNAPSHOT(342), Correlation ID: 06a9efa4-1b80-4021-bf3e-41ecebe58a88, Job ID: c9a918a7-b00c-43cf-b6de-3659ac0765da, Call Stack: null, Custom ID: null, Custom Event ID: -1, Message: Snapshot 'AUTO_7D_zz_nil_20180209_220002' deletion for VM 'zz_nil' was initiated by acaignec@ldap-cines-authz.
engine.log-20180221:2018-02-20 22:24:45,174+01 ERROR [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (DefaultQuartzScheduler6) [06a9efa4-1b80-4021-bf3e-41ecebe58a88] EVENT_ID: USER_REMOVE_SNAPSHOT_FINISHED_FAILURE(357), Correlation ID: 06a9efa4-1b80-4021-bf3e-41ecebe58a88, Job ID: c9a918a7-b00c-43cf-b6de-3659ac0765da, Call Stack: null, Custom ID: null, Custom Event ID: -1, Message: Failed to delete snapshot 'AUTO_7D_zz_nil_20180209_220002' for VM 'zz_nil'.
 2018-02-20 22:24:46,266+01 INFO  [org.ovirt.engine.core.bll.tasks.SPMAsyncTask] (DefaultQuartzScheduler3) [516079c3] SPMAsyncTask::PollTask: Polling task '34137342-4f30-476d-b16c-1cb7e0ea0ac0' (Parent Command 'DestroyImage', Parameters Type 'org.ovirt.engine.core.common.asynctasks.AsyncTaskParameters') returned status 'finished', result 'success'.
2018-02-20 22:24:46,267+01 INFO  [org.ovirt.engine.core.bll.tasks.SPMAsyncTask] (DefaultQuartzScheduler3) [516079c3] BaseAsyncTask::onTaskEndSuccess: Task '34137342-4f30-476d-b16c-1cb7e0ea0ac0' (Parent Command 'DestroyImage', Parameters Type 'org.ovirt.engine.core.common.asynctasks.AsyncTaskParameters') ended successfully.
2018-02-20 22:24:46,268+01 INFO  [org.ovirt.engine.core.bll.tasks.CommandAsyncTask] (DefaultQuartzScheduler3) [516079c3] CommandAsyncTask::endActionIfNecessary: All tasks of command 'fe8c91f2-386b-4b3f-bbf3-aeda8e9244c6' has ended -> executing 'endAction'
2018-02-20 22:24:46,268+01 INFO  [org.ovirt.engine.core.bll.tasks.CommandAsyncTask] (DefaultQuartzScheduler3) [516079c3] CommandAsyncTask::endAction: Ending action for '1' tasks (command ID: 'fe8c91f2-386b-4b3f-bbf3-aeda8e9244c6'): calling endAction '.
2018-02-20 22:24:46,268+01 INFO  [org.ovirt.engine.core.bll.tasks.CommandAsyncTask] (org.ovirt.thread.pool-6-thread-20) [516079c3] CommandAsyncTask::endCommandAction [within thread] context: Attempting to endAction 'DestroyImage',
2018-02-20 22:24:46,269+01 ERROR [org.ovirt.engine.core.bll.tasks.CommandAsyncTask] (org.ovirt.thread.pool-6-thread-20) [516079c3] [within thread]: endAction for action type DestroyImage threw an exception.: java.lang.NullPointerException
        at org.ovirt.engine.core.bll.tasks.CoCoAsyncTaskHelper.endAction(CoCoAsyncTaskHelper.java:335) [bll.jar:]
        at org.ovirt.engine.core.bll.tasks.CommandCoordinatorImpl.endAction(CommandCoordinatorImpl.java:340) [bll.jar:]
        at org.ovirt.engine.core.bll.tasks.CommandAsyncTask.endCommandAction(CommandAsyncTask.java:154) [bll.jar:]
        at org.ovirt.engine.core.bll.tasks.CommandAsyncTask.lambda$endActionIfNecessary$0(CommandAsyncTask.java:106) [bll.jar:]
        at org.ovirt.engine.core.utils.threadpool.ThreadPoolUtil$InternalWrapperRunnable.run(ThreadPoolUtil.java:84) [utils.jar:]
        at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) [rt.jar:1.8.0_161]
        at java.util.concurrent.FutureTask.run(FutureTask.java:266) [rt.jar:1.8.0_161]
        at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) [rt.jar:1.8.0_161]
        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) [rt.jar:1.8.0_161]
        at java.lang.Thread.run(Thread.java:748) [rt.jar:1.8.0_161]
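
To follow one operation across rotated engine.log files, the audit lines can be grouped by their Correlation ID. This is a minimal sketch that assumes the line layout shown in the extract above; the regular expression and the `group_by_correlation` helper are illustrative and not part of oVirt, and the sample lines are truncated after the Correlation ID.

```python
import re
from collections import defaultdict

# Field layout taken from the AuditLogDirector lines in the extract above.
EVENT_RE = re.compile(
    r"(?P<ts>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}),\d+\+\d+\s+"
    r"(?P<level>[A-Z]+)\s+.*?"
    r"EVENT_ID: (?P<event>\w+)\(\d+\), "
    r"Correlation ID: (?P<corr>[0-9a-f-]+)"
)


def group_by_correlation(lines):
    """Map each Correlation ID to its (timestamp, level, event) tuples, in input order."""
    groups = defaultdict(list)
    for line in lines:
        match = EVENT_RE.search(line)
        if match:  # lines without an EVENT_ID (stack traces, task polling) are skipped
            groups[match["corr"]].append(
                (match["ts"], match["level"], match["event"])
            )
    return dict(groups)


LOGGER = "[org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector]"
# Lines copied from the extract above, truncated after the Correlation ID.
sample = [
    "engine.log-20180220:2018-02-19 17:01:23,800+01 INFO  " + LOGGER
    + " (default task-113) [] EVENT_ID: USER_REMOVE_SNAPSHOT(342),"
    " Correlation ID: 06a9efa4-1b80-4021-bf3e-41ecebe58a88,",
    "engine.log-20180221:2018-02-20 22:24:45,174+01 ERROR " + LOGGER
    + " (DefaultQuartzScheduler6) [06a9efa4-1b80-4021-bf3e-41ecebe58a88]"
    " EVENT_ID: USER_REMOVE_SNAPSHOT_FINISHED_FAILURE(357),"
    " Correlation ID: 06a9efa4-1b80-4021-bf3e-41ecebe58a88,",
]

if __name__ == "__main__":
    events = group_by_correlation(sample)["06a9efa4-1b80-4021-bf3e-41ecebe58a88"]
    for ts, level, event in events:
        print(ts, level, event)
```

Grouping this way puts the removal request of 2018-02-19 next to its failure of 2018-02-20, even though the two lines sit in different rotated log files.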

----- Original Message -----
From: "Shani Leviim" <sleviim@redhat.com>
To: "Lionel Caignec" <caignec@cines.fr>
Sent: Monday, 26 February 2018 14:42:38
Subject: Re: [ovirt-users] Ghost Snapshot Disk

Yes, please.
Can you detail a bit more regarding the actions you've done?

I'm assuming that since the snapshot had no description, trying to operate
on it caused the NullPointerException you got.
But I want to examine what caused that.

Also, can you please reply to the list?



*Regards,*

*Shani Leviim*

On Mon, Feb 26, 2018 at 3:37 PM, Lionel Caignec <caignec@cines.fr> wrote:

> Version is 4.1.7.6-1
>
> Do you want the log from the day I deleted the snapshot?
>
> ----- Original Message -----
> From: "Shani Leviim" <sleviim@redhat.com>
> To: "Lionel Caignec" <caignec@cines.fr>
> Cc: "users" <users@ovirt.org>
> Sent: Monday, 26 February 2018 14:29:16
> Subject: Re: [ovirt-users] Ghost Snapshot Disk
>
> Hi,
>
> What is your engine version, please?
> I'm trying to reproduce your steps to better understand what the
> cause of that error is. Therefore, a full engine log is needed.
> Can you please attach it?
>
> Thanks,
>
>
> *Shani Leviim*
>
> On Mon, Feb 26, 2018 at 2:48 PM, Lionel Caignec <caignec@cines.fr> wrote:
>
> > Hi
> >
> > 1) This is the error message from ui.log:
> >
> > 2018-02-26 13:44:10,001+01 ERROR [org.ovirt.engine.ui.frontend.server.gwt.OvirtRemoteLoggingService] (default task-3) [] Permutation name: 8C01181C3B121D0AAE1312275CC96415
> > 2018-02-26 13:44:10,001+01 ERROR [org.ovirt.engine.ui.frontend.server.gwt.OvirtRemoteLoggingService] (default task-3) [] Uncaught exception: com.google.gwt.core.client.JavaScriptException: (TypeError)
> >  __gwt$exception: <skipped>: Cannot read property 'F' of null
> >         at org.ovirt.engine.ui.uicommonweb.models.storage.DisksAllocationModel$3.$onSuccess(DisksAllocationModel.java:120)
> >         at org.ovirt.engine.ui.uicommonweb.models.storage.DisksAllocationModel$3.onSuccess(DisksAllocationModel.java:120)
> >         at org.ovirt.engine.ui.frontend.Frontend$2.$onSuccess(Frontend.java:233) [frontend.jar:]
> >         at org.ovirt.engine.ui.frontend.Frontend$2.onSuccess(Frontend.java:233) [frontend.jar:]
> >         at org.ovirt.engine.ui.frontend.communication.OperationProcessor$2.$onSuccess(OperationProcessor.java:139) [frontend.jar:]
> >         at org.ovirt.engine.ui.frontend.communication.OperationProcessor$2.onSuccess(OperationProcessor.java:139) [frontend.jar:]
> >         at org.ovirt.engine.ui.frontend.communication.GWTRPCCommunicationProvider$5$1.$onSuccess(GWTRPCCommunicationProvider.java:269) [frontend.jar:]
> >         at org.ovirt.engine.ui.frontend.communication.GWTRPCCommunicationProvider$5$1.onSuccess(GWTRPCCommunicationProvider.java:269) [frontend.jar:]
> >         at com.google.gwt.user.client.rpc.impl.RequestCallbackAdapter.onResponseReceived(RequestCallbackAdapter.java:198) [gwt-servlet.jar:]
> >         at com.google.gwt.http.client.Request.$fireOnResponseReceived(Request.java:237) [gwt-servlet.jar:]
> >         at com.google.gwt.http.client.RequestBuilder$1.onReadyStateChange(RequestBuilder.java:409) [gwt-servlet.jar:]
> >         at Unknown.eval(webadmin-0.js@65)
> >         at com.google.gwt.core.client.impl.Impl.apply(Impl.java:296) [gwt-servlet.jar:]
> >         at com.google.gwt.core.client.impl.Impl.entry0(Impl.java:335) [gwt-servlet.jar:]
> >         at Unknown.eval(webadmin-0.js@54)
> >
> >
> > 2) This line seems to be about the bad disk:
> >
> >  f242cc9a-56c1-4ae4-aef0-f75eb01f74b1 | 2018-01-17 22:02:00+01 | 2748779069440 | 00000000-0000-0000-0000-000000000000 | 00000000-0000-0000-0000-000000000000 | 4 | 2018-01-18 22:01:20.5+01 | 0dd2090c-3491-4fa1-98c3-54ae88be793c
> >
> >
> > 3) The snapshots table is empty for the relevant vm_id.
> >
> > ----- Original Message -----
> > From: "Shani Leviim" <sleviim@redhat.com>
> > To: "Lionel Caignec" <caignec@cines.fr>
> > Cc: "users" <users@ovirt.org>
> > Sent: Monday, 26 February 2018 13:31:23
> > Subject: Re: [ovirt-users] Ghost Snapshot Disk
> >
> > Hi Lionel,
> >
> > The error message you've mentioned sounds like a UI error.
> > Can you please attach your ui log?
> >
> > Also, in the data from the 'images' table you've uploaded, can you say
> > which line is the relevant disk?
> >
> > Finally (for now), in case the snapshot was deleted, can you please
> > validate it by viewing the output of:
> > $ select * from snapshots;
> >
> >
> >
> > *Regards,*
> >
> > *Shani Leviim*
> >
> > On Mon, Feb 26, 2018 at 9:20 AM, Lionel Caignec <caignec@cines.fr> wrote:
> >
> > > Hi Shani,
> > > Thank you for your reply and your help.
> > > I just made a small mistake in my explanation. In fact, the snapshot
> > > no longer exists. It is the disk(s) related to it that still
> > > exist, and perhaps the LVM volume.
> > > So can I manually delete this disk in the database? What about the LVM
> > > volume?
> > > Is it better to recreate the disk, sync the data, and destroy the old one?
> > >
> > >
> > >
> > > ----- Original Message -----
> > > From: "Shani Leviim" <sleviim@redhat.com>
> > > To: "Lionel Caignec" <caignec@cines.fr>
> > > Cc: "users" <users@ovirt.org>
> > > Sent: Sunday, 25 February 2018 14:26:41
> > > Subject: Re: [ovirt-users] Ghost Snapshot Disk
> > >
> > > Hi Lionel,
> > >
> > > You can try to delete that snapshot directly from the database.
> > >
> > > If you are using psql [1], once you've logged in to your database, you can
> > > run this query:
> > > $ select * from snapshots where vm_id = '<your_vm_id>';
> > > This one would list the snapshots associated with a VM by its id.
> > >
> > > If you don't have your vm_id, you can locate it by querying:
> > > $ select * from vms where vm_name = 'nil';
> > > This one would show you some details about a VM by its name (including
> > > the VM's id).
> > >
> > > Once you've found the relevant snapshot, you can delete it by running:
> > > $ delete from snapshots where snapshot_id = '<your_snapshot_id>';
> > > This one would delete the desired snapshot from the database.
> > >
> > > Since it's a delete operation, I would suggest confirming the ids before
> > > executing it.
> > >
> > > Hope you find it useful!
> > >
> > > [1] https://www.ovirt.org/documentation/install-guide/appe-Preparing_a_Remote_PostgreSQL_Database_for_Use_with_the_oVirt_Engine/
> > >
> > >
> > > *Regards,*
> > >
> > > *Shani Leviim*
> > >
> > > On Fri, Feb 23, 2018 at 9:25 AM, Lionel Caignec <caignec@cines.fr> wrote:
> > >
> > > > Hi,
> > > >
> > > > I have a problem with a snapshot. On one VM I have a ghost "snapshot"
> > > > without a name or UUID; the only information is its size (see
> > > > attachment). In the snapshot tab there is no trace of this disk.
> > > >
> > > > In the database (table images) I found this:
> > > >  f242cc9a-56c1-4ae4-aef0-f75eb01f74b1 | 2018-01-17 22:02:00+01 | 2748779069440 | 00000000-0000-0000-0000-000000000000 | 00000000-0000-0000-0000-000000000000 | 4 | 2018-01-18 22:01:20.5+01 | 0dd2090c-3491-4fa1-98c3-54ae88be793c | 2 | 4 | 17e26476-cecb-441d-a5f7-46ab3ef387ee | 2018-01-17 22:01:29.663334+01 | 2018-01-19 08:40:14.345229+01 | f | 1 | 2
> > > >  1c7650fa-542b-4ec2-83a1-d2c1c31be5fd | 2018-01-17 22:02:03+01 | 5368709120000 | 00000000-0000-0000-0000-000000000000 | 00000000-0000-0000-0000-000000000000 | 4 | 2018-01-18 22:01:20.84+01 | 0dd2090c-3491-4fa1-98c3-54ae88be793c | 2 | 4 | bf834a91-c69f-4d2c-b639-116ed58296d8 | 2018-01-17 22:01:29.836133+01 | 2018-01-19 08:40:19.083508+01 | f | 1 | 2
> > > >  8614b21f-c0de-40f2-b4fb-e5cf193b0743 | 2018-02-09 23:00:44+01 | 5368709120000 | 00000000-0000-0000-0000-000000000000 | 00000000-0000-0000-0000-000000000000 | 4 | 2018-02-16 23:00:02.855+01 | 390175dc-baf4-4831-936a-5ea68fa4c969
> > > >
> > > >
> > > > But I don't know which line is my disk. Is it possible to delete it
> > > > directly in the database?
> > > > Or is it better to dump my disk to a new one and delete the
> > > > "corrupted" one?
> > > >
> > > > Another thing: when I try to move the disk to another storage domain, I
> > > > always get "uncaught exception occurred ..." and no error in engine.log.
> > > >
> > > >
> > > >  Thank you for helping.
> > > >
> > > > --
> > > > Lionel Caignec
> > > >
> > > > _______________________________________________
> > > > Users mailing list
> > > > Users@ovirt.org
> > > > http://lists.ovirt.org/mailman/listinfo/users