[ovirt-users] Ghost Snapshot Disk

Lionel Caignec caignec at cines.fr
Mon Feb 26 14:18:11 UTC 2018


Ok so i reply myself,

Version is 4.1.7.6-1

I just delete manually a snapshot previously created. But this is an io intensive vm, whit big disk (2,5To, and 5To).

For the log, i cannot paste all my log on public list security reason, i will send you full in private.
Here is an extract relevant to my error 
engine.log-20180210:2018-02-09 23:00:03,200+01 INFO  [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (default task-312) [44402a8c-3196-43f0-ba33-307ea78e6f49] EVENT_ID: USER_CREATE_SNAPSHOT(45), Correlation ID: 44402a8c-3196-43f0-ba33-307ea78e6f49, Job ID: 030cd310-fec9-4a89-8c3f-7888504fe973, Call Stack: null, Custom ID: null, Custom Event ID: -1, Message: Snapshot 'AUTO_7D_zz_nil_20180209_220002' creation for VM 'zz_nil' was initiated by snap_user at internal.
engine.log-20180210:2018-02-09 23:01:06,578+01 INFO  [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (DefaultQuartzScheduler5) [] EVENT_ID: USER_CREATE_SNAPSHOT_FINISHED_SUCCESS(68), Correlation ID: 44402a8c-3196-43f0-ba33-307ea78e6f49, Job ID: 030cd310-fec9-4a89-8c3f-7888504fe973, Call Stack: null, Custom ID: null, Custom Event ID: -1, Message: Snapshot 'AUTO_7D_zz_nil_20180209_220002' creation for VM 'zz_nil' has been completed.
engine.log-20180220:2018-02-19 17:01:23,800+01 INFO  [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (default task-113) [] EVENT_ID: USER_REMOVE_SNAPSHOT(342), Correlation ID: 06a9efa4-1b80-4021-bf3e-41ecebe58a88, Job ID: c9a918a7-b00c-43cf-b6de-3659ac0765da, Call Stack: null, Custom ID: null, Custom Event ID: -1, Message: Snapshot 'AUTO_7D_zz_nil_20180209_220002' deletion for VM 'zz_nil' was initiated by acaignec at ldap-cines-authz.
engine.log-20180221:2018-02-20 22:24:45,174+01 ERROR [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (DefaultQuartzScheduler6) [06a9efa4-1b80-4021-bf3e-41ecebe58a88] EVENT_ID: USER_REMOVE_SNAPSHOT_FINISHED_FAILURE(357), Correlation ID: 06a9efa4-1b80-4021-bf3e-41ecebe58a88, Job ID: c9a918a7-b00c-43cf-b6de-3659ac0765da, Call Stack: null, Custom ID: null, Custom Event ID: -1, Message: Failed to delete snapshot 'AUTO_7D_zz_nil_20180209_220002' for VM 'zz_nil'.
 2018-02-20 22:24:46,266+01 INFO  [org.ovirt.engine.core.bll.tasks.SPMAsyncTask] (DefaultQuartzScheduler3) [516079c3] SPMAsyncTask::PollTask: Polling task '34137342-4f30-476d-b16c-1cb7e0ea0ac0' (Parent Command 'DestroyImage', Parameters Type 'org.ovirt.engine.core.common.asynctasks.AsyncTaskParameters') returned status 'finished', result 'success'.
2018-02-20 22:24:46,267+01 INFO  [org.ovirt.engine.core.bll.tasks.SPMAsyncTask] (DefaultQuartzScheduler3) [516079c3] BaseAsyncTask::onTaskEndSuccess: Task '34137342-4f30-476d-b16c-1cb7e0ea0ac0' (Parent Command 'DestroyImage', Parameters Type 'org.ovirt.engine.core.common.asynctasks.AsyncTaskParameters') ended successfully.
2018-02-20 22:24:46,268+01 INFO  [org.ovirt.engine.core.bll.tasks.CommandAsyncTask] (DefaultQuartzScheduler3) [516079c3] CommandAsyncTask::endActionIfNecessary: All tasks of command 'fe8c91f2-386b-4b3f-bbf3-aeda8e9244c6' has ended -> executing 'endAction'
2018-02-20 22:24:46,268+01 INFO  [org.ovirt.engine.core.bll.tasks.CommandAsyncTask] (DefaultQuartzScheduler3) [516079c3] CommandAsyncTask::endAction: Ending action for '1' tasks (command ID: 'fe8c91f2-386b-4b3f-bbf3-aeda8e9244c6'): calling endAction '.
2018-02-20 22:24:46,268+01 INFO  [org.ovirt.engine.core.bll.tasks.CommandAsyncTask] (org.ovirt.thread.pool-6-thread-20) [516079c3] CommandAsyncTask::endCommandAction [within thread] context: Attempting to endAction 'DestroyImage',
2018-02-20 22:24:46,269+01 ERROR [org.ovirt.engine.core.bll.tasks.CommandAsyncTask] (org.ovirt.thread.pool-6-thread-20) [516079c3] [within thread]: endAction for action type DestroyImage threw an exception.: java.lang.NullPointerException
        at org.ovirt.engine.core.bll.tasks.CoCoAsyncTaskHelper.endAction(CoCoAsyncTaskHelper.java:335) [bll.jar:]
        at org.ovirt.engine.core.bll.tasks.CommandCoordinatorImpl.endAction(CommandCoordinatorImpl.java:340) [bll.jar:]
        at org.ovirt.engine.core.bll.tasks.CommandAsyncTask.endCommandAction(CommandAsyncTask.java:154) [bll.jar:]
        at org.ovirt.engine.core.bll.tasks.CommandAsyncTask.lambda$endActionIfNecessary$0(CommandAsyncTask.java:106) [bll.jar:]
        at org.ovirt.engine.core.utils.threadpool.ThreadPoolUtil$InternalWrapperRunnable.run(ThreadPoolUtil.java:84) [utils.jar:]
        at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) [rt.jar:1.8.0_161]
        at java.util.concurrent.FutureTask.run(FutureTask.java:266) [rt.jar:1.8.0_161]
        at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) [rt.jar:1.8.0_161]
        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) [rt.jar:1.8.0_161]
        at java.lang.Thread.run(Thread.java:748) [rt.jar:1.8.0_161]

----- Mail original -----
De: "Shani Leviim" <sleviim at redhat.com>
À: "Lionel Caignec" <caignec at cines.fr>
Envoyé: Lundi 26 Février 2018 14:42:38
Objet: Re: [ovirt-users] Ghost Snapshot Disk

Yes, please.
Can you detail a bit more regarding the actions you've done?

I'm assuming that since the snapshot had no description, trying to operate
it caused the nullPointerException you've got.
But I want to examine what was the cause for that.

Also, can you please answer back to the list?



*Regards,*

*Shani Leviim*

On Mon, Feb 26, 2018 at 3:37 PM, Lionel Caignec <caignec at cines.fr> wrote:

> Version is 4.1.7.6-1
>
> Do you want the log from the day i delete snapshot?
>
> ----- Mail original -----
> De: "Shani Leviim" <sleviim at redhat.com>
> À: "Lionel Caignec" <caignec at cines.fr>
> Cc: "users" <users at ovirt.org>
> Envoyé: Lundi 26 Février 2018 14:29:16
> Objet: Re: [ovirt-users] Ghost Snapshot Disk
>
> Hi,
>
> What is your engine version, please?
> I'm trying to reproduce your steps, for understanding better was is the
> cause for that error. Therefore, a full engine log is needed.
> Can you please attach it?
>
> Thanks,
>
>
> *Shani Leviim*
>
> On Mon, Feb 26, 2018 at 2:48 PM, Lionel Caignec <caignec at cines.fr> wrote:
>
> > Hi
> >
> > 1) this is error message from ui.log
> >
> >  2018-02-26 13:44:10,001+01 ERROR [org.ovirt.engine.ui.frontend.
> > server.gwt.OvirtRemoteLoggingService] (default task-3) [] Permutation
> > name: 8C01181C3B121D0AAE1312275CC96415
> > 2018-02-26 13:44:10,001+01 ERROR [org.ovirt.engine.ui.frontend.
> server.gwt.OvirtRemoteLoggingService]
> > (default task-3) [] Uncaught exception: com.google.gwt.core.client.
> JavaScriptException:
> > (TypeError)
> >  __gwt$exception: <skipped>: Cannot read property 'F' of null
> >         at org.ovirt.engine.ui.uicommonweb.models.storage.
> > DisksAllocationModel$3.$onSuccess(DisksAllocationModel.java:120)
> >         at org.ovirt.engine.ui.uicommonweb.models.storage.
> > DisksAllocationModel$3.onSuccess(DisksAllocationModel.java:120)
> >         at org.ovirt.engine.ui.frontend.Frontend$2.$onSuccess(
> Frontend.java:233)
> > [frontend.jar:]
> >         at org.ovirt.engine.ui.frontend.Frontend$2.onSuccess(Frontend.
> java:233)
> > [frontend.jar:]
> >         at org.ovirt.engine.ui.frontend.communication.
> > OperationProcessor$2.$onSuccess(OperationProcessor.java:139)
> > [frontend.jar:]
> >         at org.ovirt.engine.ui.frontend.communication.
> > OperationProcessor$2.onSuccess(OperationProcessor.java:139)
> > [frontend.jar:]
> >         at org.ovirt.engine.ui.frontend.communication.
> > GWTRPCCommunicationProvider$5$1.$onSuccess(GWTRPCCommunicationProvider.
> java:269)
> > [frontend.jar:]
> >         at org.ovirt.engine.ui.frontend.communication.
> > GWTRPCCommunicationProvider$5$1.onSuccess(GWTRPCCommunicationProvider.
> java:269)
> > [frontend.jar:]
> >         at com.google.gwt.user.client.rpc.impl.RequestCallbackAdapter.
> > onResponseReceived(RequestCallbackAdapter.java:198) [gwt-servlet.jar:]
> >         at com.google.gwt.http.client.Request.$fireOnResponseReceived(
> Request.java:237)
> > [gwt-servlet.jar:]
> >         at com.google.gwt.http.client.RequestBuilder$1.
> onReadyStateChange(RequestBuilder.java:409)
> > [gwt-servlet.jar:]
> >         at Unknown.eval(webadmin-0.js at 65)
> >         at com.google.gwt.core.client.impl.Impl.apply(Impl.java:296)
> > [gwt-servlet.jar:]
> >         at com.google.gwt.core.client.impl.Impl.entry0(Impl.java:335)
> > [gwt-servlet.jar:]
> >         at Unknown.eval(webadmin-0.js at 54)
> >
> >
> > 2) This line seems to be about the bad disk :
> >
> >  f242cc9a-56c1-4ae4-aef0-f75eb01f74b1 | 2018-01-17 22:02:00+01 |
> > 2748779069440 | 00000000-0000-0000-0000-000000000000 |
> > 00000000-0000-0000-0000-000000000000 |           4 | 2018-01-18
> > 22:01:20.5+01   | 0dd2090c-3491-4fa1-98c3-54ae88be793c
> >
> >
> > 3) Snapshot table is empty for the concerned vm_id.
> >
> > ----- Mail original -----
> > De: "Shani Leviim" <sleviim at redhat.com>
> > À: "Lionel Caignec" <caignec at cines.fr>
> > Cc: "users" <users at ovirt.org>
> > Envoyé: Lundi 26 Février 2018 13:31:23
> > Objet: Re: [ovirt-users] Ghost Snapshot Disk
> >
> > Hi Lionel,
> >
> > The error message you've mentioned sounds like a UI error.
> > Can you please attach your ui log?
> >
> > Also, on the data from 'images' table you've uploaded, can you describe
> > which line is the relevant disk?
> >
> > Finally (for now), in case the snapshot was deleted, can you please
> > validate it by viewing the output of:
> > $ select * from snapshots;
> >
> >
> >
> > *Regards,*
> >
> > *Shani Leviim*
> >
> > On Mon, Feb 26, 2018 at 9:20 AM, Lionel Caignec <caignec at cines.fr>
> wrote:
> >
> > > Hi Shani,
> > > thank you for helping me with your reply,
> > > i juste make a little mistake on explanation. In fact it's the snapshot
> > > does not exist anymore. This is the disk(s) relative to her wich still
> > > exist, and perhaps LVM volume.
> > > So can i delete manually this disk in database? what about the lvm
> > volume?
> > > Is it better to recreate disk sync data and destroy old one?
> > >
> > >
> > >
> > > ----- Mail original -----
> > > De: "Shani Leviim" <sleviim at redhat.com>
> > > À: "Lionel Caignec" <caignec at cines.fr>
> > > Cc: "users" <users at ovirt.org>
> > > Envoyé: Dimanche 25 Février 2018 14:26:41
> > > Objet: Re: [ovirt-users] Ghost Snapshot Disk
> > >
> > > Hi Lionel,
> > >
> > > You can try to delete that snapshot directly from the database.
> > >
> > > In case of using psql  [1], once you've logged in to your database, you
> > can
> > > run this query:
> > > $ select * from snapshots where vm_id = '<your_vm_id>';
> > > This one would list the snapshots associated with a VM by its id.
> > >
> > > In case you don't have you vm_id, you can locate it by querying:
> > > $ select * from vms where vm_name = 'nil';
> > > This one would show you some details about a VM by its name (including
> > the
> > > vm's id).
> > >
> > > Once you've found the relevant snapshot, you can delete it by running:
> > > $ delete from snapshots where snapshot_id = '<your_snapshot_id>';
> > > This one would delete the desired snapshot from the database.
> > >
> > > Since it's a delete operation, I would suggest confirming the ids
> before
> > > executing it.
> > >
> > > Hope you've found it useful!
> > >
> > > [1]
> > > https://www.ovirt.org/documentation/install-guide/
> > appe-Preparing_a_Remote_
> > > PostgreSQL_Database_for_Use_with_the_oVirt_Engine/
> > >
> > >
> > > *Regards,*
> > >
> > > *Shani Leviim*
> > >
> > > On Fri, Feb 23, 2018 at 9:25 AM, Lionel Caignec <caignec at cines.fr>
> > wrote:
> > >
> > > > Hi,
> > > >
> > > > i've a problem with snapshot. On one VM i've a "snapshot" ghost
> without
> > > > name or uuid, only information is size (see attachment). In the
> > snapshot
> > > > tab there is no trace about this disk.
> > > >
> > > > In database (table images) i found this :
> > > >   f242cc9a-56c1-4ae4-aef0-f75eb01f74b1 | 2018-01-17 22:02:00+01 |
> > > > 2748779069440 | 00000000-0000-0000-0000-000000000000 |
> > > > 00000000-0000-0000-0000-000000000000 |           4 | 2018-01-18
> > > > 22:01:20.5+01   | 0dd2090c-3491-4fa1-98c3-54ae88be793c
> > > >  |           2 |             4 | 17e26476-cecb-441d-a5f7-
> 46ab3ef387ee
> > |
> > > > 2018-01-17 22:01:29.663334+01 | 2018-01-19 08:40:14.345229+01 | f
> > |
> > > >                  1 |           2
> > > >  1c7650fa-542b-4ec2-83a1-d2c1c31be5fd | 2018-01-17 22:02:03+01 |
> > > > 5368709120000 | 00000000-0000-0000-0000-000000000000 |
> > > > 00000000-0000-0000-0000-000000000000 |           4 | 2018-01-18
> > > > 22:01:20.84+01  | 0dd2090c-3491-4fa1-98c3-54ae88be793c
> > > >  |           2 |             4 | bf834a91-c69f-4d2c-b639-
> 116ed58296d8
> > |
> > > > 2018-01-17 22:01:29.836133+01 | 2018-01-19 08:40:19.083508+01 | f
> > |
> > > >                  1 |           2
> > > >  8614b21f-c0de-40f2-b4fb-e5cf193b0743 | 2018-02-09 23:00:44+01 |
> > > > 5368709120000 | 00000000-0000-0000-0000-000000000000 |
> > > > 00000000-0000-0000-0000-000000000000 |           4 | 2018-02-16
> > > > 23:00:02.855+01 | 390175dc-baf4-4831-936a-5ea68fa4c969
> > > >
> > > >
> > > >  But i does not know which line is my disk. Is it possible to delete
> > > > directly into database?
> > > >  Or is it better to dump my disk to another new and delete the
> > "corrupted
> > > > one"?
> > > >
> > > >  Another thing, when i try to move the disk to another storage
> domain i
> > > > always get "uncaght exeption occured ..." and no error in engine.log.
> > > >
> > > >
> > > >  Thank you for helping.
> > > >
> > > > --
> > > > Lionel Caignec
> > > >
> > > > _______________________________________________
> > > > Users mailing list
> > > > Users at ovirt.org
> > > > http://lists.ovirt.org/mailman/listinfo/users
> > > >
> > > >
> > >
> >
>


More information about the Users mailing list