[ovirt-users] [4.2.2-1.el7.centos] Image locked and unending task

spfma.tech at e.mail.fr spfma.tech at e.mail.fr
Thu Mar 15 10:38:11 UTC 2018


Thanks for that quick answer!

Yes, indeed, I had some connectivity trouble on this server: a strange bonding problem that I have been investigating since yesterday. With just one link it works fine, and I have seen no similar errors since the ones you found.

What can I do to actually remove the task from the list? Manual database cleanup?

 On 15-Mar-2018 11:15:40 +0100, eshenitz at redhat.com wrote:
 Thank you for sending the logs.

 According to the logs, it seems that you had some connectivity issue while you tried to preview the snapshot. The preview operation rolled back but according to you failed to finish.

 It seems like you still have a connectivity issue with that host ('pfm-srv-virt-1.pfm-ad.pfm.loc'), try to see what happens to it.

 Here is the relevant part from the log:

 2018-03-14 17:00:48,652+01 ERROR [org.ovirt.vdsm.jsonrpc.client.reactors.ReactorClient] (SSL Stomp Reactor) [] Heartbeat exceeded for host 'pfm-srv-virt-1.pfm-ad.pfm.loc', last response arrived 2003 ms ago.
 2018-03-14 17:00:53,561+01 INFO [org.ovirt.vdsm.jsonrpc.client.reactors.ReactorClient] (SSL Stomp Reactor) [] Connecting to pfm-srv-virt-1.pfm-ad.pfm.loc/10.100.1.50
 2018-03-14 17:02:21,832+01 INFO [org.ovirt.engine.core.utils.transaction.TransactionSupport] (EE-ManagedThreadFactory-engine-Thread-118906) [] transaction rolled back
 2018-03-14 17:02:21,836+01 ERROR [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (EE-ManagedThreadFactory-engine-Thread-118906) [] EVENT_ID: USER_TRY_BACK_TO_SNAPSHOT_FINISH_FAILURE(99), Failed to complete Snapshot-Preview AFTER_INSTALL for VM pfm-ltsp-1.
 2018-03-14 17:02:21,836+01 ERROR [org.ovirt.engine.core.bll.tasks.CommandAsyncTask] (EE-ManagedThreadFactory-engine-Thread-118906) [] [within thread]: endAction for action type TryBackToAllSnapshotsOfVm threw an exception.: java.lang.NullPointerException
     at org.ovirt.engine.core.bll.snapshots.SnapshotsManager.deviceCanBeRemoved(SnapshotsManager.java:463) [bll.jar:]
     at org.ovirt.engine.core.bll.snapshots.SnapshotsManager.attempToRestoreVmConfigurationFromSnapshot(SnapshotsManager.java:415) [bll.jar:]
     at org.ovirt.engine.core.bll.snapshots.TryBackToAllSnapshotsOfVmCommand.restoreVmConfigFromSnapshot(TryBackToAllSnapshotsOfVmCommand.java:204) [bll.jar:]
     at org.ovirt.engine.core.bll.snapshots.TryBackToAllSnapshotsOfVmCommand.endSuccessfully(TryBackToAllSnapshotsOfVmCommand.java:168) [bll.jar:]
     at org.ovirt.engine.core.bll.CommandBase.internalEndSuccessfully(CommandBase.java:675) [bll.jar:]
     at org.ovirt.engine.core.bll.CommandBase.endActionInTransactionScope(CommandBase.java:630) [bll.jar:]
     at org.ovirt.engine.core.bll.CommandBase.runInTransaction(CommandBase.java:1936) [bll.jar:]
     at org.ovirt.engine.core.utils.transaction.TransactionSupport.executeInNewTransaction(TransactionSupport.java:202) [utils.jar:]
     at org.ovirt.engine.core.utils.transaction.TransactionSupport.executeInRequired(TransactionSupport.java:137) [utils.jar:]
     at org.ovirt.engine.core.utils.transaction.TransactionSupport.executeInScope(TransactionSupport.java:105) [utils.jar:]
     at org.ovirt.engine.core.bll.CommandBase.endAction(CommandBase.java:495) [bll.jar:]
     at org.ovirt.engine.core.bll.tasks.DecoratedCommand.endAction(DecoratedCommand.java:17) [bll.jar:]
     at org.ovirt.engine.core.bll.tasks.CoCoAsyncTaskHelper.endAction(CoCoAsyncTaskHelper.java:353) [bll.jar:]
     at org.ovirt.engine.core.bll.tasks.CommandCoordinatorImpl.endAction(CommandCoordinatorImpl.java:347) [bll.jar:]
     at org.ovirt.engine.core.bll.tasks.CommandAsyncTask.endCommandAction(CommandAsyncTask.java:160) [bll.jar:]
     at org.ovirt.engine.core.bll.tasks.CommandAsyncTask.lambda$endActionIfNecessary$0(CommandAsyncTask.java:112) [bll.jar:]
     at org.ovirt.engine.core.utils.threadpool.ThreadPoolUtil$InternalWrapperRunnable.run(ThreadPoolUtil.java:96) [utils.jar:]
     at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) [rt.jar:1.8.0_161]
     at java.util.concurrent.FutureTask.run(FutureTask.java:266) [rt.jar:1.8.0_161]
     at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) [rt.jar:1.8.0_161]
     at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) [rt.jar:1.8.0_161]
     at java.lang.Thread.run(Thread.java:748) [rt.jar:1.8.0_161]
     at org.glassfish.enterprise.concurrent.ManagedThreadFactoryImpl$ManagedThread.run(ManagedThreadFactoryImpl.java:250) [javax.enterprise.concurrent-1.0.jar:]
     at org.jboss.as.ee.concurrent.service.ElytronManagedThreadFactory$ElytronManagedThread.run(ElytronManagedThreadFactory.java:78)
 2018-03-14 17:02:21,838+01 INFO [org.ovirt.engine.core.bll.tasks.CommandAsyncTask] (EE-ManagedThreadFactory-engine-Thread-118906) [] CommandAsyncTask::HandleEndActionResult: endAction for action type 'TryBackToAllSnapshotsOfVm' threw an unrecoverable RuntimeException the task will be cleared.
 2018-03-14 17:02:21,841+01 INFO [org.ovirt.engine.core.bll.tasks.SPMAsyncTask] (EE-ManagedThreadFactory-engine-Thread-118906)
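
 To see whether the heartbeat problem with that host keeps recurring, the engine.log can be scanned for the "Heartbeat exceeded" message quoted above. The following is only a minimal sketch in Python: the function name `heartbeat_failures` is made up for illustration, and the pattern assumes that every such entry has the same layout as the one in this excerpt.

```python
import re

# Matches the "Heartbeat exceeded" ERROR entry as it appears in the excerpt above.
# Assumption: other occurrences in engine.log use the same layout.
HEARTBEAT_RE = re.compile(
    r"^(?P<ts>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2},\d{3})\S*\s+ERROR\s+.*"
    r"Heartbeat exceeded for host '(?P<host>[^']+)', "
    r"last response arrived (?P<ms>\d+) ms ago"
)


def heartbeat_failures(lines):
    """Return (timestamp, host, ms_since_last_response) for each matching line."""
    found = []
    for line in lines:
        match = HEARTBEAT_RE.search(line)
        if match:
            found.append((match.group("ts"), match.group("host"), int(match.group("ms"))))
    return found


# Two entries copied from the log excerpt above: one ERROR, one INFO.
sample = [
    "2018-03-14 17:00:48,652+01 ERROR [org.ovirt.vdsm.jsonrpc.client.reactors.ReactorClient] "
    "(SSL Stomp Reactor) [] Heartbeat exceeded for host 'pfm-srv-virt-1.pfm-ad.pfm.loc', "
    "last response arrived 2003 ms ago.",
    "2018-03-14 17:00:53,561+01 INFO [org.ovirt.vdsm.jsonrpc.client.reactors.ReactorClient] "
    "(SSL Stomp Reactor) [] Connecting to pfm-srv-virt-1.pfm-ad.pfm.loc/10.100.1.50",
]

for ts, host, ms in heartbeat_failures(sample):
    print(ts, host, ms)
```

 To run it against a real log, pass an open file object instead of `sample`; no new matches after the bonding change would be consistent with the link now being stable.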
 On Thu, Mar 15, 2018 at 11:35 AM,  wrote:

 Thanks for your reply.

 Yesterday I realized I was getting nowhere with some of the software I had planned to install in a VM, so I tried to revert to a snapshot I took just after the OS installation, as I always do.

 As I had added a second disk to the VM in the meantime, I chose to revert to the snapshot without regard for the second disk's contents.

 But the preview operation never ended. So I restarted the engine VM, but nothing changed.

 This morning I tried to clean things up using "taskcleaner" and "unlock_entity". I regained control over the VM, but the task is still in the "finalizing" state in the GUI.

 I even removed the second disk to see whether that helped, but it made no difference.

 You will find the engine log file and the "vdsm.log" from the server the task is running on.

 I am not sure how to check the engine version precisely, so I queried the rpm database in the VM: ovirt-engine-4.2.2-1.el7.centos.noarch

 Regards

 On 15-Mar-2018 10:17:59 +0100, eshenitz at redhat.com wrote:
 Hi,

 Can you please specify the version of the engine and supply the engine.log and the vdsm.log?

 Moreover, can you please specify the steps that you did that led you to this issue?

 Thanks,
 On Thu, Mar 15, 2018 at 11:05 AM,  wrote:

 Hi,

 I tried to roll back to a snapshot on a VM, but the preview never ended. The task has been running for about 15 hours, with this state:

 {
     "916b67fb-8808-43d2-850c-1c12650ccc49": {
         "verb": "createVolume",
         "code": 0,
         "state": "finished",
         "tag": "spm",
         "result": {
             "uuid": "d37ca118-820f-46a3-b99b-714018ea8b42"
         },
         "message": "1 jobs completed successfully",
         "id": "916b67fb-8808-43d2-850c-1c12650ccc49"
     }
 }

 I just canceled it: the task list is now empty on the CLI, but nothing changed in the GUI. So I restarted the engine VM, without success.

 With "/usr/share/ovirt-engine/setup/dbutils/unlock_entity.sh" I was able to manually unlock the image, but the task is still "finalizing".

 Is this a bug?

 Regards
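
 The task status quoted above can also be checked programmatically. The sketch below, in Python, parses that JSON and lists any task whose state is not "finished". The helper names `summarize_tasks` and `pending_tasks` are hypothetical, and the JSON is assumed to have been saved as text from the CLI output. Here the only task is reported as finished with code 0, which suggests the "finalizing" state shown in the GUI comes from the engine side rather than from this task.

```python
import json

# Task status as quoted in the message above (assumed saved as text).
TASK_STATUS = """
{
    "916b67fb-8808-43d2-850c-1c12650ccc49": {
        "verb": "createVolume",
        "code": 0,
        "state": "finished",
        "tag": "spm",
        "result": {
            "uuid": "d37ca118-820f-46a3-b99b-714018ea8b42"
        },
        "message": "1 jobs completed successfully",
        "id": "916b67fb-8808-43d2-850c-1c12650ccc49"
    }
}
"""


def summarize_tasks(raw):
    """Return (task_id, verb, state, code) for every task in the JSON text."""
    tasks = json.loads(raw)
    return [
        (task_id, info["verb"], info["state"], info["code"])
        for task_id, info in tasks.items()
    ]


def pending_tasks(raw):
    """Return only the tasks whose state is not 'finished'."""
    return [task for task in summarize_tasks(raw) if task[2] != "finished"]


for task_id, verb, state, code in summarize_tasks(TASK_STATUS):
    print(task_id, verb, state, code)

print("pending:", pending_tasks(TASK_STATUS))
```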

-------------------------------------------------------------------------------------------------
FreeMail powered by mail.fr 
_______________________________________________
 Users mailing list
Users at ovirt.org
http://lists.ovirt.org/mailman/listinfo/users

   -- 
  Regards, Eyal Shenitzky     
