On Wed, Nov 25, 2020 at 1:02 PM Yedidyah Bar David <didi(a)redhat.com> wrote:
Hi all,
I ran OST manually with suite he-basic-master, and it failed at a
rather late stage (010_local_maintenance_cli. My changes were for
008_restart_he_vm, which passed). It failed because the hosted-engine
VM was already on host-1, which wasn't expected. I spent quite some
time trying to understand this, and I *think* the root cause was quite
before that. engine.log:
2020-11-25 09:48:50,864+01 INFO
[org.ovirt.engine.core.bll.network.host.HostSetupNetworksCommand]
(EE-ManagedThreadFactory-engine-Thread-273) [2e1072c7] Host setup
networks finished. Lock released. Monitoring can run now for host
'lago-he-basic-suite-master-host-0.lago.local' from data-center
'Default'
2020-11-25 09:48:50,866+01 INFO
[org.ovirt.engine.core.bll.network.host.HostSetupNetworksCommand]
(EE-ManagedThreadFactory-engine-Thread-273) [2e1072c7] Lock freed to
object
'EngineLock:{exclusiveLocks='[HOST_NETWORKfba14f41-7ece-46ca-af65-7476e9fd2018=HOST_NETWORK]',
sharedLocks=''}'
2020-11-25 09:48:50,923+01 INFO
[org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector]
(EE-ManagedThreadFactory-engine-Thread-273) [2e1072c7] EVENT_ID:
PERSIST_NETWORK_ON_HOST_FINISHED(1,146), (1/2): Successfully applied
changes for network(s) Labeled_Network on host
lago-he-basic-suite-master-host-0.lago.local. (User:
admin@internal-authz)
2020-11-25 09:48:53,883+01 INFO
[org.ovirt.engine.core.bll.ConcurrentChildCommandsExecutionCallback]
(EE-ManagedScheduledExecutorService-engineScheduledThreadPool-Thread-47)
[2e1072c7] Command 'PropagateNetworksToClusterHosts' id:
'6e61f53b-c2b7-4a35-a840-67076e6d66cb' child commands
'[fe7f022e-c13b-4924-a6ea-cc56407e6969,
ca990d6e-199b-473d-928d-0a7e6d8cf073]' executions were completed,
status 'SUCCEEDED'
2020-11-25 09:48:54,891+01 INFO
[org.ovirt.engine.core.bll.network.cluster.PropagateNetworksToClusterHostsCommand]
(EE-ManagedScheduledExecutorService-engineScheduledThreadPool-Thread-85)
[2e1072c7] Ending command
'org.ovirt.engine.core.bll.network.cluster.PropagateNetworksToClusterHostsCommand'
successfully.
2020-11-25 09:49:24,749+01 ERROR
[org.ovirt.engine.core.vdsbroker.monitoring.HostMonitoring]
(EE-ManagedScheduledExecutorService-engineScheduledThreadPool-Thread-92)
[] Failure on checkInterfaces on update runtime info for host
'lago-he-basic-suite-master-host-0.lago.local':
PreparedStatementCallback; uncategorized SQLException for SQL [select
* from getallnetworkbyclusterid(?, ?, ?)]; SQL state [null]; error
code [0]; IJ031013: Interrupted attempting lock:
org.jboss.jca.adapters.jdbc.local.LocalManagedConnection@7d88842;
nested exception is java.sql.SQLException: IJ031013: Interrupted
attempting lock:
org.jboss.jca.adapters.jdbc.local.LocalManagedConnection@7d88842
2020-11-25 09:49:24,752+01 ERROR
[org.ovirt.engine.core.vdsbroker.monitoring.HostMonitoring]
(EE-ManagedScheduledExecutorService-engineScheduledThreadPool-Thread-92)
[] ResourceManager::refreshVdsRunTimeInfo:
host=lago-he-basic-suite-master-host-0.lago.local(fba14f41-7ece-46ca-af65-7476e9fd2018):
PreparedStatementCallback; uncategorized SQLException for SQL [select
* from getvdsdynamicbyvdsid(?)]; SQL state [null]; error code [0];
IJ031013: Interrupted attempting lock:
org.jboss.jca.adapters.jdbc.local.LocalManagedConnection@7d88842;
nested exception is java.sql.SQLException: IJ031013: Interrupted
attempting lock:
org.jboss.jca.adapters.jdbc.local.LocalManagedConnection@7d88842
2020-11-25 09:49:24,761+01 ERROR
[org.ovirt.engine.core.vdsbroker.VdsManager]
(EE-ManagedScheduledExecutorService-engineScheduledThreadPool-Thread-92)
[] Timer update runtime info failed. Exception: SQLException:
IJ031013: Interrupted attempting lock:
org.jboss.jca.adapters.jdbc.local.LocalManagedConnection@7d88842
Any idea?
Filed for this:
https://bugzilla.redhat.com/show_bug.cgi?id=1901471
Best regards,
--
Didi