[
https://ovirt-jira.atlassian.net/browse/OVIRT-2311?page=com.atlassian.jir...
]
Dafna Ron commented on OVIRT-2311:
----------------------------------
the errors I posted are from the engine messages log.
I see only warn messages in server log about hostkey:
https://pastebin.com/0h75Trnz
I can't see any sign that engine service went down but I can see that there is a gap
in the logs that makes it difficult.
the last log we have is post-003_basic_networking.py but the console is showing that the
issue accoured after:
ovirt-master vdsm failure - extend disk test times out
-------------------------------------------------------
Key: OVIRT-2311
URL:
https://ovirt-jira.atlassian.net/browse/OVIRT-2311
Project: oVirt - virtualization made easy
Issue Type: Bug
Reporter: Dafna Ron
Assignee: infra
Labels: ost_failures
There is no failure of a test but we can see a gap of an hour from start of extend_disk1
until unhanded exception is thrown.
Although the error is reported on 004_basic_sanity the logs stop on
post-003_basic_networking.py.
looking at the available logs it may be a task that is not cleared.
I do not see a connection between the patch and the timeout unless something in the tests
is looking ERROR: InterpreterNotFound: python3.6 errors in order to exit.
patch that failed:
https://gerrit.ovirt.org/#/c/92847/
job:
http://jenkins.ovirt.org/job/ovirt-master_change-queue-tester/8648/
Error I can see in the logs is:
2018-07-12 01:23:14,863-04 ERROR
[org.ovirt.engine.core.bll.storage.domain.AttachStorageDomainToPoolCommand] (default
task-1) [] An error occurred while fetching unregistered disks from Storage Domain id
'597bfe1f-d88c-4961-b1dc-f6c771af9
fff'
but I can't see an error in vdsm that would suggest we had a failure.
you can see the gap in the job:
05:31:45 [basic-suit] # extend_disk1:
06:31:56 [basic-suit] * Unhandled exception in <function <lambda> at
0x7f5dd403c5f0>
--
This message was sent by Atlassian Jira
(v1001.0.0-SNAPSHOT#100088)