[ OST Failure Report ] [ oVirt Master (ovirt-engine) ] [ 11-09-2018 ] [ 002_bootstrap.add_cluster ]
by Dafna Ron
Hi,
we have failures on both upgrade and basic suites for master.
The patch reported as cause is:
https://gerrit.ovirt.org/#/c/93345/10 - engine : Add finer grained
monitoring thresholds for memory consumption on Hypervisors
Ravi, can you please check this issue?
You can see the logs here:
https://jenkins.ovirt.org/job/ovirt-master_change-queue-tester/10187/arti...
Here is the error:
2018-09-11 02:08:44,511-04 WARN
[org.ovirt.engine.core.bll.AddClusterCommand] (default task-2)
[1362d97c-9d55-40d2-9b03-ebe93ae2fe67] Validation of action 'AddCluster'
failed for user admin@internal-authz. Reasons: VAR__TYPE__CLUSTER,VAR
__ACTION__CREATE,must be greater than or equal to 1,$groups
[Ljava.lang.Class;@29a040fc,$message
{javax.validation.constraints.Min.message},$payload
[Ljava.lang.Class;@64094510,$value
1,ACTION_TYPE_FAILED_ATTRIBUTE_PATH,$path cluster.logM
axMemoryUsedThreshold,$validatedValue 0
2018-09-11 02:08:44,511-04 DEBUG
[org.ovirt.engine.core.common.di.interceptor.DebugLoggingInterceptor]
(default task-2) [1362d97c-9d55-40d2-9b03-ebe93ae2fe67] method: runAction,
params: [AddCluster, ManagementNetworkOnClusterOperationPara
meters:{commandId='e5eb6cf6-4e3d-4f50-be6a-9fe18bfc4d97', user='null',
commandType='Unknown'}], timeElapsed: 56ms
2018-09-11 02:08:44,516-04 ERROR
[org.ovirt.engine.api.restapi.resource.AbstractBackendResource] (default
task-2) [] Operation Failed: [must be greater than or equal to 1,
Attribute: cluster.logMaxMemoryUsedThreshold]
2018-09-11 02:08:44,533-04 DEBUG
[org.ovirt.engine.core.aaa.filters.SsoRestApiAuthFilter] (default task-2)
[] Entered SsoRestApiAuthFilter
Thanks,
Dafna
6 years, 2 months
Issue in host deploy
by Ravi Shankar Nori
Hi,
Host deploy is failing on master with errors. The issue seems to be that
the opcode that is returned from vdsm has Unicode null characters in it
(obtained using a break point in VdsDeploy.userVisibleLog).
"Failed to execute stage 'Initializing': Invalid response opcode
'\u0000\u0000\u0000\u0000\u0000\u0000\u0000\u0000\u0000\u0000\u0000CONFIRM'"
Resulting in below errors when the message is inserted into database.
Can someone take a look
2018-09-11 13:38:11,121-04 INFO
[org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector]
(VdsDeploy) [669a0081] EVENT_ID: VDS_INSTALL_IN_PROGRESS(509), Installing
Host 192.168.122.57. Stage: Initializing.
2018-09-11 13:38:39,231-04 INFO
[org.ovirt.engine.core.utils.transaction.TransactionSupport] (VdsDeploy)
[669a0081] transaction rolled back
2018-09-11 13:38:39,231-04 ERROR
[org.ovirt.engine.core.bll.hostdeploy.VdsDeployBase] (VdsDeploy) [669a0081]
Error during deploy dialog
2018-09-11 13:38:39,246-04 ERROR
[org.ovirt.engine.core.bll.hostdeploy.VdsDeployBase]
(EE-ManagedThreadFactory-engine-Thread-11) [669a0081] Error during host
192.168.122.57 install
2018-09-11 13:38:41,791-04 ERROR
[org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector]
(EE-ManagedThreadFactory-engine-Thread-11) [669a0081] EVENT_ID:
VDS_INSTALL_IN_PROGRESS_ERROR(511), An error has occurred during
installation of Host 192.168.122.57: CallableStatementCallback; SQL [{call
insertauditlog(?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?,
?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?)}ERROR: invalid byte sequence for
encoding "UTF8": 0x00; nested exception is
org.postgresql.util.PSQLException: ERROR: invalid byte sequence for
encoding "UTF8": 0x00.
6 years, 2 months
[ OST Failure Report ] [ oVirt Master (ovirt-ansible-manageiq) ] [ 11-09-2018 ] [ metrics_bootstrap.metrics_and_log_collector ]
by Dafna Ron
Hi,
we have a failure in basic suite for test
metrics_bootstrap.metrics_and_log_collector.
We failed to configure the hosts for metrics collection which causes
failure in the test.
The change that is reported as cause of failure is:
https://github.com/oVirt/ovirt-ansible-manageiq/commit/
6697fc98f934beb9fece6d0224c0b013411b84e6
You can see the full logs here:
https://jenkins.ovirt.org/job/ovirt-4.2_change-queue-tester/3078/artifact...
error:
PLAY RECAP
*********************************************************************
lago-basic-suite-4-2-host-0 : ok=1 changed=0 unreachable=0
failed=1
lago-basic-suite-4-2-host-1 : ok=1 changed=0 unreachable=0
failed=1
localhost : ok=53 changed=24 unreachable=0
failed=0
2018-09-11 09:06:18,713::ssh.py::ssh::96::lago.ssh::DEBUG::Command d69e71be
on lago-basic-suite-4-2-engine errors:
[WARNING]: Could not match supplied host pattern, ignoring: metrics_store
2018-09-11
09:06:18,713::utils.py::_ret_via_queue::60::lago.utils::ERROR::Error while
running thread
Traceback (most recent call last):
File "/usr/lib/python2.7/site-packages/lago/utils.py", line 58, in
_ret_via_queue
queue.put({'return': func()})
File
"/home/jenkins/workspace/ovirt-4.2_change-queue-tester/ovirt-system-tests/basic-suite-4.2/test-scenarios/003_00_metrics_bootstrap.py",
line 54, in configure_metrics
' Exit code is %s' % result.code
File "/usr/lib/python2.7/site-packages/nose/tools/trivial.py", line 29,
in eq_
raise AssertionError(msg or "%r != %r" % (a, b))
AssertionError: Configuring ovirt machines for metrics failed. Exit code is
2
2018-09-11 09:06:38,897::ssh.py::ssh::81::lago.ssh::DEBUG::Command d648c70a
on lago-basic-suite-4-2-engine returned with 0
2018-09-11 09:06:38,897::ssh.py::ssh::89::lago.ssh::DEBUG::Command d648c70a
on lago-basic-suite-4-2-engine output:
This command will collect system configuration and diagnostic
information from this system.
6 years, 2 months
Fwd: [rhev-devel] Error Message in HE master suite: "Operation Failed". Fault detail is "[Cannot add Host. You are using an unmanaged hosted engine VM ....
by Ehud Yonasi
Hey,
I see failures on hosted-engine and when i was looking at the
build-artifacts jobs of he-setup:
http://jenkins.ovirt.org/job/ovirt-hosted-engine-setup_master_build-artif...
I can see a lot of triggering of old patches, so as far as i understand you
restored that repository.
Is the error below is the root cause of that recovery?
Would appreciate a quick answer because we need to merge a patch to fix CQ
failures because of that.
Thanks in advance!
Ehud Yonasi.
---------- Forwarded message ---------
From: Yaniv Kaul <ykaul(a)redhat.com>
Date: Tue, Sep 4, 2018 at 10:03 AM
Subject: Re: [rhev-devel] Error Message in HE master suite: "Operation
Failed". Fault detail is "[Cannot add Host. You are using an unmanaged
hosted engine VM ....
To: Galit Rosenthal <grosenth(a)redhat.com>
Cc: rhev-devel <rhev-devel(a)redhat.com>
Well, looking at engine.log[1], there's NPE around:
ERROR [org.ovirt.engine.core.vdsbroker.monitoring.FullListAdapter]
(EE-ManagedThreadFactory-engineScheduled-Thread-74) [57b53db9] Failed
during parsing configuration of VM b9c1bf34-c42f-4999-97d4-8d4895826abd
(<domain type='kvm' id='1'>
<name>HostedEngine</name>
...
</domain>
), error is: {}: java.lang.NullPointerException
at
org.ovirt.engine.core.vdsbroker.libvirt.VmConverter.extractCoreGeneralInfo(VmConverter.java:47)
[vdsbroker.jar:]
at
org.ovirt.engine.core.vdsbroker.libvirt.VmConverter.convert(VmConverter.java:36)
[vdsbroker.jar:]
at
org.ovirt.engine.core.vdsbroker.monitoring.FullListAdapter.extractCoreInfo(FullListAdapter.java:86)
[vdsbroker.jar:]
at
org.ovirt.engine.core.vdsbroker.monitoring.FullListAdapter.lambda$dumpXmls$0(FullListAdapter.java:54)
[vdsbroker.jar:]
at
java.util.stream.ReferencePipeline$3$1.accept(ReferencePipeline.java:193)
[rt.jar:1.8.0_181]
at
java.util.ArrayList$ArrayListSpliterator.forEachRemaining(ArrayList.java:1382)
[rt.jar:1.8.0_181]
Y.
[1]
https://jenkins.ovirt.org/view/oVirt%20system%20tests/job/ovirt-system-te...
On Tue, Sep 4, 2018 at 9:21 AM, Galit Rosenthal <grosenth(a)redhat.com> wrote:
>
> Hi,
>
> This is an example of a he basic suite master that fails on :
>
> Error Message:
>
> *00:26:36* Error while running thread
>
> *00:26:36* Traceback (most recent call last):
>
> *00:26:36* File "/usr/lib/python2.7/site-packages/lago/utils.py", line 58, in _ret_via_queue
>
> *00:26:36* queue.put({'return': func()})
>
> *00:26:36* File "/home/jenkins/workspace/ovirt-system-tests_he-basic-suite-master/ovirt-system-tests/he-basic-suite-master/test-scenarios/002_bootstrap.py", line 379, in _add_he_host_4
>
> *00:26:36* deploy_hosted_engine=True,
>
> *00:26:36* File "/usr/lib64/python2.7/site-packages/ovirtsdk4/services.py", line 11530, in add
>
> *00:26:36* return self._internal_add(host, headers, query, wait)
>
> *00:26:36* File "/usr/lib64/python2.7/site-packages/ovirtsdk4/service.py", line 232, in _internal_add
>
> *00:26:36* return future.wait() if wait else future
>
> *00:26:36* File "/usr/lib64/python2.7/site-packages/ovirtsdk4/service.py", line 55, in wait
>
> *00:26:36* return self._code(response)
>
> *00:26:36* File "/usr/lib64/python2.7/site-packages/ovirtsdk4/service.py", line 229, in callback
>
> *00:26:36* self._check_fault(response)
>
> *00:26:36* File "/usr/lib64/python2.7/site-packages/ovirtsdk4/service.py", line 132, in _check_fault
>
> *00:26:36* self._raise_error(response, body)
>
> *00:26:36* File "/usr/lib64/python2.7/site-packages/ovirtsdk4/service.py", line 118, in _raise_error
>
> *00:26:36* raise error
>
> *00:26:36* Error: Fault reason is "Operation Failed". Fault detail is "[Cannot add Host. You are using an unmanaged hosted engine VM. Please add the first storage domain in order to start the hosted engine import process.]". HTTP response code is 409.
>
>
>
> https://jenkins.ovirt.org/view/oVirt%20system%20tests/job/ovirt-system-te...
>
> Who can help me with this issue?
>
> Thanks,
> Galit
>
> --
>
> GALIT ROSENTHAL
>
> SOFTWARE ENGINEER
>
> Red Hat
>
> <https://www.redhat.com/>
>
> galit(a)gmail.com T: 972-9-7692230
> <https://red.ht/sig>
>
6 years, 2 months
Failing OST Suites
by Anton Marchukov
Hello All.
This is a reminder to corresponding maintainers to check the OST suits
[1], there are failing ones such as:
ovirt-system-tests_ansible-suite-master
ovirt-system-tests_compat-3.6-suite-master
ovirt-system-tests_compat-4.0-suite-master
ovirt-system-tests_compat-4.1-suite-master
ovirt-system-tests_compat-4.2-suite-master
ovirt-system-tests_hc-basic-suite-4.2
ovirt-system-tests_he-node-ng-suite-master
If everything is not needed we might also consider disabling/deleting.
Thanks.
[1] https://jenkins.ovirt.org/view/oVirt%20system%20tests/
6 years, 2 months