
The relevant patch is most definitely the cause of Sunday's issue:

2017-08-30 04:38:52,542-04 ERROR [org.ovirt.engine.core.bll.RunVmOnceCommand] (default task-18) [5c4b4916-9832-4460-a391-cac53ef8f19a] Error during ValidateFailure.: java.lang.ClassCastException: org.ovirt.engine.core.common.businessentities.storage.LunDisk cannot be cast to org.ovirt.engine.core.common.businessentities.storage.DiskImage
        at org.ovirt.engine.core.bll.RunVmCommand.lambda$checkDisksInBackupStorage$1(RunVmCommand.java:1105) [bll.jar:]
        at java.util.stream.ReferencePipeline$3$1.accept(ReferencePipeline.java:193) [rt.jar:1.8.0_141]
        at java.util.HashMap$ValueSpliterator.tryAdvance(HashMap.java:1641) [rt.jar:1.8.0_141]
        at java.util.stream.ReferencePipeline.forEachWithCancel(ReferencePipeline.java:126) [rt.jar:1.8.0_141]
        at java.util.stream.AbstractPipeline.copyIntoWithCancel(AbstractPipeline.java:498) [rt.jar:1.8.0_141]
        at java.util.stream.AbstractPipeline.copyInto(AbstractPipeline.java:485) [rt.jar:1.8.0_141]
        at java.util.stream.AbstractPipeline.wrapAndCopyInto(AbstractPipeline.java:471) [rt.jar:1.8.0_141]
        at java.util.stream.MatchOps$MatchOp.evaluateSequential(MatchOps.java:230) [rt.jar:1.8.0_141]
        at java.util.stream.MatchOps$MatchOp.evaluateSequential(MatchOps.java:196) [rt.jar:1.8.0_141]
        at java.util.stream.AbstractPipeline.evaluate(AbstractPipeline.java:234) [rt.jar:1.8.0_141]
        at java.util.stream.ReferencePipeline.anyMatch(ReferencePipeline.java:449) [rt.jar:1.8.0_141]
        at org.ovirt.engine.core.bll.RunVmCommand.checkDisksInBackupStorage(RunVmCommand.java:1106) [bll.jar:]
        at org.ovirt.engine.core.bll.RunVmCommand.validate(RunVmCommand.java:1020) [bll.jar:]
        at org.ovirt.engine.core.bll.RunVmOnceCommand.validate(RunVmOnceCommand.java:87) [bll.jar:]
        at org.ovirt.engine.core.bll.CommandBase.internalValidate(CommandBase.java:848) [bll.jar:]
        at org.ovirt.engine.core.bll.CommandBase.executeAction(CommandBase.java:402) [bll.jar:]
        at org.ovirt.engine.core.bll.executor.DefaultBackendActionExecutor.execute(DefaultBackendActionExecutor.java:13) [bll.jar:]
        at org.ovirt.engine.core.bll.Backend.runAction(Backend.java:499) [bll.jar:]
        at org.ovirt.engine.core.bll.Backend.runActionImpl(Backend.java:481) [bll.jar:]
        at org.ovirt.engine.core.bll.Backend.runAction(Backend.java:434) [bll.jar:]

On Wed, Aug 30, 2017 at 2:13 PM, Barak Korren <bkorren@redhat.com> wrote:
We have two unresolved issues that are currently causing all master OST runs to fail and preventing us from effectively finding regressions.
The 1st issue, which was already reported on Sunday, is a regression that is causing vm_run to fail and was probably introduced by this patch: https://gerrit.ovirt.org/#/c/79033/41
Here is a recent run that failed with this: http://jenkins.ovirt.org/job/ovirt-master_change-queue-tester/2147/
Logs are here: http://jenkins.ovirt.org/job/ovirt-master_change-queue-tester/2147/artifact/exported-artifacts/basic-suit-master-el7/test_logs/basic-suite-master/post-002_bootstrap.py/
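For readers unfamiliar with this failure mode, the ClassCastException in the engine trace at the top of this thread is what a stream pipeline produces when it casts every disk to DiskImage and then meets a LunDisk. The sketch below is only an illustration under stated assumptions: the Disk, DiskImage and LunDisk classes are hypothetical stand-ins, the onBackupStorage field and both method names are invented for the example, and none of it is taken from RunVmCommand or from the patch in question.

```java
import java.util.Arrays;
import java.util.List;

public class DiskCastSketch {
    // Hypothetical stand-ins for the engine's disk entities; NOT the real oVirt classes.
    static class Disk {}

    static class DiskImage extends Disk {
        // Hypothetical flag, invented for this illustration.
        final boolean onBackupStorage;

        DiskImage(boolean onBackupStorage) {
            this.onBackupStorage = onBackupStorage;
        }
    }

    static class LunDisk extends Disk {}

    // Blind cast: throws ClassCastException as soon as the stream reaches a LunDisk.
    static boolean anyOnBackupStorageUnsafe(List<Disk> disks) {
        return disks.stream()
                .map(disk -> (DiskImage) disk)
                .anyMatch(image -> image.onBackupStorage);
    }

    // Guarded variant: keep only DiskImage instances before casting.
    static boolean anyOnBackupStorageSafe(List<Disk> disks) {
        return disks.stream()
                .filter(DiskImage.class::isInstance)
                .map(DiskImage.class::cast)
                .anyMatch(image -> image.onBackupStorage);
    }

    public static void main(String[] args) {
        List<Disk> disks = Arrays.asList(new LunDisk(), new DiskImage(false));
        System.out.println("guarded check: " + anyOnBackupStorageSafe(disks));
        try {
            anyOnBackupStorageUnsafe(disks);
        } catch (ClassCastException e) {
            System.out.println("unguarded check threw ClassCastException");
        }
    }
}
```

Whether filtering by type is the right fix for the engine depends on how LUN disks are supposed to be treated by the backup-storage check, which the trace alone does not tell us.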
The 2nd issue is a new one that seems to be causing add-host failures, and I've just sent another email about it. The gist of it is that I'm seeing the following failures in the supervdsm logs, and vdsm is not loading:
MainThread::ERROR::2017-08-30 05:55:59,476::initializer::53::root::(_lldp_init) Failed to enable LLDP on eth1
Traceback (most recent call last):
  File "/usr/lib/python2.7/site-packages/vdsm/network/initializer.py", line 51, in _lldp_init
    Lldp.enable_lldp_on_iface(device)
  File "/usr/lib/python2.7/site-packages/vdsm/network/lldp/lldpad.py", line 30, in enable_lldp_on_iface
    lldptool.enable_lldp_on_iface(iface, rx_only)
  File "/usr/lib/python2.7/site-packages/vdsm/network/lldpad/lldptool.py", line 46, in enable_lldp_on_iface
    raise EnableLldpError(rc, out, err, iface)
EnableLldpError: (1, "timeout\n'M00000001C3040000000c04eth1000badminStatus0002rx' command timed out.\n", '', 'eth1')
MainThread::DEBUG::2017-08-30 05:55:59,477::cmdutils::133::root::(exec_cmd) /usr/sbin/lldptool get-lldp -i eth0 adminStatus (cwd None)
This failure seems to have been introduced by the platform and not by oVirt code, because it also happens on code that already passed OST a few days ago.
Please avoid merging any patches except for the purpose of resolving these issues. Nothing is making it into the 'tested' and nightly snapshot repos anyway ATM.
--
Barak Korren
RHV DevOps team, RHCE, RHCi
Red Hat EMEA
redhat.com | TRIED. TESTED. TRUSTED. | redhat.com/trusted