[
https://ovirt-jira.atlassian.net/browse/OVIRT-938?page=com.atlassian.jira...
]
Barak Korren commented on OVIRT-938:
------------------------------------
Got this from the slave log in Jenkis:
{code}
ERROR: Connection terminated
ESC[8mha:AAAAWB+LCAAAAAAAAP9b85aBtbiIQSmjNKU4P08vOT+vOD8nVc8DzHWtSE4tKMnMz/PLL0ldFVf2c+b/lb5MDAwVRQxSaBqcITRIIQMEMIIUFgAAckCEiWAAAAA=ESC[0mjava.io.IOException:
Unexpected termination of the channel
at
hudson.remoting.SynchronousCommandTransport$ReaderThread.run(SynchronousCommandTransport.java:50)
Caused by: java.io.EOFException
at
java.io.ObjectInputStream$PeekInputStream.readFully(ObjectInputStream.java:2353)
at
java.io.ObjectInputStream$BlockDataInputStream.readShort(ObjectInputStream.java:2822)
at java.io.ObjectInputStream.readStreamHeader(ObjectInputStream.java:804)
at java.io.ObjectInputStream.<init>(ObjectInputStream.java:301)
at hudson.remoting.ObjectInputStreamEx.<init>(ObjectInputStreamEx.java:48)
at
hudson.remoting.AbstractSynchronousByteArrayCommandTransport.read(AbstractSynchronousByteArrayCommandTransport.java:34)
at
hudson.remoting.SynchronousCommandTransport$ReaderThread.run(SynchronousCommandTransport.java:48)
Slave JVM has terminated. Exit signal=TERM
[12/21/16 15:23:34] [SSH] Connection closed.
{code}
Fix Jenkins slave connection dying on vdsm check_merged jobs
------------------------------------------------------------
Key: OVIRT-938
URL:
https://ovirt-jira.atlassian.net/browse/OVIRT-938
Project: oVirt - virtualization made easy
Issue Type: Bug
Components: Jenkins
Reporter: Barak Korren
Assignee: infra
Something in the vdsm build_artifacs job makes the Jenkins slave disconnect when it is
running. This in turn makes the cleanup scripts not run on the slave leaving it dirty
enough to make the next job on that slave fail.
Example of this can be seen here:
http://jenkins.ovirt.org/job/vdsm_master_check-merged-el7-x86_64/692/console
Relevant log lines:
{code}
21:49:00 Ran 44 tests in 1231.988s
21:49:00
21:49:00 OK
21:49:00 + return 0
21:49:00 sh: [13086: 1 (255)] tcsetattr: Inappropriate ioctl for device
21:49:00 Took 2464 seconds
21:49:00 ===================================
21:49:00 logout
21:49:01 Slave went offline during the build
21:49:01 ERROR: Connection was broken: java.io.IOException: Unexpected termination of the
channel
21:49:01 at
hudson.remoting.SynchronousCommandTransport$ReaderThread.run(SynchronousCommandTransport.java:50)
21:49:01 Caused by: java.io.EOFException
21:49:01 at
java.io.ObjectInputStream$PeekInputStream.readFully(ObjectInputStream.java:2353)
21:49:01 at
java.io.ObjectInputStream$BlockDataInputStream.readShort(ObjectInputStream.java:2822)
21:49:01 at java.io.ObjectInputStream.readStreamHeader(ObjectInputStream.java:804)
21:49:01 at java.io.ObjectInputStream.<init>(ObjectInputStream.java:301)
21:49:01 at
hudson.remoting.ObjectInputStreamEx.<init>(ObjectInputStreamEx.java:48)
21:49:01 at
hudson.remoting.AbstractSynchronousByteArrayCommandTransport.read(AbstractSynchronousByteArrayCommandTransport.java:34)
21:49:01 at
hudson.remoting.SynchronousCommandTransport$ReaderThread.run(SynchronousCommandTransport.java:48)
21:49:01
21:49:01 Build step 'Execute shell' marked build as failure
21:49:01 Performing Post build task...
21:49:01 Match found for :.* : True
21:49:01 Logical operation result is TRUE
21:49:01 Running script : #!/bin/bash -x
21:49:01 echo "shell-scripts/mock_cleanup.sh"
... SNIP ...
21:49:01 Exception when executing the batch command : no workspace from node
hudson.slaves.DumbSlave[fc24-vm06.phx.ovirt.org] which is computer
hudson.slaves.SlaveComputer@30863c81 and has channel null
21:49:01 Build step 'Post build task' marked build as failure
21:49:02 ERROR: Step ?Archive the artifacts? failed: no workspace for
vdsm_master_check-merged-el7-x86_64 #692
21:49:02 ERROR: Failed to evaluate groovy script.
21:49:02 java.lang.NullPointerException: Cannot invoke method child() on null object
21:49:02 at org.codehaus.groovy.runtime.NullObject.invokeMethod(NullObject.java:77)
21:49:02 at
org.codehaus.groovy.runtime.callsite.PogoMetaClassSite.call(PogoMetaClassSite.java:45)
21:49:02 at
org.codehaus.groovy.runtime.callsite.CallSiteArray.defaultCall(CallSiteArray.java:42)
21:49:02 at
org.codehaus.groovy.runtime.callsite.NullCallSite.call(NullCallSite.java:32)
21:49:02 at
org.codehaus.groovy.runtime.callsite.CallSiteArray.defaultCall(CallSiteArray.java:42)
21:49:02 at
org.codehaus.groovy.runtime.callsite.AbstractCallSite.call(AbstractCallSite.java:108)
21:49:02 at
org.codehaus.groovy.runtime.callsite.AbstractCallSite.call(AbstractCallSite.java:116)
21:49:02 at Script1.run(Script1.groovy:2)
21:49:02 at groovy.lang.GroovyShell.evaluate(GroovyShell.java:580)
21:49:02 at groovy.lang.GroovyShell.evaluate(GroovyShell.java:618)
21:49:02 at groovy.lang.GroovyShell.evaluate(GroovyShell.java:589)
21:49:02 at
org.jenkinsci.plugins.scriptsecurity.sandbox.groovy.SecureGroovyScript.evaluate(SecureGroovyScript.java:166)
21:49:02 at
org.jvnet.hudson.plugins.groovypostbuild.GroovyPostbuildRecorder.perform(GroovyPostbuildRecorder.java:361)
21:49:02 at hudson.tasks.BuildStepMonitor$1.perform(BuildStepMonitor.java:20)
21:49:02 at
hudson.model.AbstractBuild$AbstractBuildExecution.perform(AbstractBuild.java:782)
21:49:02 at
hudson.model.AbstractBuild$AbstractBuildExecution.performAllBuildSteps(AbstractBuild.java:723)
21:49:02 at hudson.model.Build$BuildExecution.post2(Build.java:185)
21:49:02 at
hudson.model.AbstractBuild$AbstractBuildExecution.post(AbstractBuild.java:668)
21:49:02 at hudson.model.Run.execute(Run.java:1763)
21:49:02 at hudson.model.FreeStyleBuild.run(FreeStyleBuild.java:43)
21:49:02 at hudson.model.ResourceController.execute(ResourceController.java:98)
21:49:02 at hudson.model.Executor.run(Executor.java:410)
21:49:02 Build step 'Groovy Postbuild' marked build as failure
21:49:02 Started calculate disk usage of build
21:49:02 Finished Calculation of disk usage of build in 0 seconds
21:49:02 Finished: FAILURE
{code}
--
This message was sent by Atlassian JIRA
(v1000.621.5#100023)