Bug 377592 - Slave stops executing Eclipse Platform tests
Summary: Slave stops executing Eclipse Platform tests
Status: RESOLVED WORKSFORME
Alias: None
Product: Community
Classification: Eclipse Foundation
Component: CI-Jenkins
Version: unspecified
Hardware: PC Linux
Importance: P3 blocker
Target Milestone: ---
Assignee: Eclipse Webmaster CLA
QA Contact:
URL:
Whiteboard:
Keywords:
Depends on:
Blocks: 377365
Reported: 2012-04-25 00:14 EDT by David Williams CLA
Modified: 2012-05-11 14:51 EDT

See Also:


Description David Williams CLA 2012-04-25 00:14:39 EDT
I'm not sure what to make of this. Do our tests run too long? I've seen this several times: after running for 6 hours or so, and getting through about half of our JUnit suites, the slave appears to "end everything". I have gotten results back once, so I do know it's possible!

For example, see

https://hudson.eclipse.org/hudson/job/eclipse-JUnit-Linux2/56/console

FATAL: command execution failed
hudson.util.IOException2: Failed to join the process
	at hudson.Proc$RemoteProc.join(Proc.java:359)
	at hudson.Launcher$ProcStarter.join(Launcher.java:281)
	at hudson.tasks.CommandInterpreter.perform(CommandInterpreter.java:82)
	at hudson.tasks.CommandInterpreter.perform(CommandInterpreter.java:58)
	at hudson.tasks.BuildStepMonitor$1.perform(BuildStepMonitor.java:19)
	at hudson.model.AbstractBuild$AbstractRunner.perform(AbstractBuild.java:630)
	at hudson.model.Build$RunnerImpl.build(Build.java:175)
	at hudson.model.Build$RunnerImpl.doRun(Build.java:137)
	at hudson.model.AbstractBuild$AbstractRunner.run(AbstractBuild.java:429)
	at hudson.model.Run.run(Run.java:1367)
	at hudson.model.FreeStyleBuild.run(FreeStyleBuild.java:46)
	at hudson.model.ResourceController.execute(ResourceController.java:88)
	at hudson.model.Executor.run(Executor.java:145)

.....

Then, further down, it says

.... 

Caused by: java.io.IOException: Unexpected termination of the channel
....

Looks like the node went offline during the build. Check the slave log for the details.

[And those "details" are ... ]

[04/24/12 23:44:31] [SSH] Connection closed.
ERROR: [04/24/12 23:44:31] slave agent was terminated
java.io.IOException: Unexpected termination of the channel
	at hudson.remoting.Channel$ReaderThread.run(Channel.java:1030)
Caused by: java.io.EOFException
	at java.io.ObjectInputStream$BlockDataInputStream.peekByte(ObjectInputStream.java:2554)
	at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1297)
	at java.io.ObjectInputStream.readObject(ObjectInputStream.java:351)
	at hudson.remoting.Channel$ReaderThread.run(Channel.java:1024)
[04/24/12 23:44:31] [SSH] Opening SSH connection to 172.30.206.156:22.
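
For readers unfamiliar with the "Caused by" part of the slave log: the EOFException is raised inside ObjectInputStream.readObject, which is what the JDK does when the underlying stream ends while another object is expected. The sketch below reproduces only that JDK behavior with an in-memory stream. It is not Hudson code; the class name ChannelEofDemo and its methods are hypothetical, and it says nothing about why the SSH connection closed.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.EOFException;
import java.io.IOException;
import java.io.ObjectInputStream;
import java.io.ObjectOutputStream;

// Hypothetical demo class, not part of Hudson or its remoting library.
public class ChannelEofDemo {

    /** Builds a serialization stream that holds only the stream header and no objects. */
    public static byte[] headerOnlyStream() throws IOException {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (ObjectOutputStream out = new ObjectOutputStream(bytes)) {
            out.flush(); // nothing is written besides the header
        }
        return bytes.toByteArray();
    }

    /** Tries to read one object; reports "EOFException" if the stream ends first. */
    public static String readNext(byte[] data) throws IOException, ClassNotFoundException {
        try (ObjectInputStream in = new ObjectInputStream(new ByteArrayInputStream(data))) {
            Object value = in.readObject();
            return "read: " + value;
        } catch (EOFException e) {
            // Same exception type as in the slave log: the peer stopped sending data.
            return "EOFException";
        }
    }

    public static void main(String[] args) throws Exception {
        System.out.println(readNext(headerOnlyStream()));
    }
}
```

In the log above, the reader thread wraps this EOFException in the IOException "Unexpected termination of the channel", so the exception is a symptom of the connection dropping rather than its cause.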
Comment 1 David Williams CLA 2012-04-25 01:36:52 EDT
The same error occurred just now on "slave 1" (the earlier one was from slave 6). This time it happened after a much shorter run, an hour or so.


https://hudson.eclipse.org/hudson/view/Eclipse%20and%20Equinox/job/eclipse-JUnit-Linux2/57/console
Comment 2 David Williams CLA 2012-05-11 14:51:06 EDT
This doesn't happen now. I guess if you wait long enough, Hudson bugs fix themselves :)