Bug 535294 - Display ports are not available on hippcentos
Summary: Display ports are not available on hippcentos
Status: RESOLVED FIXED
Alias: None
Product: Community
Classification: Eclipse Foundation
Component: CI-Jenkins
Version: unspecified
Hardware: PC Linux
Importance: P1 critical
Target Milestone: ---
Assignee: CI Admin Inbox
QA Contact:
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2018-05-29 13:31 EDT by Sravan Kumar Lakkimsetti
Modified: 2018-06-01 04:05 EDT (History)
CC List: 4 users

See Also:


Description Sravan Kumar Lakkimsetti 2018-05-29 13:31:25 EDT
We are seeing an error saying that all VNC ports are allocated or blacklisted:

https://ci.eclipse.org/releng/view/Automated%20tests/job/ep48I-unit-cen64-gtk3-java9/502/console
Comment 1 Mikaël Barbero 2018-05-30 02:47:52 EDT
Done.
Comment 2 Sravan Kumar Lakkimsetti 2018-05-30 04:39:55 EDT
Restarting did not help. It looks like a different issue.

Here is the exact log:

FATAL: All available display numbers are allocated or blacklisted.
allocated: [1490, 1491, 1492, 1493, 1494, 1495, 1496, 1497, 1498, 1499]
blacklisted: []
java.lang.RuntimeException: All available display numbers are allocated or blacklisted.
allocated: [1490, 1491, 1492, 1493, 1494, 1495, 1496, 1497, 1498, 1499]
blacklisted: []
	at hudson.plugins.xvnc.DisplayAllocator.doAllocate(DisplayAllocator.java:59)
	at hudson.plugins.xvnc.DisplayAllocator.allocate(DisplayAllocator.java:49)
	at hudson.plugins.xvnc.Xvnc.doSetUp(Xvnc.java:106)
	at hudson.plugins.xvnc.Xvnc.setUp(Xvnc.java:96)
	at jenkins.tasks.SimpleBuildWrapper.setUp(SimpleBuildWrapper.java:146)
	at hudson.model.Build$BuildExecution.doRun(Build.java:157)
	at hudson.model.AbstractBuild$AbstractBuildExecution.run(AbstractBuild.java:504)
	at hudson.model.Run.execute(Run.java:1727)
	at hudson.model.FreeStyleBuild.run(FreeStyleBuild.java:43)
	at hudson.model.ResourceController.execute(ResourceController.java:97)
	at hudson.model.Executor.run(Executor.java:429)

Restarting the machine should also have restarted VNC, so I am not sure what the exact problem is here. We are still facing the issue.
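
For illustration, here is a minimal Java sketch of how a fixed-range allocator can end up in the state shown in the log. It assumes display numbers are tracked in an in-memory set and are only released by an explicit free call. DisplayPool and its methods are hypothetical names used for this sketch, not the xvnc plugin's actual code.

```java
import java.util.Set;
import java.util.TreeSet;

// Hypothetical model of a fixed-range display allocator.
// This is NOT the xvnc plugin's implementation, only a sketch of the failure mode.
class DisplayPool {
    private final int min;
    private final int max;
    private final Set<Integer> allocated = new TreeSet<>();

    DisplayPool(int min, int max) {
        this.min = min;
        this.max = max;
    }

    // Hands out the lowest free number, or fails when the whole range is taken.
    synchronized int allocate() {
        for (int n = min; n <= max; n++) {
            if (allocated.add(n)) {
                return n;
            }
        }
        throw new IllegalStateException(
                "All available display numbers are allocated: " + allocated);
    }

    // Must be called when a build ends; a skipped call leaks the number.
    synchronized void free(int n) {
        allocated.remove(n);
    }

    public static void main(String[] args) {
        DisplayPool pool = new DisplayPool(1490, 1499);
        for (int i = 0; i < 10; i++) {
            pool.allocate(); // simulate ten builds that never free their display
        }
        try {
            pool.allocate(); // the eleventh request has nothing left to use
        } catch (IllegalStateException e) {
            System.out.println(e.getMessage());
        }
    }
}
```

If the bookkeeping works roughly like this, leaked entries stay in the set until the process holding it forgets them, whether or not any VNC server is still running.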
Comment 3 Dani Megert 2018-05-30 05:11:35 EDT
See also bug 535090.
Comment 4 Dani Megert 2018-05-30 10:22:52 EDT
Is this fixed now?
Comment 5 Mikaël Barbero 2018-05-30 11:48:14 EDT
I can't find anything wrong in there. Sravan, can you confirm it's working again?
Comment 6 Sravan Kumar Lakkimsetti 2018-05-31 02:07:29 EDT
I can still reproduce this issue.

Effectively, we have only one working executor for Linux tests on the releng HIPP.

https://hudson.eclipse.org/releng/view/Automated%20tests/job/ep48I-unit-cen64-gtk3/397/console
https://hudson.eclipse.org/releng/view/Automated%20tests/job/ep48I-unit-cen64-gtk3-java9/507/console
Comment 7 Eclipse Webmaster 2018-05-31 14:56:45 EDT
I don't see anything in the system logs around the times in question, so it seems like it may be a Jenkins issue.  I have restarted hippcentos just in case it was holding on to something.

-M.
Comment 8 Sravan Kumar Lakkimsetti 2018-06-01 02:16:04 EDT
Right now we cannot run any tests. Until yesterday we had one effective executor; now none of the executors work.

https://hudson.eclipse.org/releng/view/Automated%20tests/job/ep48I-unit-cen64-gtk3/403/console
Comment 9 Mikaël Barbero 2018-06-01 03:11:16 EDT
I ran:

import jenkins.*
import jenkins.model.Jenkins

Jenkins jenkins = Jenkins.getActiveInstance();
xvncDescriptor = jenkins.getDescriptorByType(hudson.plugins.xvnc.Xvnc.DescriptorImpl.class)

xvncDescriptor.allocators.each {
  allocator = it.value
  // collect() makes numAlloc a new list rather than a reference to the same collection;
  // otherwise removing elements while iterating throws a ConcurrentModificationException
  numAlloc = allocator.allocatedNumbers.collect()

  numAlloc.each {
    allocator.allocatedNumbers.remove(it)
  }
}

Please try again.
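
The comment in the script about copying the collection first can be shown in isolation. Below is a small Java sketch, assuming the allocated numbers live in an ordinary HashSet; SnapshotRemoval and its method names are hypothetical and only demonstrate the pattern, they are not part of Jenkins or the xvnc plugin.

```java
import java.util.ArrayList;
import java.util.ConcurrentModificationException;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical helper demonstrating the snapshot-then-remove pattern.
class SnapshotRemoval {

    // Iterates over a copy, so removing from the live set is safe.
    static void clearViaSnapshot(Set<Integer> live) {
        List<Integer> snapshot = new ArrayList<>(live);
        for (Integer n : snapshot) {
            live.remove(n);
        }
    }

    // Removes while iterating the live set itself.
    // Returns true if the loop failed with ConcurrentModificationException.
    static boolean clearInPlaceFails(Set<Integer> live) {
        try {
            for (Integer n : live) {
                live.remove(n);
            }
            return false;
        } catch (ConcurrentModificationException e) {
            return true;
        }
    }

    public static void main(String[] args) {
        Set<Integer> allocated = new HashSet<>();
        for (int n = 1490; n <= 1499; n++) {
            allocated.add(n);
        }
        System.out.println("in-place removal failed: "
                + clearInPlaceFails(new HashSet<>(allocated)));
        clearViaSnapshot(allocated);
        System.out.println("remaining after snapshot removal: " + allocated.size());
    }
}
```

In plain Java a single clear() call would empty the set as well; the copy-based loop is shown because it mirrors what the script does.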
Comment 10 Sravan Kumar Lakkimsetti 2018-06-01 04:05:39 EDT
(In reply to Mikaël Barbero from comment #9)

The tests have started successfully now. It will take another 5-6 hours before they complete. I will reopen this bug if I get any other errors.