Skip to main content

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index] [List Home]
[ptp-dev] LML Driver for SGE/TACC

Hi Greg/Claudia/et al.,

I have a contribution question.

I put together a resource manager and LML driver for a couple of the XSEDE/TeraGrid machines at TACC.  I'd like to contribute the LML driver to PTP, but there are a couple of things you might not like about it... :)

1. TACC's machines run Sun Grid Engine, but they have a non-standard "showq" (similar to MOAB) for obtaining the list of jobs.  Moreover, they disabled collecting job scheduler information through qstat, which I believe would be the usual way to get this information from SGE.  So, the LML driver isn't a generic driver for SGE: it is specific to TACC's installation.

2. The driver does not populate the node display (or whatever you call it -- the boxes that shows up in the System Monitor view) due to scalability issues.  For example, ranger.tacc.utexas.edu has 3,936 16-way compute nodes -- a total of 62,976 cores.  On my Mac, refreshing that view was locking the UI for about 20 seconds.

So, my question is, (1) should I contribute my TACC-specific SGE LML driver, and (2) if not, have you thought about providing an extension point for third-party plug-ins to contribute LML drivers?

Jeff

Back to the top