[ptp-dev] LML Driver for SGE/TACC
Hi Greg/Claudia/et al.,
I have a contribution question.
I put together a resource manager and LML driver for a couple of the XSEDE/TeraGrid machines at TACC. I'd like to contribute the LML driver to PTP, but there are a couple of things you might not like about it... :)
1. TACC's machines run Sun Grid Engine, but they have a non-standard "showq" command (similar to MOAB's) for obtaining the list of jobs. Moreover, they have disabled collecting job scheduler information through qstat, which I believe would be the usual way to get this information from SGE. So the LML driver isn't a generic driver for SGE: it is specific to TACC's installation.
2. The driver does not populate the node display (or whatever you call it -- the boxes that show up in the System Monitor view) due to scalability issues. For example, ranger.tacc.utexas.edu has 3,936 16-way compute nodes -- a total of 62,976 cores. On my Mac, refreshing that view was locking the UI for about 20 seconds.
So, my questions are: (1) should I contribute my TACC-specific SGE LML driver, and (2) if not, have you thought about providing an extension point for third-party plug-ins to contribute LML drivers?
Jeff