Discussion:
[torqueusers] Torque: Job Requesting Range of Available Nodes
Christopher Pierce
2016-10-09 18:18:18 UTC
Permalink
Hello All,

I am the maintainer of a small cluster (20 nodes) used by my computation
physics group. I use torque as our job scheduler and have run into an
interesting problem. We typically have either long term jobs that use a
single node or short term jobs that are embarrassingly parallel and
should use up as many nodes as possible. When a user requests resources
for a short term job they will check the number of available nodes and
adjust their requested node count to match that number. This becomes a
problem when long term jobs are pushed to the cluster and the short term
jobs sit in the queue because one of the nodes is now unavailable. I
was wondering if there was any mechanism for a user to submit a job that
requests a range of values for resources. IE a job that will run on as
many nodes as are available constrained to a certain range.

Thank You,

Chris Pierce
Center for Computation Nano-Science, WPI
David Beer
2016-10-10 15:36:35 UTC
Permalink
Chris,

What scheduler are you using? I think that Moab supports this, but Torque
does not have a mechanism for doing this.
Post by Christopher Pierce
Hello All,
I am the maintainer of a small cluster (20 nodes) used by my computation
physics group. I use torque as our job scheduler and have run into an
interesting problem. We typically have either long term jobs that use a
single node or short term jobs that are embarrassingly parallel and
should use up as many nodes as possible. When a user requests resources
for a short term job they will check the number of available nodes and
adjust their requested node count to match that number. This becomes a
problem when long term jobs are pushed to the cluster and the short term
jobs sit in the queue because one of the nodes is now unavailable. I
was wondering if there was any mechanism for a user to submit a job that
requests a range of values for resources. IE a job that will run on as
many nodes as are available constrained to a certain range.
Thank You,
Chris Pierce
Center for Computation Nano-Science, WPI
_______________________________________________
torqueusers mailing list
http://www.supercluster.org/mailman/listinfo/torqueusers
--
David Beer | Torque Architect
Adaptive Computing
Christopher Pierce
2016-10-13 19:04:21 UTC
Permalink
Thanks,

I'm using the Torque scheduler which would explain why I cannot find
that feature.
Post by David Beer
Chris,
What scheduler are you using? I think that Moab supports this, but Torque
does not have a mechanism for doing this.
Post by Christopher Pierce
Hello All,
I am the maintainer of a small cluster (20 nodes) used by my computation
physics group. I use torque as our job scheduler and have run into an
interesting problem. We typically have either long term jobs that use a
single node or short term jobs that are embarrassingly parallel and
should use up as many nodes as possible. When a user requests resources
for a short term job they will check the number of available nodes and
adjust their requested node count to match that number. This becomes a
problem when long term jobs are pushed to the cluster and the short term
jobs sit in the queue because one of the nodes is now unavailable. I
was wondering if there was any mechanism for a user to submit a job that
requests a range of values for resources. IE a job that will run on as
many nodes as are available constrained to a certain range.
Thank You,
Chris Pierce
Center for Computation Nano-Science, WPI
_______________________________________________
torqueusers mailing list
http://www.supercluster.org/mailman/listinfo/torqueusers
_______________________________________________
torqueusers mailing list
http://www.supercluster.org/mailman/listinfo/torqueusers
Loading...