andrealphus
2016-12-09 22:29:08 UTC
So as our resource user base has grown, we're realizing we probably
need to introduce some type of throttling so that a single user isn't
using all the available nodes at any one time.
We're in a mixed situation: about half our users submit job arrays of
tiny jobs (1 or 2 processors per job) but with thousands of jobs in
the array, while the other half submit a single job, where that single
simulation might need 10-25 nodes to run (we have a 60-node resource).
We could just set a maxnode of 20 nodes, but that doesn't seem to be a
well-received idea.
Is there any way we could set two maxnode resource limits, one for
single jobs and one for job arrays?
Torque Version: 4.2.6.1