[Beowulf] Functionality of schedulers
raysonlogin at gmail.com
Wed Feb 29 23:22:12 PST 2012
Almost every cluster job scheduler supports preemption (use it as the
google keyword and you will find lots of references...).
Torque has job preemption. I have not used Torque for a while (I used
to be an OpenPBS user) so I am not the best person to answer the
question for Torque. However, if you google "job preemption"+torque,
you should be able to find some useful info.
In Grid Engine (and now Open Grid Scheduler) there is Subordinate Queues:
Condor is well-known for cycle stealing, and it also offers the
checkpoint restart library for Open Grid Scheduler/Grid Engine and
other batch systems:
(I am sure that you can integrate checkpointing with Torque so you
don't need to look for a new batch system to get what you need.)
Open Grid Scheduler / Grid Engine
Scalable Grid Engine Support Program
On Thu, Mar 1, 2012 at 1:52 AM, Jon Tegner <tegner at renget.se> wrote:
> Hi list!
> Is there any scheduler which has the functionality to automatically put
> a running job on hold when another job with higher priority is submitted?
> Preferably the state of the first job should be frozen, and saved to
> disk, so that it can be restarted again when the higher priority job has
> Is this at all possible (we are using torque/maui, and I couldn't find
> this feature there)?
> Beowulf mailing list, Beowulf at beowulf.org sponsored by Penguin Computing
> To change your subscription (digest mode or unsubscribe) visit http://www.beowulf.org/mailman/listinfo/beowulf
Open Grid Scheduler - The Official Open Source Grid Engine
More information about the Beowulf