[Beowulf] the solution for qdel fail.....
Reuti
reuti at staff.uni-marburg.de
Mon Jan 17 04:54:13 PST 2005
You may have a look at another queuingsystem: GridEngine from SUN, which
offers better choices of the control of tasks on the slave nodes. It's
no problem to shutdown just one of two jobs of the same user on a node.
It will just kill the whole process group of one of the tasks, as they
are started by a special implementation of rshd/sshd private to each
task. This works for MPICH and also MPICH2 (forker and smpd startup
method). - Reuti
Chris Samuel wrote:
> On Tue, 11 Jan 2005 02:49 am, Jerry Xu wrote:
>
>
>>Hi, William, Thank for your information. Just in case somebody still
>>need it for openPBS configuration, here is my epilogue file.it shall be
>>located in $pbshome/mom_priv/ for each node and it need to be set as
>>executable and owned by root. Some others many have better epilogue
>>scripts...
>
>
> Hmm, the only thing that worries me about that is that for those of us with
> SMP clusters it is possible for a user to have two different jobs running on
> each of the CPUs, so an epilogue script that kills all a users processes on a
> node would accidentally kill an innocent job.
>
> cheers,
> Chris
>
>
> ------------------------------------------------------------------------
>
> _______________________________________________
> Beowulf mailing list, Beowulf at beowulf.org
> To change your subscription (digest mode or unsubscribe) visit http://www.beowulf.org/mailman/listinfo/beowulf
More information about the Beowulf
mailing list