[scyld-users] bpsh in background: defunct processes
Donald Becker
becker at scyld.com
Mon Sep 29 18:32:10 PDT 2003
On Mon, 29 Sep 2003, Anand Bedekar wrote:
> I'm trying to run bpsh in a script that calls bpsh in
> a loop, like this:
>
> for i in 1 2 3
> do
> bpsh -n $i run.sh &
> done
Suggestion: you should be using 'beomap' to get a dynamic schedule:
for i in `beomap --np 3`; do ...
> What happens is that all the processes called within
> run.sh seem to go into a "defunct" state without
> finishing cleanly. This is making the process table
> fill up, so that no more processes can be run.
This sounds like a long-fixed bug in the BProc. The status and
termination messages were being processed in reverse order.
> Is this usual behaviour when calling bpsh to run a
> shell script, given the way I am calling
> 'bpsh -n $i run.sh &' ? Is there some other way to run
> it?
With our new release there is a command named 'beorun' that
automatically combines a scheduler mapping with efficiently controlling
the resulting processes:
beorun --np 3 command;
> Unfortunately all the nodes in the cluster are
> currently out of action because the process table is
> full on all of them, due to the above.
You should be able to restart the cluster nodes in about a second...
> So I can't report on which version of scyld has been installed,
> until the sysadmin reboots the whole thing. I do know
> the machines are P3 running RedHat 7.0, kernel version
> 2.2.19.
That doesn't sound like a Scyld release.
--
Donald Becker becker at scyld.com
Scyld Computing Corporation http://www.scyld.com
914 Bay Ridge Road, Suite 220 Scyld Beowulf cluster system
Annapolis MD 21403 410-990-9993
More information about the Scyld-users
mailing list