[Beowulf] LAM_MPI problem on PBS

Onur Destanoğlu odestanoglu at gmail.com
Tue Aug 23 01:30:24 PDT 2005


Hi,

this is my PBS script;
#PBS -N firstscp
#PBS -l nodes=1:ppn=2
#PBS -l mem=4mb
#PBS -l walltime=1:00:00
#PBS -V
#PBS -m bea
PATH=/usr/kerberos/sbin:/usr/kerberos/bin:/usr/local/sbin:/usr/local/bin:/sbin:/bin:/usr/sbin:/usr/bin:/usr/X11R6/bin:/root/bin
export PATH
lamboot -v
mpirun -v C first
lamhalt -v

my systems /home directory is nfs shared between all nodes, so there
is onl one hosts file in user niyazi's home directory, this is the
hosts file;

node00
node01
node02
node03
node04
node05

node00 is not my execution node it only runs pbs_server and pbs_sched.

when i run the script i encounter some problems like these;

one error file;

n-1<2289> ssi:boot:base:linear: booting n0 (localhost)
n-1<2289> ssi:boot:base:linear: finished

one output file:

LAM 7.1.1/MPI 2 C++/ROMIO - Indiana University

2294 first running on n0 (o)
Hello, I am 0 of the nodes : 1 

LAM 7.1.1/MPI 2 C++/ROMIO - Indiana University

Shutting down LAM
hreq: received HALT_ACK from n0 (bee01.bee-hive)
LAM halted

so what's is going wrong?




More information about the Beowulf mailing list