[Beowulf] LAM_MPI problem on PBS
Onur Destanoğlu
odestanoglu at gmail.com
Tue Aug 23 01:30:24 PDT 2005
Hi,
this is my PBS script;
#PBS -N firstscp
#PBS -l nodes=1:ppn=2
#PBS -l mem=4mb
#PBS -l walltime=1:00:00
#PBS -V
#PBS -m bea
PATH=/usr/kerberos/sbin:/usr/kerberos/bin:/usr/local/sbin:/usr/local/bin:/sbin:/bin:/usr/sbin:/usr/bin:/usr/X11R6/bin:/root/bin
export PATH
lamboot -v
mpirun -v C first
lamhalt -v
my systems /home directory is nfs shared between all nodes, so there
is onl one hosts file in user niyazi's home directory, this is the
hosts file;
node00
node01
node02
node03
node04
node05
node00 is not my execution node it only runs pbs_server and pbs_sched.
when i run the script i encounter some problems like these;
one error file;
n-1<2289> ssi:boot:base:linear: booting n0 (localhost)
n-1<2289> ssi:boot:base:linear: finished
one output file:
LAM 7.1.1/MPI 2 C++/ROMIO - Indiana University
2294 first running on n0 (o)
Hello, I am 0 of the nodes : 1
LAM 7.1.1/MPI 2 C++/ROMIO - Indiana University
Shutting down LAM
hreq: received HALT_ACK from n0 (bee01.bee-hive)
LAM halted
so what's is going wrong?
More information about the Beowulf
mailing list