[Beowulf] python2.4 error when loose MPICH2 TI with Grid Engine
forum.san at gmail.com
Fri Feb 22 00:23:04 PST 2008
Dear Reuti & members of beowulf,
I need to execute a parallel job thru grid engine.
MPICH2 is installed with Process Manager:mpd.
Added a parallel environment MPICH2 into SGE:
$ qconf -sp MPICH2
start_proc_args /share/apps/MPICH2/startmpi.sh -catch_rsh $pe_hostfile
Added this PE to the default queue: all.q.
mpdboot is done. mpd's are running on two nodes.
The script for submitting this job thru sge is:
$ cat subsamplempi.sh
#$ -S /bin/bash
#$ -N Samplejob
#$ -q all.q
#$ -pe MPICH2 4
#$ -e ERR_$JOB_NAME.$JOB_ID
#$ -o OUT_$JOB_NAME.$JOB_ID
/opt/MPI_LIBS/MPICH2-GNU/bin/mpirun -np $NSLOTS -machinefile
The job is getting submitted, but not executing. The error and output file
/usr/bin/env: python2.4: No such file or directory
$ cat OUT_Samplejob.192
Fri Feb 22 12:57:18 IST 2008
So the problem is coming for python2.4.
$ which python2.4
I googled this error. Then created a symbolic link:
# ln -sf /opt/rocks/bin/python2.4 /bin/python2.4
After this also same error is coming.
I guess the problem might be different. i.e. gridengine might not getting
the link to running mpd.
And the procedure followed by me to configure PE might be wrong.
So, I expect from you to clear my doubts and help me to resolve this error.
1. Is the PE configuration of MPICH2 + grid engine right?
2. Without Tight integration, is there a way to run a MPICh2(mpd) based job
3. In smpd-daemon based and daemonless MPICH2 tight integration, which one
4. Can we do mvapich2 tight integration with SGE? Any differences with
process managers wrt MVAPICH2?
Thanks & Best Regards,
-------------- next part --------------
An HTML attachment was scrubbed...
More information about the Beowulf