[Beowulf] Intel MPI 2.0 mpdboot and large clusters, slow tostart up, sometimes not at all
M J Harvey
m.harvey at imperial.ac.uk
Thu Oct 5 13:42:37 PDT 2006
Hi,
> If you have a batch system that can start the MPDs, you should
> consider starting the MPI processes directly with the batch system and
> providing a separate service to provide the startup information.
You're exactly right. Intel's MPI is derived from MPICH2 and (as we use
PBSPro) OSC's mpiexec should do that job nicely, starting the MPI
processes via PBs's TM API and then speaking to them via PMI. However,
since version 2.0.1 refresh 1, Intel have used a modified (and
incompatible) PMI command set for which documentation hasn't been
forthcoming. Lacking the time to hack about, we've had to revert to
using their mpd for the time being.
Matt
--
Matt Harvey Email: m.j.harvey*imperial.ac.uk
HPC Systems Support Analyst Tel : +44 (0) 20 759 47233
Imperial College London Mob : +44 (0) 77 251 59691
http://www.imperial.ac.uk/ict/services/highperformancecomputing
More information about the Beowulf
mailing list