broken pipe at MPI startup
Daniel Ridge
newt at scyld.com
Thu Jan 11 08:23:59 PST 2001
What version of MPI are you running and on what platform?
On Wed, 10 Jan 2001, Qian Peng wrote:
> I had a small cluster of 6 dual-processor nodes and recently doubled its
> size. A program that once ran fine on the 12 processors now occasionally
> fails when I run it on all 24 processors: I get a "Command terminated on
> signal 13" error at the mpirun level. The broken pipe occurs while mpirun is
> trying to start the executables. If I use only 12 processors, whether both
> processors on 6 nodes or one processor on each of the 12 nodes, I cannot
> make this error happen. I'm using mpirun with ssh. When and on which node
> the error occurs seems to be random. Any insights into what the possible
> causes may be? Thanks,
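
For reference, signal 13 on Linux is SIGPIPE: the kernel delivers it to a
process that writes to a pipe or socket whose reading end has already been
closed, and the default action is to terminate the writer. The sketch below
only illustrates that mechanism in isolation. It does not reproduce the
mpirun/ssh setup described above and is not a diagnosis of it. The names
CHILD, run_child and describe are made up for this example, and it assumes a
POSIX system with Python 3.

```python
import signal
import subprocess
import sys

# Hypothetical child program for illustration. Python ignores SIGPIPE at
# startup, so the child restores the default action (terminate) first, then
# writes to a pipe whose read end has already been closed.
CHILD = r"""
import os, signal
signal.signal(signal.SIGPIPE, signal.SIG_DFL)
r, w = os.pipe()
os.close(r)               # nobody is left to read from the pipe
os.write(w, b"hello")     # the kernel raises SIGPIPE here
"""


def run_child():
    """Run the child and return its status as reported by subprocess.

    A negative value -N means the child was terminated by signal N.
    """
    return subprocess.run([sys.executable, "-c", CHILD]).returncode


def describe(returncode):
    """Turn a subprocess return code into a short human-readable message."""
    if returncode < 0:
        signum = -returncode
        return "terminated on signal %d (%s)" % (
            signum, signal.Signals(signum).name)
    return "exited with status %d" % returncode


if __name__ == "__main__":
    print(describe(run_child()))
```

A parent that wants to survive a vanished reader can ignore SIGPIPE, in
which case the write fails with EPIPE instead and the error can be handled
in the program.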
Regards,
Dan Ridge
Scyld Computing Corporation