broken pipe at MPI startup

Daniel Ridge newt at scyld.com
Thu Jan 11 08:23:59 PST 2001


What version of MPI are you running and on what platform?

On Wed, 10 Jan 2001, Qian Peng wrote:

> I had a small cluster of 6 duals and recently doubled the size of it.  A
> program once ran fine on the 12 processors.  Now when I run it on all 24
> processors, I will occasionally get "Command terminated on signal 13" error
> at the mpirun level.  The broken pipe is when mpirun is trying to start the
> executables.  If I only use 12 processors, whether from 6 nodes or use one
> processor each from all 12 nodes, I cannot make this error happen.  I'm
> using mpirun with ssh.  It seems to be random when and on which node this
> error occurs.  Any insights on what the possible causes may be?  Thanks,

Regards,
	Dan Ridge
	Scyld Computing Corporation
