broken pipe at MPI startup
Many of your questions may have already been answered in earlier discussions or in the FAQ. The search results page will indicate current discussions as well as past list serves, articles, and papers.
Daniel Ridge newt at scyld.comThu Jan 11 08:23:59 PST 2001
- Previous message: broken pipe at MPI startup
- Next message: channel bonding
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]
What version of MPI are you running and on what platform? On Wed, 10 Jan 2001, Qian Peng wrote: > I had a small cluster of 6 duals and recently doubled the size of it. A > program once ran fine on the 12 processors. Now when I run it on all 24 > processors, I will occasionally get "Command terminated on signal 13" error > at the mpirun level. The broken pipe is when mpirun is trying to start the > executables. If I only use 12 processors, whether from 6 nodes or use one > processor each from all 12 nodes, I cannot make this error happen. I'm > using mpirun with ssh. It seems to be random when and on which node this > error occurs. Any insights on what may the possible causes be? Thanks, Regards, Dan Ridge Scyld Computing Corporation
- Previous message: broken pipe at MPI startup
- Next message: channel bonding
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]
More information about the Beowulf mailing list
