[Beowulf] MPICH fault handling
gvinodh1980 at yahoo.co.in
Sat Oct 30 00:38:35 PDT 2004
I established a four-node Beowulf cluster using
While testing, I started the mpd daemon on all the nodes
from the master with mpdboot, then unplugged one slave
node from the LAN. When I then tried to run a program
with mpiexec, the master node did not recognise that
one of the nodes had failed.
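
One way to catch the unplugged-node case before launching a job is a plain
reachability check run from the master. The following is only a minimal
sketch, not part of MPICH or mpd: the function names is_reachable and
unreachable_nodes are hypothetical, and probing TCP port 22 assumes every
node runs an SSH daemon. It detects a node that is off the network before
mpiexec is started; it does not make a running MPI job fault tolerant.

```python
import socket


def is_reachable(host, port=22, timeout=2.0):
    """Return True if a TCP connection to host:port succeeds within timeout.

    Port 22 (SSH) is an assumption; use any port known to be open on the nodes.
    """
    try:
        # create_connection raises OSError (refused, timed out, unresolved name).
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def unreachable_nodes(hosts, port=22, timeout=2.0):
    """Return the subset of hosts that did not accept a connection."""
    return [h for h in hosts if not is_reachable(h, port, timeout)]
```

Calling unreachable_nodes with the host names used for mpdboot returns the
nodes that did not answer, so the job can be held back or started on the
remaining nodes only.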
I then checked the www.beowulf.org archives; the last
discussion about MPI node failure was in Jan -
So I would like to know whether there have been any
updates on MPI fault handling.
What can I do if:
1. any slave node fails?
2. the master node fails?