HI, can any one give me references for Fault Tolerant MPI programs with MPICH ? I want information about how to checkpoint MPI programs in MPICH and possible Fault Tolerant architecture for MPI programs in MPICH with regds.. gcr