Dear all<br><br> We have a problem of running application that are complied with MPICH. Our Setup is a 16 Node 72 Cpu AMD Opteron cluster which has Rocks-4.1.2 and RHEL 4.0 update 4 installed in it. <br> <br> We are trying to run a benchmark with MPICH which came along with the ROCKS installation. the run starts and then the following error occurs after sometime.
<br><br>" p1_8544: p4_error: Timeout in Establishing connection to remote process: 0 "<br>rm_l_1_8667: (359.417969) net_send: could not write to fd=5, errno=104<br><br>We have been trying the same for the past two days and we didnt get any solution for the above.
<br> <br>Also we downloaded the Latest MPICH 1.2.7p1 and configured the same. now for the same testing with the latest mpich, the code seems to be running in the Master Server no matter, how many number of processors we give.
<br><br>The same testing with LAM/MPI and OPENMPI are working fine. pls provide us a good solution<br>-- <br>Thanks and Regards<br><br>R.Vadivelan<br>CMC Ltd,<br>Bangalore<br><a href="mailto:r.vadivelanrhce@gmail.com">r.vadivelanrhce@gmail.com
</a>