ch_p4 Error -> System Hangs

Chadalavada Kalyana Krishna kalyanakrishna at yahoo.com
Tue Nov 6 01:51:11 PST 2001


Hello all,

I am working on a 7 node Linux Cluster ( 6 compute
nodes , 1 FS).  I tried to run simple Hello World
Program. The C Program went through with out any
glitches. When I tried the same in FORTRAN, the
system from which the program was started, hung. I
could not trace out the source to any s/w problem or
installation, though I am not sure about it.

Repeated attempts to run the same resulted in hanging
of n09, n11, n13,n14, n15. I was not able to Ping to
the systems. But, I also do not understand why n10 did
not hang though I ran the program there too.

Ths display is :

Code: some numbres.

Alicee: Killed Interrupt handler
Kernel Panic: Interrupt Handler not syncing

One important point is that we have configured mpich
to use ssh instead of rsh for communication.

with reagrds,

Kalyan.Ch

=====
------------------------------------------------------------
Ch.Kalyana Krishna,
Parallel Processing Group,
National PARAM Super Computing Facility, Center for Development of Advanced Computing,
Pune University Campus,Pune - 411 007, India.
Ph: Off:+91-20-5694080 Res: +91-20-589255

__________________________________________________
Do You Yahoo!?
Find a job, post your resume.
http://careers.yahoo.com



More information about the Beowulf mailing list