[Beowulf] p4_error
Mark Hahn
hahn at physics.mcmaster.ca
Thu Dec 29 14:33:05 PST 2005
> The following are usual errors that we have to counter every day::
>
> p11_2754:(1.148519)net_recv failed for fd=3.
> p11_22754 : p_4error net_recv read,errno=:104
> p16_2754 : p4_error : interrupt S1GSEGV:11
Your program on p16 is seg-faulting (signal 11, SIGSEGV: a bad address, etc.
It could be your program, some library, or marginal hardware). p11 is trying
to communicate with it, and quite sensibly reports that the socket between
them has disappeared (errno 104):
/usr/include/asm/errno.h:#define ECONNRESET 104 /* Connection reset by peer */
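
If you'd rather not grep the headers each time, here is a minimal sketch for decoding the two numbers that show up in these p4 messages (the errno and the signal). The helper names describe_errno and describe_signal are made up for illustration, and the numeric values are platform-specific: 104 is ECONNRESET on Linux but not necessarily elsewhere, so the example passes the symbolic constants rather than hard-coding numbers.

```python
import errno
import os
import signal


def describe_errno(code):
    """Return the symbolic name and system message for an errno value."""
    # errno.errorcode maps numeric values to names on the current platform.
    name = errno.errorcode.get(code, "UNKNOWN")
    return f"{name} ({code}): {os.strerror(code)}"


def describe_signal(number):
    """Return the symbolic name for a signal number, e.g. 11 on Linux."""
    try:
        return f"{signal.Signals(number).name} ({int(number)})"
    except ValueError:
        # Not a signal number known on this platform.
        return f"UNKNOWN ({int(number)})"


# On a Linux node, describe_errno(104) and describe_signal(11) should
# correspond to the errno and interrupt values in the quoted messages.
print(describe_errno(errno.ECONNRESET))
print(describe_signal(signal.SIGSEGV))
```

This only tells you what the codes mean. The ECONNRESET on p11 is a symptom, so the process to debug is still the one that took the SIGSEGV.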