What happens with a failed node? (Scyld)

Ray Jones rjones at merl.com
Mon Feb 11 15:12:53 PST 2002

Sean Dilda writes:

> If the slave node doesn't get a ping response in 30 seconds, the node
> will reboot.  On boot it will then try to connect to the master again,
> and if there are problems it will keep rebooting until it can connect.

Is it possible to change this behavior?  We would like to set up our
cluster such that when nodes lose contact with the head, they shut
down rather than reboot.

Ray Jones

More information about the Beowulf mailing list