[Beowulf] anyone using SALT on your clusters?
lindahl at pbm.com
Fri Jun 28 12:29:33 PDT 2013
On Fri, Jun 28, 2013 at 09:45:50AM +0100, Jonathan Barber wrote:
> The problem with SSH based approaches is when you have failed nodes -
> normally they cause the entire command to hang until the attempted
> connection times out.
Normally what people do is ping the node before trying ssh on it, and
put reasonable timeouts around both the ssh connect and the command
execution. There's no fundamental reason why dealing with failed nodes
is any different here than with messaging or subscription-plus-messaging.
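
For what it's worth, here is a rough sketch of that ping-then-ssh pattern
in Python. It is only an illustration, not anything from an existing tool:
the function names, the timeout values, and the `runner` parameter (there
so the logic can be exercised without a real cluster) are all made up for
this example. It assumes a Linux iputils `ping`, where `-W` is the reply
wait in seconds, and OpenSSH's `ConnectTimeout` and `BatchMode` options.

```python
import subprocess

def ping_cmd(host, wait_s=1):
    # One echo request; -W is the reply wait in seconds (Linux iputils ping).
    return ["ping", "-c", "1", "-W", str(wait_s), host]

def ssh_cmd(host, command, connect_timeout_s=5):
    # BatchMode=yes makes ssh fail instead of prompting for a password.
    return ["ssh", "-o", "BatchMode=yes",
            "-o", f"ConnectTimeout={connect_timeout_s}", host, command]

def run_on_node(host, command, connect_timeout_s=5, exec_timeout_s=30,
                runner=subprocess.run):
    """Return (host, status, stdout); status is down/timeout/failed/ok."""
    # Step 1: cheap liveness check so a dead node is skipped quickly.
    try:
        ping = runner(ping_cmd(host), stdout=subprocess.DEVNULL,
                      stderr=subprocess.DEVNULL, timeout=connect_timeout_s)
        alive = ping.returncode == 0
    except (subprocess.TimeoutExpired, OSError):
        alive = False
    if not alive:
        return (host, "down", "")

    # Step 2: ssh with a connect timeout, plus an overall limit that also
    # covers the remote command's execution time.
    try:
        result = runner(ssh_cmd(host, command, connect_timeout_s),
                        capture_output=True, text=True,
                        timeout=connect_timeout_s + exec_timeout_s)
    except subprocess.TimeoutExpired:
        return (host, "timeout", "")
    status = "ok" if result.returncode == 0 else "failed"
    return (host, status, result.stdout)
```

Calling something like `run_on_node("node01", "uptime")` for each node (the
hostname is a placeholder), possibly from a thread pool, means one dead node
costs at most the ping wait rather than a full TCP connect timeout.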