[Beowulf] anyone using SALT on your clusters?
john.hearns at mclaren.com
Tue Jul 2 01:32:59 PDT 2013
> Our NoSQL database uses pub-sub for cluster membership, and we found
> that the hosts with screwed up RAID controllers could easily stay in
> the cluster even if they were really screwed up. We had to add some extra
> watchdogs and tests that the system disk is working.
I've seen that before with a RAID controller when one disk fails.
System pings, OS and TCP-IP stack are up, but the system disk has been marked write-only.
I'm still pretty amazed that the Linux OS soldiers on in this state.
One to watch out for!
The contents of this e-mail are confidential and for the exclusive use of the intended recipient.
If you are not the intended recipient you should not read, copy, retransmit or disclose its contents.
If you have received this email in error please delete it from your system immediately and notify us either by email or telephone.
The views expressed in this communication may not necessarily be the views held by McLaren Racing Limited.
McLaren Racing Limited | McLaren Technology Centre | Chertsey Road | Woking | Surrey | GU21 4YH | UK | Company Number: 01517478
-------------- next part --------------
An HTML attachment was scrubbed...
More information about the Beowulf