[Beowulf] Surviving a double disk failure

Bill Broadley bill at cse.ucdavis.edu
Fri Apr 10 02:50:09 PDT 2009

Guy Coates wrote:
> Yikes, epic recovery.
>> What are the lessons learnt?
> You forgot the obvious one.

I suggest ditching the silly old CentOS/Red Hat kernels and running something
new enough to support scrubbing.  Otherwise your disks can silently
accumulate errors, waiting to cascade into a lost RAID at the first
non-silent error.
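
For anyone who hasn't set this up, here is a rough sketch of what a scrub
looks like, assuming Linux software RAID (md), which exposes a sync_action
file in sysfs on kernels that support it.  The array name "md0" and the helper
names are just placeholders for illustration; hardware RAID controllers have
their own tools.  Writing to sysfs needs root.

```python
from pathlib import Path


def start_scrub(array: str, sysfs_root: str = "/sys/block") -> None:
    """Ask the md driver to read and verify the whole array.

    Assumes an md array named e.g. "md0" (hypothetical) and a kernel
    that provides /sys/block/<array>/md/sync_action.
    """
    action = Path(sysfs_root) / array / "md" / "sync_action"
    # "check" reads every stripe and counts inconsistencies without
    # rewriting them; unreadable sectors surface now, not mid-rebuild.
    action.write_text("check\n")


def scrub_status(array: str, sysfs_root: str = "/sys/block") -> dict:
    """Return the current sync action and the mismatch counter."""
    md = Path(sysfs_root) / array / "md"
    return {
        "sync_action": (md / "sync_action").read_text().strip(),
        "mismatch_cnt": int((md / "mismatch_cnt").read_text().strip()),
    }


# Example (as root, on a machine that really has /dev/md0):
#   start_scrub("md0")
#   print(scrub_status("md0"))
```

Run something like this from cron every week or so and look at mismatch_cnt
and the kernel log once sync_action goes back to "idle".  The point is to find
the bad sectors while you still have redundancy.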
