[Beowulf] ECC Memory and Job Failures

Huw Lynes lynesh at cardiff.ac.uk
Thu Apr 23 08:45:08 PDT 2009


Thought this might be of interest to others:

http://blog.revolution-computing.com/2009/04/blame-it-on-cosmic-rays.html

Apparently someone ran a large cluster job with both ECC and none-ECC
RAM. They consistently got the wrong answer when foregoing ECC.

I'd love to see the original data.

Thanks,
Huw

-- 
Huw Lynes                       | Advanced Research Computing
HEC Sysadmin                    | Cardiff University
                                | Redwood Building, 
Tel: +44 (0) 29208 70626        | King Edward VII Avenue, CF10 3NB





More information about the Beowulf mailing list