[Beowulf] ECC Memory and Job Failures
Huw Lynes
lynesh at cardiff.ac.uk
Thu Apr 23 08:45:08 PDT 2009
Thought this might be of interest to others:
http://blog.revolution-computing.com/2009/04/blame-it-on-cosmic-rays.html
Apparently someone ran a large cluster job with both ECC and none-ECC
RAM. They consistently got the wrong answer when foregoing ECC.
I'd love to see the original data.
Thanks,
Huw
--
Huw Lynes | Advanced Research Computing
HEC Sysadmin | Cardiff University
| Redwood Building,
Tel: +44 (0) 29208 70626 | King Edward VII Avenue, CF10 3NB
More information about the Beowulf
mailing list