[Beowulf] Odd AMD quad core SuperMicro power off issues
Chris Samuel
csamuel at vpac.org
Mon Jul 6 22:20:36 PDT 2009
----- "Jason Clinton" <jclinton at advancedclustering.com> wrote:
Hi Jason,
> We saw a similar power-off issue on a customer of ours who upgraded
> from 2220's to Barcelona's on a similar board; it was reproducible at
> the same failure rate on approximately 160 nodes. After trying just
> about everything under the sun, we wholesale replaced all the memory
> in the entire cluster. The power-offs ceased immediately thereafter
> and have not returned.
We saw that with Barcelona's, but instead going to the
2.3GHz (75W) Shanghai's solved the issue for us - we were
rather surprised to see it reappear with the 2.4GHz (55W)
Shanghai. :-(
cheers,
Chris
--
Christopher Samuel - (03) 9925 4751 - Systems Manager
The Victorian Partnership for Advanced Computing
P.O. Box 201, Carlton South, VIC 3053, Australia
VPAC is a not-for-profit Registered Research Agency
More information about the Beowulf
mailing list