[Beowulf] 96 cores in silent and small enclosure
Mark Hahn
hahn at mcmaster.ca
Mon Apr 12 06:48:28 PDT 2010
> For instance, if the limit is the junction temperature, there's some thermal
> resistance between the reference junction and the measurement point, so if
> the chip is dissipating more, the delta T between limiting point and
> measurement point is greater.
and therefore the published max operating temp would be lower.
> work hotter or cooler. (and a lot of times, those limits might be
> determined empirically... They test a bunch of cases, and that's what gets
> published in the data sheet)
no, I believe it's standard practice to characterize individual chips
as they're produced ("binning"), with specific markings (clock, temp range)
to communicate the results.
http://support.amd.com/us/Embedded_TechDocs/43374.pdf
has a number of tables that, for instance, define a thermal profile ("V")
which requires a Tcase max of 64C at 165W (! though there aren't actually
any models listed in the doc rated to dissipate that much). all the profiles
listed have a tcase-max of between 64 and 86C.
>> if you operate the chip within spec, you should expect the lifespan
>> to be plenty long (basically indefinite, but let's say 10 years...)
>
> Maybe, maybe not. The chip life generally follows Arrhenius rule (roughly
> halving life for 10C rise), but it's hard to know what the "rated" life is,
hmm, I was assuming a lifetime warranty, but indeed, the terms are 3 years.
it would be surprising if you couldn't operate the chip within its thermal
spec, continuously, for 3 years, with low failure rates...
More information about the Beowulf
mailing list