[Beowulf] looking for a reference on failure rates

Joe Landman landman at scalableinformatics.com
Mon Mar 7 12:03:46 PST 2005


Hi Jim:

   Something I can cite as primary literature in a paper.  If it is 
anecdotal, that may be fine as well, though I will have to treat it 
differently.

   This is largely for microprocessors, disks, networks, etc.  General 
digital equipment, with a focus on computers in clusters.

   Thanks!

Joe

Jim Lux wrote:
> At 10:01 AM 3/7/2005, Joe Landman wrote:
> 
>> Hi folks:
>>
>>   I am looking for a reference which describes failure rates of modern 
>> computer components as a function of temperature.  The usual rule of 
>> thumb is that every 10 degrees above a certain value doubles the 
>> failure rate (or decreases lifetime).  I would like to look at this 
>> analysis and refer to it for something I am working on.
>>
>>   Thanks
>>
>> Joe
> 
> How rigorous a reference?  Or a general description of failure rates vs 
> temp, for, e.g. microprocessors.
> 
> 
> 
> James Lux, P.E.
> Spacecraft Radio Frequency Subsystems Group
> Flight Communications Systems Section
> Jet Propulsion Laboratory, Mail Stop 161-213
> 4800 Oak Grove Drive
> Pasadena CA 91109
> tel: (818)354-2075
> fax: (818)393-6875
> 
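
For concreteness, the rule of thumb quoted above (failure rate doubling 
for every 10 degrees above some value) can be written as a simple 
exponential scaling.  This is only a minimal sketch: the reference 
temperature of 25 C, the use of Celsius, and the function name are 
assumptions made for illustration, not values taken from any reference.

```python
def relative_failure_rate(temp_c, ref_temp_c=25.0, doubling_step_c=10.0):
    """Failure-rate multiplier relative to the rate at ref_temp_c.

    Encodes the rule of thumb only: the rate doubles for every
    doubling_step_c degrees above ref_temp_c. Both defaults are
    hypothetical placeholders, not measured values.
    """
    return 2.0 ** ((temp_c - ref_temp_c) / doubling_step_c)


if __name__ == "__main__":
    # Multiplier at a few illustrative temperatures (degrees C).
    for t in (25, 35, 45, 55):
        print(t, relative_failure_rate(t))
```

Under these assumptions, a component running 20 degrees above the 
reference would see roughly four times the baseline failure rate.  
Whether real components follow this curve is exactly what a proper 
reference would need to establish.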

-- 
Joseph Landman, Ph.D
Founder and CEO
Scalable Informatics LLC,
email: landman at scalableinformatics.com
web  : http://www.scalableinformatics.com
phone: +1 734 786 8423
fax  : +1 734 786 8452
cell : +1 734 612 4615



