[Beowulf] Barcelona hardware error: how to detect

Mikhail Kuzminsky kus at free.net
Thu Jun 5 11:22:36 PDT 2008


In message from "Jason Clinton" <jclinton at advancedclustering.com>
(Thu, 5 Jun 2008 13:16:33 -0500):
>On Thu, Jun 5, 2008 at 1:09 PM, Mikhail Kuzminsky <kus at free.net> wrote:
>
>> In message from Mark Hahn <hahn at mcmaster.ca>
>> (Thu, 5 Jun 2008 13:55:01 -0400 (EDT)):
>>
>>> I'm mystified by this: B2 was broken, so using it without the BIOS
>>> workaround is just a mistake or masochism.  the workaround _did_
>>> apparently have performance implications, but that's why B3 exists...
>>>
>>> do you mean you know of G03 problems on B2 systems which are
>>> operating _with_ the workaround?
>>>
>>
>> I don't know exactly, but I think the crash happened without the
>> workaround, because I was not told of any kernel patches or BIOS
>> changes. This interests me too, because I have no information on how
>> this hardware problem shows up in "real life".
>>  Mikhail
>>
>
>The B2 BIOS work-around is to disable the L3 cache, which gives you a
>10-20% performance hit with no reduction in power consumption.
>
>The kernel patch is very extensive and, last I heard, under NDA. AMD has
>said publicly that the patch gives you a 1-2% performance hit.

This link is old, but it may give some information:

https://www.x86-64.org/pipermail/discuss/2007-December/010260.html

Mikhail



