[Beowulf] Optimal BIOS settings for Tyan K8SRE

Bruce Allen ballen at gravity.phys.uwm.edu
Mon Sep 4 17:49:28 PDT 2006


PS: be sure to use the 'mcelog' utility and package to monitor for ECC 
errors.  If you have a large number of nodes this will help to identify 
flaky memory and cpus with cache memory issues.

On Mon, 4 Sep 2006, stephen mulcahy wrote:

> Hi Bruce,
>
> Do you have any idea what the performance impact from enabling scrubbing
> is on your systems? did you do any before/after benchmarking?
>
> Thanks,
>
> -stephen
>
> Bruce Allen wrote:
>> On Sun, 3 Sep 2006, Mark Hahn wrote:
>>
>>>>     ECC Features
>>>>         ECC    Enabled
>>>>         ECC Scrub Redirection    Enabled
>>>>         Dram ECC Scrub CTL    Disabled
>>>>         Chip-Kill    Disabled
>>>>         DCACHE ECC Scrub CTL    Disabled
>>>>         L2 ECC Scrub CTL    Disabled
>>
>> You can find our systems BIOS/ECC/Scrub settings here:
>> http://www.lsc-group.phys.uwm.edu/beowulf/nemo/construction/BIOS/bios_settings.txt
>>
>> Our systems are Supermicro H8SSL-i motherboards, with a
>> Serverworks/Broadcom HT1000 chipset and a single Opteron 175 (dual core,
>> 2.2 GHz).
>>
>> The ECC part is:
>>  DRAM ECC Enable = Enabled
>>  MCA DRAM ECC Logging = Enabled
>>  DRAM Scrub Redirect = Enabled
>>  DRAM BG Scrub = 2.62ms
>>  L2 Cache BG Scrub = 84.00ms
>>  Data Cache BG Scrub = 84.00ms
>>
>> Scrubbing is done one cache line (64) bytes at a time.  Thus with 2GB of
>> memory and DRAM background scrub interval of 2.62ms we will scrub the
>> entire memory in approximately:
>>
>> 2 GB/64 Bytes * 2.62 ms = 2^31 / 2^6 * 2.62 ms = 87912 secs
>>
>> So our choices correspond to one complete scrub of DRAM per day.  Our
>> settings scrub the L2 cache more often: about once every half hour.
>> Just modify the calculation above, using 1MB instead of 2GB, and 84 ms
>> instead of 2.62 ms.  One finds that the L2 cache is scrubbed about once
>> every 1376 seconds (every 23 minutes).
>>
>> Cheers,
>>     Bruce
>
>



More information about the Beowulf mailing list