[Beowulf] Barcelona hardware error: how to detect
Mikhail Kuzminsky
kus at free.net
Thu Jun 5 11:09:58 PDT 2008
In message from Mark Hahn <hahn at mcmaster.ca> (Thu, 5 Jun 2008 13:55:01
-0400 (EDT)):
>>> I believe the absence of 'x' in the B3 column of the table on p 15
>>> means that it _is_ fixed in B3.
>>
>> I received just now some preliminary data about Gaussian-03 run
>>problems w/B2
>> and about absence of this problems w/B3.
>
>I'm mystified by this: B2 was broken, so using it without the bios
>workaround is just a mistake or masochism. the workaround _did_
>apparently have performance implications, but that's why B3 exists...
>
>do you mean you know of G03 problems on B2 systems which are operating
>_with_ the workaround?
I don't know exactly, but I think the crash was under absence of
workaround, because I was not informed that there was some kernel
patches or BIOS changes. This was interesting for me also, because I
have no information how this hardware problem may be affected in the
"real life".
Mikhail
More information about the Beowulf
mailing list