[Beowulf] bizarre scaling behavior on a Nehalem
kus at free.net
Fri Aug 14 16:24:25 PDT 2009
In message from Bill Broadley <bill at cse.ucdavis.edu> (Fri, 14 Aug 2009
>Mikhail Kuzminsky wrote:
>>> Your results look excellent, so I wouldn't be surprised if they are
>>> running at 1333.
>> I have 12-18 GB/s on 4 threads of stream/ifort w/DDR3-1066 on dual
>> server. But it works under "numa-bad" kernel w/o control of
>> numa-efficient allocation.
>Sounds pretty bad.
>Why 4 threads? You need 8 cores to keep all 6 memory busses busy.
For comparison w/your tests: you have only 4 cores. On 8 threads I
have 20-26 GB/s.
ifort pointed above means intel fortran 11.0.38.
> open64 does substantially better than gcc.
>üÔÏ ÓÏÏÂÝÅÎÉÅ ÂÙÌÏ ÐÒÏ×ÅÒÅÎÏ ÎÁ ÎÁÌÉÞÉÅ × ÎÅÍ ×ÉÒÕÓÏ×
>É ÉÎÏÇÏ ÏÐÁÓÎÏÇÏ ÓÏÄÅÒÖÉÍÏÇÏ ÐÏÓÒÅÄÓÔ×ÏÍ
>MailScanner, É ÍÙ ÎÁÄÅÅÍÓÑ
>ÞÔÏ ÏÎÏ ÎÅ ÓÏÄÅÒÖÉÔ ×ÒÅÄÏÎÏÓÎÏÇÏ ËÏÄÁ.
More information about the Beowulf