[Beowulf] Theoretical vs. Actual Performance
Prentice Bisbal
pbisbal at pppl.gov
Thu Feb 22 10:50:54 PST 2018
This is my source for those theoretical numbers:
http://dewaele.org/~robbe/thesis/writing/references/49747D_HPC_Processor_Comparison_v3_July2012.pdf
If those numbers are off, that makes my job a bit easier. And it looks
like you're right. In the text above the table, it does mention 2-socket
servers, and then below the table in fine print, it states
"For AMD Opteron Processors, theoretical FLOPS = Core Count x Core
Frequency x number of processors per server x 4."
Why can't the table just show single socket performance? Grrrr....
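
As a quick sanity check, the fine-print formula can be written out as a small Python sketch. The function name and the flops_per_cycle parameter are my own labels, not AMD's; the 16 cores and 2.2 GHz are the figures from Dmitri's message quoted below.

```python
# Sketch of AMD's fine-print formula:
#   theoretical FLOPS = core count x core frequency x processors per server x 4
# Names here are illustrative, not taken from the AMD document.

def theoretical_gflops(cores_per_socket, ghz, sockets, flops_per_cycle=4):
    """Peak double-precision GFLOPS for a server, per the AMD footnote."""
    return cores_per_socket * ghz * sockets * flops_per_cycle

# Opteron 6274: 16 cores at 2.2 GHz (figures from the quoted message).
per_socket = theoretical_gflops(16, 2.2, sockets=1)
per_node = theoretical_gflops(16, 2.2, sockets=2)

print(f"one socket : {per_socket:.1f} GFLOPS")
print(f"two sockets: {per_node:.1f} GFLOPS")
```

With sockets=2 this gives the roughly 282 GFLOPS figure from the table, and with sockets=1 it matches Dmitri's 140.8 GFLOPS.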
Regardless of bad marketing and graphics design, I'm still at square
one. My system has 2 sockets, and the best I've been able to do is get
~115 GFLOPS. And that's one of the 'instantaneous' values LINPACK spits out
every few seconds. At the end of the test, the actual result is more
like 77 GFLOPS:
================================================================================
T/V                N    NB     P     Q               Time                 Gflops
--------------------------------------------------------------------------------
WR00L2L2       82775    40     4     8            4924.71              7.678e+01
This is a two-socket system, so that's only 27% of the ~282 GFLOPS theoretical max.
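
For anyone who wants to check the arithmetic, here is a rough Python sketch that recomputes the Gflops figure from N and the wall time, then divides by the two-socket peak. It assumes the standard HPL operation count of 2/3*N^3 + 3/2*N^2; the function and variable names are just for illustration.

```python
# Sketch: cross-check the HPL result line and compute efficiency.
# Assumes HPL's usual operation count, 2/3*N^3 + 3/2*N^2.

def hpl_gflops(n, seconds):
    """GFLOPS implied by problem size n and wall time in seconds."""
    flops = (2.0 / 3.0) * n**3 + 1.5 * n**2
    return flops / seconds / 1e9

# Result line from the run above: T/V N NB P Q Time Gflops
line = "WR00L2L2 82775 40 4 8 4924.71 7.678e+01"
fields = line.split()
n = int(fields[1])
seconds = float(fields[5])
reported = float(fields[6])

recomputed = hpl_gflops(n, seconds)

# Two-socket peak per the AMD formula: 16 cores x 2.2 GHz x 2 sockets x 4
node_peak = 16 * 2.2 * 2 * 4
efficiency = reported / node_peak

print(f"reported   : {reported:.2f} GFLOPS")
print(f"recomputed : {recomputed:.2f} GFLOPS")
print(f"efficiency : {efficiency:.1%} of {node_peak:.1f} GFLOPS peak")
```

The recomputed value agrees with what LINPACK printed, so the 27% really is the measured rate against the two-socket peak and not a reporting quirk.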
Prentice
On 02/22/2018 01:18 PM, Dmitri Chubarov wrote:
> Hi,
>
> not sure if the 282 GFLOPS number is correct.
>
> We have 16 Bulldozer/Interlagos cores at 2.2 GHz. Each pair of cores
> forms a CMT module. The two cores in the module share an FPU with 2
> 128-bit FMAC units.
>
> In terms of double precision FLOPS, that works out to
> 16 * 2.2 GHz * 2 double precision scalars per SIMD register * 2 FLOPS per FMA
> op = 140.8 GFLOPS
>
> It looks like the 282 GFLOPS number is for a 2P node.
>
> Dima
>
> On 22 February 2018 at 21:37, Prentice Bisbal <pbisbal at pppl.gov
> <mailto:pbisbal at pppl.gov>> wrote:
>
> Beowulfers,
>
> In your experience, how closely does the actual performance of your
> processors match their theoretical performance? I'm
> investigating a performance issue on some of my nodes. These are
> older systems using AMD Opteron 6274 processors. I found
> literature from AMD stating the theoretical performance of these
> processors is 282 GFLOPS, and my LINPACK performance isn't coming
> close to that (I get approximately 33% of that). The number I
> often hear mentioned is that actual performance should be ~85% of
> theoretical performance. Is that a realistic number in your experience?
>
> I don't want this to be a discussion of what could be wrong at
> this point; we will get to that in future posts, I assure you!
>
> --
> Prentice
>
> _______________________________________________
> Beowulf mailing list, Beowulf at beowulf.org
> <mailto:Beowulf at beowulf.org> sponsored by Penguin Computing
> To change your subscription (digest mode or unsubscribe) visit
> http://www.beowulf.org/mailman/listinfo/beowulf
> <http://www.beowulf.org/mailman/listinfo/beowulf>
>
>