[Beowulf] Opinions of Hyper-threading?
Mark Hahn
hahn at mcmaster.ca
Thu Feb 28 13:02:00 PST 2008
> STREAM Benchmark implementation in CUDA
> Array size (single precision)=8000000
> using 128 threads per block, 62500 blocks
> Function      Rate (MB/s)   Avg time   Min time   Max time
> Copy:         16706.3212    0.0039     0.0038     0.0044
> Scale:        16666.2770    0.0046     0.0038     0.0100
> Add:          18408.0866    0.0053     0.0052     0.0056
> Triad:        18738.6603    0.0052     0.0051     0.0055
I got considerably higher numbers:
STREAM Benchmark implementation in CUDA
Array size (single precision)=8000000
using 128 threads per block, 62500 blocks
Copy:         50006.6051    0.0013     0.0013     0.0013
Scale:        50006.6051    0.0013     0.0013     0.0013
Add:          56409.8044    0.0017     0.0017     0.0017
Triad:        56409.8044    0.0017     0.0017     0.0017
on a "nVidia Corporation G80 [Quadro FX 4600] (rev a2)".
wikipedia quotes a theoretical memory bandwidth of 67.2 GB/s for this card,
so these results are roughly 74-84% of peak.
it didn't matter whether the machine was in runlevel 3 or 5 (that is, without
or with X running), though the X config was just an idle 1280x1024 server.
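
for anyone wanting to sanity-check figures like these, here is a rough sketch
of the arithmetic. it assumes the usual STREAM counting convention (Copy and
Scale touch two arrays per element, Add and Triad touch three, and 1 MB is
10^6 bytes); I have not checked that the CUDA port counts bytes the same way,
though the reported times are consistent with it. the function and variable
names are made up for illustration.

```python
# Sketch: relate STREAM-style rates to bytes moved, elapsed time, and peak.
# Assumes standard STREAM byte counting and 1 MB = 1e6 bytes (not confirmed
# for the CUDA port discussed here).

N = 8_000_000            # array length reported by the benchmark
BYTES_PER_ELEM = 4       # single precision
PEAK_MB_S = 67_200.0     # 67.2 GB/s theoretical, as quoted from Wikipedia

# Arrays read or written per element by each kernel (assumed convention).
ARRAYS_TOUCHED = {"Copy": 2, "Scale": 2, "Add": 3, "Triad": 3}

# Rates from the Quadro FX 4600 run in this post, in MB/s.
REPORTED_MB_S = {
    "Copy": 50006.6051,
    "Scale": 50006.6051,
    "Add": 56409.8044,
    "Triad": 56409.8044,
}


def bytes_moved(kernel):
    """Total bytes read plus written by one pass of the kernel."""
    return ARRAYS_TOUCHED[kernel] * BYTES_PER_ELEM * N


def implied_time_s(kernel):
    """Elapsed time implied by the reported rate."""
    return bytes_moved(kernel) / (REPORTED_MB_S[kernel] * 1e6)


def fraction_of_peak(kernel):
    """Reported rate as a fraction of the theoretical bandwidth."""
    return REPORTED_MB_S[kernel] / PEAK_MB_S


def blocks_needed(n, threads_per_block):
    """Grid size for one thread per element, rounded up."""
    return (n + threads_per_block - 1) // threads_per_block


if __name__ == "__main__":
    print("blocks:", blocks_needed(N, 128))
    for name in REPORTED_MB_S:
        print(f"{name:6s} time={implied_time_s(name):.4f}s "
              f"peak={fraction_of_peak(name):.1%}")
```

the implied times round to the 0.0013 and 0.0017 seconds printed above, and
8000000 elements at 128 threads per block gives the 62500 blocks the benchmark
reports.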
> Kudos to Nvidia for having a linux friendly toolchain that I could find,
> download, install, and compile a code with minimal hassle.
absolutely. AMD has really dropped the ball on this, even though it looks
like they at least announced availability of double precision (DP) earlier...