Impressive stream results (nvidia nforce/crush)

Bill Broadley bill at math.ucdavis.edu
Tue Feb 26 00:34:03 PST 2002


I'm building some thin clients and picked up a micro-ATX Abit board
with the nvidia nforce/crush chipset.  Just for fun I ran stream on an
array twice as big as stock:

gcc -O1
Function      Rate (MB/s)   RMS time     Min time     Max time
Copy:         715.3312       0.0895       0.0895       0.0897
Scale:        683.3954       0.0937       0.0937       0.0938
Add:          819.3646       0.1172       0.1172       0.1172
Triad:        677.4304       0.1419       0.1417       0.1422
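
As a sanity check on the table above, the rates can be reproduced from
the min times, since stream reports bytes moved divided by the best
time.  This is only a sketch: the array length N = 4,000,000 doubles is
an assumption (twice the 2,000,000 default I believe stream versions of
that era shipped with), not something stated in the post.

```python
# Sanity check of the gcc -O1 table: rate = bytes moved / min time.
# ASSUMPTION: N = 4,000,000 doubles ("twice as big as stock", taking the
# stock size to be 2,000,000); the post does not give N explicitly.
N = 4_000_000
BYTES_PER_DOUBLE = 8

# Number of arrays each kernel reads or writes per element.
ARRAYS_TOUCHED = {"Copy": 2, "Scale": 2, "Add": 3, "Triad": 3}

# Min times in seconds, copied from the gcc -O1 table.
MIN_TIME = {"Copy": 0.0895, "Scale": 0.0937, "Add": 0.1172, "Triad": 0.1417}


def stream_rate_mb_s(kernel, min_time):
    """Return the rate in MB/s (1 MB = 1e6 bytes) for one kernel."""
    bytes_moved = ARRAYS_TOUCHED[kernel] * BYTES_PER_DOUBLE * N
    return bytes_moved / min_time / 1e6


rates = {k: stream_rate_mb_s(k, t) for k, t in MIN_TIME.items()}
for kernel, rate in rates.items():
    print(f"{kernel:6s} {rate:8.1f} MB/s")
```

With that assumed N the computed rates land within about 0.1% of the
reported ones; the small gap is consistent with the min times being
printed to only four decimal places.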

gcc -O2
Function      Rate (MB/s)   RMS time     Min time     Max time
Copy:         715.8281       0.0895       0.0894       0.0899
Scale:        425.0063       0.1507       0.1506       0.1507
Add:          926.7837       0.1036       0.1036       0.1037
Triad:        552.4164       0.1739       0.1738       0.1742

gcc -O3
Function      Rate (MB/s)   RMS time     Min time     Max time
Copy:         716.1721       0.0895       0.0894       0.0896
Scale:        424.9328       0.1507       0.1506       0.1507
Add:          926.4625       0.1037       0.1036       0.1040
Triad:        552.5436       0.1739       0.1737       0.1740

gcc -O4 (probably just a sampling error vs -O3)
Function      Rate (MB/s)   RMS time     Min time     Max time
Copy:         742.4336       0.0889       0.0862       0.0899
Scale:        443.9945       0.1495       0.1441       0.1507
Add:          958.3234       0.1030       0.1002       0.1037
Triad:        570.7626       0.1728       0.1682       0.1740
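
To make the effect of the optimization level easier to see, here is a
small sketch that computes the percentage change in rate going from
-O1 to -O2, using only the numbers reported above.  It says nothing
about why gcc's code changed, only how large the difference is.

```python
# Rates in MB/s copied from the gcc -O1 and gcc -O2 tables.
O1 = {"Copy": 715.3312, "Scale": 683.3954, "Add": 819.3646, "Triad": 677.4304}
O2 = {"Copy": 715.8281, "Scale": 425.0063, "Add": 926.7837, "Triad": 552.4164}

# Percentage change of each kernel when moving from -O1 to -O2.
change_pct = {k: (O2[k] - O1[k]) / O1[k] * 100.0 for k in O1}

for kernel, pct in change_pct.items():
    print(f"{kernel:6s} {pct:+6.1f}%")
```

Copy is essentially unchanged and Add improves, while Scale and Triad
(the two kernels that multiply by a scalar) get noticeably slower
at -O2.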

I was pretty impressed with the numbers, then I remembered I
didn't even have a decent cpu:
model name	: AMD Duron(tm) processor
cpu MHz		: 952.172

Keep in mind this cpu has a 100 MHz FSB (DDR), so it can only
place a request every 10 ns.  It also lacks the special memory
prefetch circuitry available in the Palominos, although I believe
the nvidia crush chipset has similar functionality.
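
The FSB arithmetic can be sketched as follows.  The 100 MHz clock and
DDR signalling come from the post; the 64-bit (8-byte) data bus width
is an assumption added here to get a peak figure, so treat the
resulting bandwidth and percentage as rough estimates.

```python
# FSB clock from the post: 100 MHz, double data rate (2 transfers/clock).
FSB_CLOCK_HZ = 100e6
TRANSFERS_PER_CLOCK = 2

# ASSUMPTION: 64-bit (8-byte) FSB data path; the post does not state it.
BUS_WIDTH_BYTES = 8

# One clock period: the "request every 10 ns" figure.
clock_period_ns = 1e9 / FSB_CLOCK_HZ

# Theoretical peak FSB bandwidth in MB/s (1 MB = 1e6 bytes).
peak_mb_s = FSB_CLOCK_HZ * TRANSFERS_PER_CLOCK * BUS_WIDTH_BYTES / 1e6

# Best rate reported in the post (Add, gcc -O4 run).
best_add_mb_s = 958.3234
fraction_of_peak = best_add_mb_s / peak_mb_s

print(f"clock period: {clock_period_ns:.0f} ns")
print(f"peak FSB bandwidth: {peak_mb_s:.0f} MB/s")
print(f"best Add rate is {fraction_of_peak:.0%} of that peak")
```

Under that bus-width assumption the best Add result is roughly 60% of
what the FSB could move in theory.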

If you want a killer, cheap node I'd at least take a look: it has
onboard ethernet, video, EIDE, and USB.  Oh, and it PXE boots,
making it diskless friendly.

In any case these are some of the best numbers I've seen
for an Athlon.


-- 
Bill Broadley
Mathematics/Institute of Theoretical Dynamics
UC Davis


