[Beowulf] AMD64 results...

Bill Broadley bill at cse.ucdavis.edu
Wed Dec 15 22:59:52 PST 2004


> ironically, icc -xW generates pretty good-for-opteron code,
> though of course, it's 32b.  I haven't tried using icc to
> generate em64t/and64 code.
> 
> regards, mark hahn.

Ah, got icc-8.1 to cooperate, dual 2.2 Ghz opteron+pc3200+2.4 kernel,
915.5MB array:
-O1
Function      Rate (MB/s)   Avg time     Min time     Max time
Copy:        2285.8039       0.2640       0.2800       0.3200
Scale:       2206.9798       0.2690       0.2900       0.3000
Add:         2341.5554       0.3740       0.4100       0.4200
Triad:       2181.9031       0.4060       0.4400       0.4800

-O2
Function      Rate (MB/s)   Avg time     Min time     Max time
Copy:        2370.4856       0.2570       0.2700       0.3400
Scale:       2285.8280       0.2670       0.2800       0.3400
Add:         2461.6513       0.3710       0.3900       0.4600
Triad:       2285.8229       0.3920       0.4200       0.5000

-O3 
Function      Rate (MB/s)   Avg time     Min time     Max time
Copy:        2461.5867       0.2730       0.2600       0.3400
Scale:       2370.4237       0.2910       0.2700       0.3500
Add:         2526.3684       0.4050       0.3800       0.4800
Triad:       2341.5151       0.4320       0.4100       0.5100

The strange thing is they are 32 bit binaries, despite being built
on a 64 bit os on a 64 bit hardware.

I played around with various mentioned optimizations (including -xW)
on the manpage, I never managed a 64 bit binary with icc-8.1 though.
The man page has numerous i32em and em64t references.




-- 
Bill Broadley
Computational Science and Engineering
UC Davis



More information about the Beowulf mailing list