[Beowulf] AMD64 results...
    Bill Broadley 
    bill at cse.ucdavis.edu
       
    Wed Dec 15 22:59:52 PST 2004
    
    
  
> ironically, icc -xW generates pretty good-for-opteron code,
> though of course, it's 32b.  I haven't tried using icc to
> generate em64t/amd64 code.
> 
> regards, mark hahn.
Ah, got icc-8.1 to cooperate. Dual 2.2 GHz Opteron + PC3200 + 2.4 kernel,
915.5 MB array:
-O1
Function      Rate (MB/s)   Avg time     Min time     Max time
Copy:        2285.8039       0.2640       0.2800       0.3200
Scale:       2206.9798       0.2690       0.2900       0.3000
Add:         2341.5554       0.3740       0.4100       0.4200
Triad:       2181.9031       0.4060       0.4400       0.4800
-O2
Function      Rate (MB/s)   Avg time     Min time     Max time
Copy:        2370.4856       0.2570       0.2700       0.3400
Scale:       2285.8280       0.2670       0.2800       0.3400
Add:         2461.6513       0.3710       0.3900       0.4600
Triad:       2285.8229       0.3920       0.4200       0.5000
-O3 
Function      Rate (MB/s)   Avg time     Min time     Max time
Copy:        2461.5867       0.2730       0.2600       0.3400
Scale:       2370.4237       0.2910       0.2700       0.3500
Add:         2526.3684       0.4050       0.3800       0.4800
Triad:       2341.5151       0.4320       0.4100       0.5100
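For reference, the rates in these tables line up with the usual STREAM arithmetic: bytes moved per iteration divided by the minimum time. Below is a rough sketch of that calculation. The array length of 40,000,000 doubles is an assumption, inferred from the 915.5 MB total across three arrays; it is not stated in the post. The function names are made up for illustration.

```python
# Assumption: N is inferred from the 915.5 MB total (three arrays of
# 8-byte doubles); the post does not state the array length directly.
N = 40_000_000
BYTES_PER_DOUBLE = 8

# Number of arrays read or written per iteration by each STREAM kernel.
ARRAYS_TOUCHED = {"Copy": 2, "Scale": 2, "Add": 3, "Triad": 3}


def total_mib(n=N):
    """Memory footprint of the three STREAM arrays, in units of 2**20 bytes."""
    return 3 * n * BYTES_PER_DOUBLE / 2**20


def rate_mb_s(kernel, min_time_s, n=N):
    """Bandwidth in MB/s (10**6 bytes), computed from the minimum time."""
    bytes_moved = ARRAYS_TOUCHED[kernel] * BYTES_PER_DOUBLE * n
    return bytes_moved / min_time_s / 1e6


if __name__ == "__main__":
    print("total MB:", round(total_mib(), 1))
    # Minimum times taken from the -O1 table above.
    for kernel, t in [("Copy", 0.28), ("Scale", 0.29),
                      ("Add", 0.41), ("Triad", 0.44)]:
        print(kernel, round(rate_mb_s(kernel, t), 1))
```

The computed values differ slightly from the reported rates because the times in the tables are printed rounded to four decimal places.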
The strange thing is that these are 32-bit binaries, despite being built
on a 64-bit OS on 64-bit hardware.
I played around with various optimizations mentioned in the man page
(including -xW), but I never managed to get a 64-bit binary out of icc-8.1.
The man page has numerous i32em and em64t references.
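One way to confirm whether a given binary is 32-bit or 64-bit is to read the class byte in its ELF header (byte 4 of the file: 1 means 32-bit, 2 means 64-bit). A minimal sketch follows; the function names are hypothetical, and it assumes the binary is an ELF file, as on Linux.

```python
import sys

ELF_MAGIC = b"\x7fELF"
# EI_CLASS values from the ELF header: 1 = ELFCLASS32, 2 = ELFCLASS64.
ELF_CLASSES = {1: "32-bit", 2: "64-bit"}


def elf_class_from_header(header: bytes) -> str:
    """Classify an ELF file from its first five bytes."""
    if len(header) < 5 or header[:4] != ELF_MAGIC:
        raise ValueError("not an ELF file")
    return ELF_CLASSES.get(header[4], "unknown")


def elf_class(path: str) -> str:
    """Open a file and report whether it is a 32-bit or 64-bit ELF binary."""
    with open(path, "rb") as f:
        return elf_class_from_header(f.read(5))


if __name__ == "__main__":
    # Usage: python elfclass.py ./stream
    for path in sys.argv[1:]:
        print(path, elf_class(path))
```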
-- 
Bill Broadley
Computational Science and Engineering
UC Davis
    
    