[Beowulf] Accelerator for data compressing
Dmitri Chubarov
dmitri.chubarov at gmail.com
Tue Oct 7 11:34:41 PDT 2008
>
> 2D finite difference can be comm intensive is the mesh is too small for each
> processor to have a fair amount of work to do before needing the neighboring
> values from a "far" node.
>
Actually it seems that with VX50 the same node may be the "far" node.
At least that's what I see
from the NUMA Analysis test from TAU Wiki:
http://www.nic.uoregon.edu/tau-wiki/Guide:Opteron_NUMA_Analysis
Do not have the numbers at hand but that was the impression.
>
> How do you identify the specific instruction using a profiler, this is
> something that interests me.
>
I am using the Performance Analyzer that comes with Sun Studio 12. It
provides a per instruction profile view of the disassembly.
More information about the Beowulf
mailing list