[Beowulf] Accelerator for data compressing

Dmitri Chubarov dmitri.chubarov at gmail.com
Tue Oct 7 11:34:41 PDT 2008


>
> 2D finite difference can be comm intensive is the mesh is too small for each
> processor to have a fair amount of work to do before needing the neighboring
> values from a "far" node.
>

Actually it seems that with VX50 the same node may be the "far" node.
At least that's what I see
from the NUMA Analysis test from TAU Wiki:
http://www.nic.uoregon.edu/tau-wiki/Guide:Opteron_NUMA_Analysis

Do not have the numbers at hand but that was the impression.

>
> How do you identify the specific instruction using a profiler, this is
> something that interests me.
>

I am using the Performance Analyzer that comes with Sun Studio 12. It
provides a per instruction profile view of the disassembly.



More information about the Beowulf mailing list