Performance Variations using MPI/Myrico
Thomas Davis
tadavis at lbl.gov
Fri Apr 27 10:05:45 PDT 2001
Patrick Geoffray wrote:
>
> Thomas Davis wrote:
> >
> > We are looking for sites that run Intel Linux/SMP(dual)/MPI/Myricom 2k,
> > and have experienced performance variations. IE, you've ran the NAS/FT
> > parallel benchmark, densely packed (using all CPU's on the nodes), and
> > noted that the runs come up different each time - and the difference is
> > not minor (as much as 80%).
>
> Hi Thomas,
>
> I think I know the machine you are thinking about :-)
> I have a lot of documentation (trace file, profiles, timings) for the NAS
> FT benchmark on this cluster. I would be happy to show some screenshot of
> the MPI trace but I am afraid it would be a too big file to send on the
> list.
>
Patrick, I'm sure you know too.. Like I said in a previous list
message, everyone else we've talked to is baffled too.
>
> > 3) Any ideas on what could cause this much variation?
>
> I have some ideas, but nothing I would bet on. Mainly cache trashing : the
> memory copy operation is improved with SSE by using the prefecthing
> support, and this prefetch bypass the L2 cache. Without SSE, the L2 cache
> is happilly flushed as a processor is doing a copy. As the FFT code
> include a copy step, who knows... :-)
>
> Ahh, I am eager to see dual athlon on the market...
>
We are eager too.. Believe me! You try selling PIII's to people used to
Cray/SP memory performance some day.. :-)
--
------------------------+--------------------------------------------------
Thomas Davis | ASG Cluster guy
tadavis at lbl.gov |
(510) 486-4524 | "80 nodes and chugging Captain!"
More information about the Beowulf
mailing list