[Beowulf] How to configure a cluster network
Patrick Geoffray
patrick at myri.com
Thu Jul 24 15:51:20 PDT 2008
Hi Jan,
Jan Heichler wrote:
> 1) most applications are latency driven - not bandwidth driven. That
> means that half bisectional bandwidth is not cutting your application
> performance down to 50%. For most applications the impact should be less
> than 5% - for some it is really 0%.
If the app is purely latency driven, bandwidth (link or bisection) is
indeed irrelevant. However, don't underestimate the impact of contention
on collective communication: once you exceed the internal buffering in
the crossbars, you will have back-pressure. Typically, each crossbar
port can buffer in the order of 1-10K these days. So, the larger the
message size for the collective and the larger the communicator, the
greater the need for effective bisection. At this scale (ie 50 nodes), I
agree it's not that important, unless you are bandwidth bounded to begin
with.
> 2) Static routing in IB networks limits your bandwidth for many of the
> possible communication patterns anyway. For completely random
> communication it was like below 50%. So you buy a IB fabric with full
> bisectional but can't use it anyway - reducing the bisectional bandwidth
> is not impacting that much anymore (as far as i understood most whitepapers)
With static routing on Fat Tree or Clos and pseudo-random traffic (ie
real world), you waste ~50% of the bisection you have (actually, the
more hops the more waste, but it's not linear). So, if you start with
half the theoretical bisection, your effective bisection will roughly be
a quarter of that.
Patrick
More information about the Beowulf
mailing list