[Beowulf] Infiniband modular switches

Patrick Geoffray patrick at myri.com
Fri Jun 13 12:43:31 PDT 2008


Hi Don,

Don Holmgren wrote:
> latency difference here matters to many codes).  Perhaps of more 
> significance, though, is that you can use oversubscription to lower the 
> cost of your fabric.  Instead of connecting 12 ports of a leaf switch to 
> nodes and using the other 12 ports as uplinks, you might get away with 
> 18 nodes and 6 uplinks, or 20 nodes and 4 uplinks.  As core counts are 
> increasing, this is becoming more and more viable for some applications.

It's important to note that the "full-bisection" touted by vendors is on 
paper only. In reality, static routing provides full-bisection for a 
very small subset of patterns, the average effective bisection on a 
diameter-3 Clos is ~40% of link rate (adaptive routing improves that a 
lot, but breaks packet order on the wire which is a requirement for some 
network protocols).

In practice, "paper" full-bisection is near free when using a single 
enclosure, since all spine cables are on the backplane. For larger 
networks, where you have to pay for real cables to the spine level, then 
it may make sense to be oversubscribed if the effective bisection is 
already bad (static routing), or if your collective communication on 
large jobs are not bandwidth bounded. However, the later is often false 
on many-cores.

Patrick



More information about the Beowulf mailing list