[Beowulf] 1.2 us IB latency?

Mark Hahn hahn at mcmaster.ca
Tue Mar 27 18:36:42 PDT 2007


>> a dramatic breakthrough: 1.2 us, 25M msg/s.  since we just happend to
>>
>> or is this an example of message aggregation?  heck, from the url
>> above, it might even be counting intra-box messages.
>
> Nope, this is "normal" ping-pong for the new generation cards (connectx).

so if I do this:

start timer 
send(other,small-message)		recv(first,small-message)
recv(other,small-message)		send(first,small-message)
stop timer

I'll actually see 2.4 us between the timer calls?  if I understand,
aggregation would only help on a streaming test.  in fact, this kind 
of isolated RPC-like exchange is what I see most commonly.

> Maybe a bit optimistic though, I'd expect closer to 1.5 in a back-to-back
> config.

so for a small switch (24pt, say), how many hops to the internal fabric,
and they're, what, .2 us each?

also, does back-to-back work well?  I can imagine some cases where 
putting two dual-port cards in each node and creating a mesh might 
work well.

>> also, I'm sorta amazed people keep selling (and presumably buying)
>> dual-port IB cards.  doesn't that get quite expensive, switch-wise?
>
> Not defending them but, It could possibly maybe be useful if you have a
> stand-alone IB net for, say, storage or something else not mpi. Also, it's
> not like they're that much more expensive than single port ones...

yeah, I can see PHB's buying redundant fabrics.  I'd be more interested in
using the higher port-count for FNN or related topologies (assuming switches 
are cheap, at least at some size...)



More information about the Beowulf mailing list