It seems that when many people design their cluster they tend to build them with a power of 2 number of compute nodes. Is this just because we are all used to thinking in binary or are there codes (not benchmarks) that require or optimize for N=2^m nodes/procs? If so, what are you running? Thanks in advance for your thoughts. Dan