[Beowulf] What is a "proper" machine count for a cluster

John Hearns john.hearns at streamline-computing.com
Thu Mar 15 01:24:32 PDT 2007


Mathew Shember wrote:
> Hello all,
> 
> 
> I am just wondering what is a reasonable amount of computers to allocate 
> for playing around.
> 
> I was thinking 4 pentiums with dual gig processors.  Ram would be a gig 
> or more.
> 
Matthew, if I may put some words in your mouth what I THINK you are 
saying is that you have some surplus desktop machines which you can 
repurpose as a Linux compute cluster (Beowulf).
The list is here to help you, and it will be an excellent 
training/learning experience.
Look at the Cluster Live CD which was recently linked ot on the list.
Designate one machine as the master, and boot the rest off it.
Then look at Rocks and Warewulf.

However, your question is about "what is a reasonable amount of 
computers". That actually depends on your job load and more importantly 
your interconnect.
If you are doing serial jobs which use one machine, your cluster will 
scale, so the amount of useful work you get out goes up with machine 
count. We assume here you have enough RAM per processor.
However, for parallel work you have to have a fast interconnect.
I assume these machines have gigabit? Even if they have,
I would suggest buying some Intel PCI gigabit cards, they are very 
cheap. Put these in and get a decent quality gigabit switch to use as 
the parallel network.

Now comes the pitch.  The "sweet spot" in cluster nodes at the moment 
seems to be a dual socket, dual core Opteron or Intel machine with 2 
gigs of RAM per core, so each box is a 4-way SMP
(I will be flayed alive by the list for such cavalier numbers).
Now, such a box is likely to have oodles (scientific term) more 
processing power than four pentiums.

So I guess what I would say is - follow the learning curve which many 
folks here have done. Build that Beowulf with the four machines, get 
your code running on it. Download the evaluations of commercial 
compilers - that's an important part of Beowulfery too.
But next year, show the cluster to your bosses and get the budget for a 
cluster assembled and delivered by your friendly integrator. The 
"cluster" may be as small as one multi-core box of course.
The learning and real work you do with the four-box cluster will stand 
you in very good stead when speccing out a bigger solution.

















More information about the Beowulf mailing list