[Beowulf] Re: [Bioclusters] FW: cluster newbie (fwd from dag at sonsorol.org)
Eugen Leitl
eugen at leitl.org
Wed Jan 5 12:12:27 PST 2005
----- Forwarded message from Chris Dagdigian <dag at sonsorol.org> -----
From: Chris Dagdigian <dag at sonsorol.org>
Date: Wed, 05 Jan 2005 14:43:55 -0500
To: "Clustering, compute farming & distributed computing in life science informatics" <bioclusters at bioinformatics.org>
Subject: Re: [Bioclusters] FW: cluster newbie
Organization: Bioteam Inc.
User-Agent: Mozilla/5.0 (Macintosh; U; PPC Mac OS X Mach-O; en-US;
rv:1.7.5) Gecko/20041217
Reply-To: "Clustering, compute farming & distributed computing in life science informatics" <bioclusters at bioinformatics.org>
Hi Nick,
I've written about this in the past; you can find stuff in the list
archives.
Some of the articles and presentations I've done on "bioclusters" are
linked off this URL: http://bioteam.net/dag/ - something there may be of
use.
In general I'd go with BioBrew or the ROCKS cluster kit ("roll") that
comes with Grid Engine as the scheduler if you are starting out.
If you want to roll your own cluster to have maximum flexibility and
really learn the behind-the-scenes stuff, just pick the "best" (or your
favorite) components to match the following requirements:
1. A Linux distribution
2. Resource manager & scheduler software (Grid Engine, etc.)
3. Software for doing unattended "bare metal" installs and incremental
updates (SystemImager, etc.)
4. Management & monitoring packages (ganglia, nagios, bigbrother, etc.)
Take what you chose for #1-4 and put all the compute nodes on a private
gigabit ethernet switch. There should be a common NFS share for user
home directories and maybe for the scheduler system. Pick one node to be
the "master" node and connect one of its NICs to the "private" cluster
network and the 2nd NIC to the company/department network. This way you
and your users only have to log in to and deal with the single
"master" node.
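
To make the layout above concrete, here is a minimal sketch in Python that prints /etc/hosts entries for the private network and an /etc/exports line for the shared home directories. The subnet 192.168.1.0/24, the host names "master" and "nodeNN", the /home path, and the rw,sync export options are all assumptions chosen for illustration, not anything a particular cluster kit requires.

```python
import ipaddress


def cluster_layout(subnet="192.168.1.0/24", n_compute=4, share="/home"):
    """Return (hosts_lines, exports_line) for one master plus n_compute nodes.

    All names and addresses are hypothetical: the master takes the first
    usable address on the private subnet, compute nodes take the next ones.
    """
    net = ipaddress.ip_network(subnet)
    addresses = list(net.hosts())  # usable addresses on the private network
    if n_compute + 1 > len(addresses):
        raise ValueError("subnet too small for requested node count")

    names = ["master"] + [f"node{i:02d}" for i in range(1, n_compute + 1)]

    # One /etc/hosts style line per machine on the private network.
    hosts_lines = [f"{ip}\t{name}" for ip, name in zip(addresses, names)]

    # One /etc/exports style line: share the directory with the whole subnet.
    exports_line = f"{share} {net.with_prefixlen}(rw,sync)"
    return hosts_lines, exports_line


if __name__ == "__main__":
    hosts_lines, exports_line = cluster_layout()
    print("\n".join(hosts_lines))
    print(exports_line)
```

The master's second NIC (the one on the company/department network) is deliberately left out of this sketch, since its address would come from whatever that network assigns.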
-Chris
Nick D'Angelo wrote:
>
>All,
>
>I am sure this has been asked many times before, but what is the preferred
>method or perhaps 'best' method of clustering a few 3-5 RedHat or other
>Linux flavours to best suit our Bioinfo R and D group?
>
>I have come across biobrew with their own cd distribution install and also
>this group.
>
>I was going to originally look at Fedora core 2, but that appeared to be
>painful due to the kernel re-compile and to be honest, the documentation
>appeared to be quite poor, at least what I found.
>
> Any suggestions?
>
> Thanks,
--
Chris Dagdigian, <dag at sonsorol.org>
BioTeam - Independent life science IT & informatics consulting
Office: 617-665-6088, Mobile: 617-877-5498, Fax: 425-699-0193
PGP KeyID: 83D4310E iChat/AIM: bioteamdag Web: http://bioteam.net
_______________________________________________
Bioclusters maillist - Bioclusters at bioinformatics.org
https://bioinformatics.org/mailman/listinfo/bioclusters
----- End forwarded message -----
--
Eugen* Leitl <a href="http://leitl.org">leitl</a>
______________________________________________________________
ICBM: 48.07078, 11.61144 http://www.leitl.org
8B29F6BE: 099D 78BA 2FD3 B014 B08A 7779 75B0 2443 8B29 F6BE
http://moleculardevices.org http://nanomachines.net