[Beowulf] best linux distribution
Bogdan Costescu
Bogdan.Costescu at iwr.uni-heidelberg.de
Tue Oct 9 03:36:15 PDT 2007
On Mon, 8 Oct 2007, Robert G. Brown wrote:
> RHEL/Centos are good where vendors require "binary compatibility" on
> closed source software, as the standard of said binary
> compatibility.
What strikes me in this whole discussion is the ideea of 'one
distribution fits all' when applied to all nodes of a cluster and all
applications that run on that cluster. In the days of PXE booting,
with several solutions readily available for either building a node
from scratch (like kickstart) or booting a prebuilt setup with
NFS-root or ramdisk, what's so difficult in matching on request a
node, an application and a distribution/custom setup ?
Real case: A quantum mechanics code that we have bought some years ago
was provided only as staticly-linked binaries. They have worked fine
on the current distros at that time and we have succesfully used them
on CentOS-3 (2.4 kernel). However we discovered the hard way on the
new CentOS-5 (2.6 kernel) that the statically linked binaries didn't
work anymore as the kernel interfaces have changed - but, after a few
lines were changed in the config files and the nodes rebooted, the
binaries were again happily running in their required configuration.
Of course, the admin is responsible in defining which
distributions/custom setups can run on a certain node, based on the
hardware of that node and the kernel of the distribution/custom setup.
But after this is done, the user can limit his/her jobs to running on
these nodes or ask the queueing system to set up a node according to
the requirements of the job (I think that term is 'provisioning').
Sure, it helps in this case to run a distribution with long support
(like RHEL/CentOS/SL, SLES or Ubuntu LTS) such that you don't have to
waste too much time yourself with updates, especially security related
ones.
> Far short of Debian, but plenty big enough to include just about all
> mainstream useful packages for any cluster or LAN.
I'm making sure that any cluster related package that is part of the
default distribution is not part of what the nodes get to run. Why ?
Because very often the common ground options used for building the
package (which is a good idea for a widely used distribution) don't
fit _my_ setup. So, I take the fact that the distibution offers me all
the needed tools as a fallback, but I'm always trying to match as well
as possible all the components. And if you search the archives of the
LAM/MPI mailing lists you'll see the larger picture...
--
Bogdan Costescu
IWR - Interdisziplinaeres Zentrum fuer Wissenschaftliches Rechnen
Universitaet Heidelberg, INF 368, D-69120 Heidelberg, GERMANY
Telephone: +49 6221 54 8869, Telefax: +49 6221 54 8868
E-mail: Bogdan.Costescu at IWR.Uni-Heidelberg.De
More information about the Beowulf
mailing list