Fwd: [Beowulf] Configuration change monitoring
Gerry Creager
gerry.creager at tamu.edu
Thu Aug 30 12:54:28 PDT 2007
Walid wrote:
> ---------- Forwarded message ----------
> From: *Walid* <walid.shaari at gmail.com <mailto:walid.shaari at gmail.com>>
> Date: Aug 30, 2007 9:22 PM
> Subject: Re: [Beowulf] Configuration change monitoring
> To: "Robert G. Brown" <rgb at phy.duke.edu <mailto:rgb at phy.duke.edu>>
>
> Hi Robert
>
> On 8/30/07, *Robert G. Brown * <rgb at phy.duke.edu
> <mailto:rgb at phy.duke.edu>> wrote:
>
> To amplify Mark's remark a bit -- linux in general already has many
> fairly powerful tools for DOING monitoring, updates, and so on. For
> example, one can use e.g. yum, kickstart, apt, and more to install a
> "canned" node configuration and keep it up to date. It is so totally
> automatic that there is basically no need to "check for inconsistencies"
> on a node. Warewulf and several other tools also permit one to have
> rigorous control over node configuration.
>
>
> Some of the inconsistencies we see is usually when one of the admin
> choses the easy routes, and does changes manually to a set of nodes, and
> does not update the installation files (our installations are rpm
> kickstart based), others are becuase we manage quite a large variants
> and by mistake or intention configurations files that are not supposed
> to be on cluster A appears on Cluster B.
>
> Monitoring tools abound, of course, ranging from things like syslog-ng
> for centralized monitoring/logging of LAN systems activity to nightly
> ... ...
> Is there something else one needs to do? Well, most cluster admins tend
> to be fairly skilled linux administrators and good at shell script magic
> or even real programming. So if one has an edge-case need that isn't
> directly met by one of the available tools, it is usually a fairly
> simple matter to hack out a script to accomplish it.
>
>
> That is one reason why they want to push some commercial tools, They
> want to minimize the amount of customizations that are done locally by
> the team, as some writes in bash, another writes in Perl, and the other
> prefers Python, in the long run there is an issue of maintaining the
> scripts, and knowing which set of scripts belongs to which cluster, and
> what kind of modifications are needed for new clusters,.etc all of which
> is documented, but that is another story
More than likely you'll end up with commercial "solutions" that preclude
a rapid response to a problem and take your system down. A good admin
team communicates and you don't see confusion at the level you are
describing.
gerry
--
Gerry Creager -- gerry.creager at tamu.edu
Texas Mesonet -- AATLT, Texas A&M University
Cell: 979.229.5301 Office: 979.862.3982 FAX: 979.862.3983
Office: 1700 Research Parkway Ste 160, TAMU, College Station, TX 77843
More information about the Beowulf
mailing list