<div dir="ltr"><div class="gmail_default" style="font-family:monospace,monospace">Interesting. Ansible has come up a few times.</div><div class="gmail_default" style="font-family:monospace,monospace"><br></div><div class="gmail_default" style="font-family:monospace,monospace">Our largest cluster is 2000 KNL nodes and we are looking towards 10k... so it needs to scale well :)</div></div><div class="gmail_extra"><br><div class="gmail_quote">On Tue, Sep 5, 2017 at 1:46 PM, Lachlan Musicman <span dir="ltr"><<a href="mailto:datakid@gmail.com" target="_blank">datakid@gmail.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir="ltr"><div class="gmail_extra"><div class="gmail_quote"><span class="">On 5 September 2017 at 15:24, Stu Midgley <span dir="ltr"><<a href="mailto:sdm900@gmail.com" target="_blank">sdm900@gmail.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr"><div style="font-family:monospace,monospace">Morning everyone</div><div style="font-family:monospace,monospace"><br></div><div style="font-family:monospace,monospace">I am in the process of redeveloping our cluster deployment and config management environment and wondered what others are doing?</div><div style="font-family:monospace,monospace"><br></div><div style="font-family:monospace,monospace">First, everything we currently have is basically home-grown.</div><div style="font-family:monospace,monospace"><br></div><div style="font-family:monospace,monospace">Our cluster deployment is a system that I've developed over the years and is pretty simple - if you know BASH and how pxe booting works. It has everything from setting the correct parameters in the bios, zfs ram disks for the OS, lustre for state files (usually in /var) - all in the initrd.</div><div style="font-family:monospace,monospace"><br></div><div style="font-family:monospace,monospace">We use it to boot cluster nodes, lustre servers, misc servers and desktops.</div><div style="font-family:monospace,monospace"><br></div><div style="font-family:monospace,monospace">We basically treat everything like a cluster.</div><div style="font-family:monospace,monospace"><br></div><div style="font-family:monospace,monospace">However... we do have a proliferation of images... and all need to be kept up-to-date and managed. Most of the changes from one image to the next are config files.</div><div style="font-family:monospace,monospace"><br></div><div style="font-family:monospace,monospace">We don't have a good config management (which might, hopefully, reduce the number of images we need). We tried puppet, but it seems everyone hates it. Its too complicated? Not the right tool?</div><div style="font-family:monospace,monospace"><br></div><div style="font-family:monospace,monospace">I was thinking of using git for config files, dumping a list of rpm's, dumping the active services from systemd and somehow munging all that together in the initrd. ie. git checkout the server to get config files and systemctl enable/start the appropriate services etc.</div><div style="font-family:monospace,monospace"><br></div><div style="font-family:monospace,monospace">It started to get complicated.</div><div style="font-family:monospace,monospace"><br></div><div style="font-family:monospace,monospace">Any feedback/experiences appreciated. What works well? What doesn't?</div></div></blockquote><div><br><br></div></span><div>We are a small installation, with manageable needs. In our first step up from where you are, we ended up on:<br><br>- Katello/Foreman (in RedHat it's called Satellite) for management of software repositories, in discrete sets and slices. We started with Spacewalk but it is a little old and fusty and just isn't appropriate anymore.<br></div><div>- git for config management of environment module files<br></div><div>- Ansible for easy day to day management of servers <br><br>We no longer manage configs as such, since there is a shared data store, and the Ansible/Katello mix means we can rebuild any server from scratch.<br><br></div><div>Note that Ansible and Katello/Foreman can be integrated - we haven't gone that far yet. Are quite happy with the two being apart. That will change in the near future I think.<br><br></div><div>Cheers<br></div><div>L.<br></div><div><br><br><br><br clear="all">------<br>"The antidote to apocalypticism is
<b>apocalyptic civics</b>. Apocalyptic civics is the
insistence that we cannot ignore the truth, nor should we panic about
it. It is a shared consciousness that our institutions have failed and
our ecosystem is collapsing, yet we are still here — and we are creative
agents who can shape our destinies. Apocalyptic civics is the
conviction that the only way out is through, and the only way through is
together. "<br><br><i>Greg Bloom</i> @greggish <a href="https://twitter.com/greggish/status/873177525903609857" target="_blank">https://twitter.com/greggish/<wbr>status/873177525903609857</a> <br></div></div><br><br></div></div>
</blockquote></div><br><br clear="all"><div><br></div>-- <br><div class="gmail_signature" data-smartmail="gmail_signature"><div dir="ltr"><div><font face="monospace, monospace">Dr Stuart Midgley<br><a href="mailto:sdm900@gmail.com" target="_blank">sdm900@gmail.com</a></font></div></div></div>
</div>