<html><head><meta http-equiv="Content-Type" content="text/html; charset=UTF-8"></head><body>
<div>Hi Lance,</div><div>I am curious about how do you encapsulate the job in the right cgroups in slurm. Could you please give us some details ?</div><div>Comcerning Docker as a login node (or other "service" nodes) how do you manage its deployment ? Basically, with registry and pull, with swarm/mesos/kubernetes ?</div><div><br></div><div>@John:</div><div> I know there is a tight integration of Docker in HTCondor (see Docher HTcondor universe application); it could be modified easily to submit singularity jobs (I know some ppl are doing this). On our clusters we are using singularity like any other apps with environment modules+SGE (without cgroups).</div><div>Concerning LXD we used to deploy some containers of that type to some external machines. Then, these containers were connected to the rest of the cluster (that is an easy way to make our cluster bigger). However, even if it works well within an experimental environment, it did not gave us full satisfaction in production (problems with isolating the container from the host).</div><div>For the service nodes perhaps you can also look at proxmox to create LXC containers and manage these ones more easily.</div><div><br></div><div>Best regards</div><div>Remy</div><div><br></div><div><br></div><div><br></div><div id="composer_signature"><div style="font-size:88%;color:#364f67" dir="auto">Envoyé depuis mon appareil Samsung</div></div><br><br>-------- Message d'origine --------<br>De : Lance Wilson <lance.wilson@monash.edu> <br>Date : 16/06/2017 01:30 (GMT+01:00) <br>À : John Hearns <hearnsj@googlemail.com> <br>Cc : Beowulf Mailing List <beowulf@beowulf.org> <br>Objet : Re: [Beowulf] LXD containers for cluster services and cgroups? <br><br><div dir="ltr">Hi John,<div>In regards to your Singularity question we are using cgroups for the containers. Mostly the containers are used in Slurm jobs which creates the appropriate cgroups. We are also using the gpu driver passthrough functionality of Singularity now for our machine learning and cryoem processing containers which have the cgroups applied to gpus.</div><div><br></div><div>Back to your systems containers questions many of our systems have been put into docker containers as they run on same/similar operating system and still need root to function correctly. Pretty much every new system thing we do is scripted and put into a container so that we can recover quickly in an outage scenario and move around things as part of our larger cloud (private and public) strategy.</div></div><div class="gmail_extra"><br clear="all"><div><div class="gmail_signature" data-smartmail="gmail_signature"><div dir="ltr"><div><div dir="ltr">Cheers,<br><br>Lance<br>--<br>Dr Lance Wilson<br>Senior HPC Consultant</div><div>Ph: 03 99055942 (+61 3 99055942</div><div dir="ltr">Mobile: 0437414123 (+61 4 3741 4123)</div><div dir="ltr">Multi-modal Australian ScienceS Imaging and Visualisation Environment<br>(<a href="http://www.massive.org.au/" rel="noreferrer" style="color:rgb(17,85,204)" target="_blank">www.massive.org.au</a>)<br>Monash University<br></div></div></div></div></div>
<br><div class="gmail_quote">On 15 June 2017 at 20:06, John Hearns <span dir="ltr"><<a href="mailto:hearnsj@googlemail.com" target="_blank">hearnsj@googlemail.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir="ltr"><div>I'm not sure this post is going to make a lot of sense. But please bear with me!</div><div>For applications containers are possible using Singularity or Docker of course.</div><div><br></div><div>In HPC clusters we tend to have several 'service node' activities, such as the cluster management/ head node, perhaps separate provisioning nodes to spread the load, batch queue system masters, monitoring setups, job submission and dedicated storage nodes.</div><div><br></div><div>These can all of course be run on a single cluster head node in a small setup (with the exception of the storage nodes). In a larger setup you can run these services in virtual machines.</div><div><br></div><div>What I am asking is anyone using technologies such as LXD containers to run these services?</div><div>I was inspired by an Openstack talk by James Page at Canonical, where all the Opestack services were deployed by Juju charms onto LXD containers.</div><div>So we pack all the services into containers on physical server(s) which makes moving them or re-deploying things very flexible.</div><div><a href="https://www.youtube.com/watch?v=5orzBITR3X8" target="_blank">https://www.youtube.com/watch?<wbr>v=5orzBITR3X8</a></div><div><br></div><div>While I'm talking abotu containers, is anyone deploying singularity containers in cgroups, and limiting the resources they can use (I'm specifically thinking of RDMA here).</div><div><br></div><div><br></div><div><br></div><div>ps. I have a terrible sense of deja vu here... I think I asked the Singularity question a month ago.</div><div>I plead insanity m'lord</div><div><br></div><div><br></div></div>
<br>______________________________<wbr>_________________<br>
Beowulf mailing list, <a href="mailto:Beowulf@beowulf.org">Beowulf@beowulf.org</a> sponsored by Penguin Computing<br>
To change your subscription (digest mode or unsubscribe) visit <a href="http://www.beowulf.org/mailman/listinfo/beowulf" rel="noreferrer" target="_blank">http://www.beowulf.org/<wbr>mailman/listinfo/beowulf</a><br>
<br></blockquote></div><br></div>
</body></html>