<div dir="ltr">Indeed. Some interesting news here:<div><br></div><div><a href="http://www.enterprisetech.com/2016/03/04/docker-acquires-apache-aurora-founders/">http://www.enterprisetech.com/2016/03/04/docker-acquires-apache-aurora-founders/</a><br></div><div><br></div><div>Us old style guys are going to have our lunch money stolen by young upstarts. Or is that startups?</div><div>Seriously - these guys know how to keep things running at scale and how to tolerate failures.</div><div><br></div><div><br></div><div><br></div><div><br></div></div><div class="gmail_extra"><br><div class="gmail_quote">On 3 March 2016 at 23:30, Christopher Samuel <span dir="ltr"><<a href="mailto:samuel@unimelb.edu.au" target="_blank">samuel@unimelb.edu.au</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><span class="">On 04/03/16 06:40, Douglas Eadline wrote:<br>
<br>
> Yes, failure needs to be option.<br>
<br>
</span>The Slurm folks have been working on failure management support for a<br>
little while, the idea being you can have a pool of spare nodes to pick<br>
from (or alternatively bargain with a scheduler for a node that's<br>
currently busy to come free later on and then add it to the job,<br>
potentially extending the walltime to make up for the shortfall).<br>
<br>
A better description from someone with higher caffeination is here:<br>
<br>
<a href="http://slurm.schedmd.com/nonstop.html" rel="noreferrer" target="_blank">http://slurm.schedmd.com/nonstop.html</a><br>
<br>
All the best,<br>
Chris<br>
<span class="HOEnZb"><font color="#888888">--<br>
Christopher Samuel Senior Systems Administrator<br>
VLSCI - Victorian Life Sciences Computation Initiative<br>
Email: <a href="mailto:samuel@unimelb.edu.au">samuel@unimelb.edu.au</a> Phone: <a href="tel:%2B61%20%280%293%20903%2055545" value="+61390355545">+61 (0)3 903 55545</a><br>
<a href="http://www.vlsci.org.au/" rel="noreferrer" target="_blank">http://www.vlsci.org.au/</a> <a href="http://twitter.com/vlsci" rel="noreferrer" target="_blank">http://twitter.com/vlsci</a><br>
</font></span><div class="HOEnZb"><div class="h5"><br>
_______________________________________________<br>
Beowulf mailing list, <a href="mailto:Beowulf@beowulf.org">Beowulf@beowulf.org</a> sponsored by Penguin Computing<br>
To change your subscription (digest mode or unsubscribe) visit <a href="http://www.beowulf.org/mailman/listinfo/beowulf" rel="noreferrer" target="_blank">http://www.beowulf.org/mailman/listinfo/beowulf</a><br>
</div></div></blockquote></div><br></div>