[Beowulf] Exascale by the end of the year?
Christopher Samuel
samuel at unimelb.edu.au
Wed Mar 5 18:49:39 PST 2014
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
On 06/03/14 03:07, Joe Landman wrote:
> I've not done much with MPI in a few years, have they extended it
> beyond MPI_Init yet? Can MPI procs just join a "borgified"
> collective, preserve state so restarts/moves/reschedules of ranks
> are cheap? If not, what is the replacement for MPI that will do
> this?
Oops, forgot this in my previous email - I stumbled across the Uni of
Tenessee's ULFM (User Level Failure Mitigation) project which has a
Wordpress blog here:
http://fault-tolerance.org/
There is the PDF for a two page flyer from SC13 on the site which
gives an overview and describes it thus:
http://fault-tolerance.org/wp-content/uploads/2013/12/SC13-ULFM.pdf
# User Level Failure Mitigation is a set of MPI interface extensions
# enabling Message Passing programs to restore MPI communication
# capabilities affected by process failures. It supports rebuilding
# communicators, RMA windows and I/O Files
All the best,
Chris
- --
Christopher Samuel Senior Systems Administrator
VLSCI - Victorian Life Sciences Computation Initiative
Email: samuel at unimelb.edu.au Phone: +61 (0)3 903 55545
http://www.vlsci.org.au/ http://twitter.com/vlsci
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.14 (GNU/Linux)
Comment: Using GnuPG with Thunderbird - http://www.enigmail.net/
iEYEARECAAYFAlMX4kMACgkQO2KABBYQAh9iXgCffxwP07z91by2FCHxVRwtTl4Q
yTUAni3Xn0C+Nla0rS4HwW2dfF4Czb0Q
=yWTJ
-----END PGP SIGNATURE-----
More information about the Beowulf
mailing list