[Beowulf] backtraces
Lombard, David N
dnlombar at ichips.intel.com
Tue Jun 12 07:02:45 PDT 2007
On Mon, Jun 11, 2007 at 10:00:02PM -0400, Mark Hahn wrote:
>
> part of the reason I got a kick out of this simple backtrace.so
> is indeed that it's quite possible to conceive of a checkpoint.so
> which uses /proc/$pid/fd and /proc/$pid/maps to do a possibly
> decent job of checkpointing at least serial codes non-intrusively.
>
Have you looked at Berkeley Lab Checkpoint/Restart (BLCR) at
<http://ftg.lbl.gov/CheckpointRestart/CheckpointRestart.shtml>?
It goes far beyond serial codes; with proper support, it handles MPI too...
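
For anyone curious what the /proc side of Mark's idea looks like, here is a
rough sketch in Python (Linux only). It only lists what a user-space
checkpointer would have to save: the writable memory regions from
/proc/$pid/maps and the open descriptors from /proc/$pid/fd. It does not
save or restore anything, and it is not BLCR's mechanism. The function names
are made up for illustration.

```python
import os

def parse_maps_line(line):
    """Split one line of /proc/<pid>/maps into its fields."""
    # Format: start-end perms offset dev inode [pathname]
    fields = line.split(None, 5)
    addr, perms, offset, dev, inode = fields[:5]
    path = fields[5].strip() if len(fields) > 5 else ""  # empty for anonymous maps
    start, end = (int(x, 16) for x in addr.split("-"))
    return {"start": start, "end": end, "perms": perms,
            "offset": int(offset, 16), "dev": dev,
            "inode": int(inode), "path": path}

def writable_regions(pid="self"):
    """Regions whose contents a checkpoint would need to copy out."""
    with open(f"/proc/{pid}/maps") as f:
        return [r for r in map(parse_maps_line, f) if "w" in r["perms"]]

def open_files(pid="self"):
    """Map each open descriptor number to the target of its /proc symlink."""
    fd_dir = f"/proc/{pid}/fd"
    result = {}
    for name in os.listdir(fd_dir):
        try:
            result[int(name)] = os.readlink(os.path.join(fd_dir, name))
        except OSError:
            pass  # descriptor was closed between listdir and readlink
    return result

if __name__ == "__main__":
    for r in writable_regions():
        print(f"{r['start']:x}-{r['end']:x} {r['perms']} {r['path']}")
    for fd, target in sorted(open_files().items()):
        print(fd, "->", target)
```

Note that this says nothing about file offsets, sockets, pipes, signal
state, or threads, which is where the hard part of checkpointing starts.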
--
David N. Lombard, Intel, Irvine, CA
I do not speak for Intel Corporation; all comments are strictly my own.