DMTCP: Distributed MultiThreaded CheckPointing

About

DMTCP (Distributed MultiThreaded Checkpointing) is a tool to transparently checkpointing the state of an arbitrary group of programs spread across many machines and connected by sockets. It does not modify the user's program nor the operating system.

For further information, try the DMTCP Sourceforge project page

Documentation

The dmtcp manpage contains an overview of commands and usage. Similar information can also be found in here (slightly outdated).

Getting DMTCP

The latest stable version of DMTCP can be obtained through the sourceforge.net downloads page. Including:

To obtain the most recent (possibly unstable) source from subversion, run the following command:

Programming Interface

DMTCP is completely transparent and can checkpoint unmodified Linux binaries. However, if you wish to call DMTCP from within your checkpointed program, we provide an optional programming interface called DMTCP Aware. To use DMTCP Aware:

Authors

DMTCP and its standalone single-process compontent MTCP (MultiThreaded CheckPointing) were created and are maintained by Jason Ansel, Kapil Arya, Gene Cooperman, Mike Rieker, Ana Maria Visan, and Alex Brick.




SourceForge.net Logo