==> Regarding Re: Kernel-level checkpointing; Andrew Morton <andrewm@uow.edu.au> adds:
andrewm> Rayson Ho wrote:
>> Hi,
>>
>> I want to develop a user-level application for fault-tolerance
>> servers. Can someone tell me where I can get information about the
>> kernel-level checkpointing (i.e., to write the image and state of a
>> process to disk so that another computer can re-run that process)??
http> //www.cs.rochester.edu/~edpin/epckpt/
bproc provides similar functionality to this. Also check out MOSIX.
bproc:
http://www.beowulf.org/software/bproc.html
MOSIX:
http://www.cnds.jhu.edu/mirrors/mosix/
Note that neither solution tolerates node failure with open network
sockets. It is very difficult to checkpoint TCP/IP state.
Regards,
Jeff Moyer
http://www.missioncriticallinux.com/
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.rutgers.edu
Please read the FAQ at http://www.tux.org/lkml/
This archive was generated by hypermail 2b29 : Mon May 15 2000 - 21:00:25 EST