ACM Transactions on Programming Languages and Systems (TOPLAS)
Checkpointing and Rollback-Recovery for Distributed Systems
IEEE Transactions on Software Engineering - Special issue on distributed systems
Use of Common Time Base for Checkpointing and Rollback Recovery in a Distributed System
IEEE Transactions on Software Engineering
Necessary and Sufficient Conditions for Consistent Global Snapshots
IEEE Transactions on Parallel and Distributed Systems
Distributed snapshots: determining global states of distributed systems
ACM Transactions on Computer Systems (TOCS)
Time, clocks, and the ordering of events in a distributed system
Communications of the ACM
Rollback Recovery in Distributed Systems Using Loosely Synchronized Clocks
IEEE Transactions on Parallel and Distributed Systems
Efficient Rollback-Recovery Technique in Distributed Computing Systems
IEEE Transactions on Parallel and Distributed Systems
Finding Consistent Global Checkpoints in a Distributed Computation
IEEE Transactions on Parallel and Distributed Systems
Global States of a Distributed System
IEEE Transactions on Software Engineering
Hi-index | 0.00 |
In distributed systems running uncoordinated checkpointingschemes, a process should maintain several generationsof local checkpoints to improve dependability,because a global checkpoint, which is a set of local checkpoints,is not always consistent. In this paper, we present analgorithm for .nding a recovery line, where a given checkpointis the earliest, in uncoordinated checkpointingschemes. Numerical examples of probability for the existenceof a recovery line calculated with the proposedalgorithm are also presented.