Communication-Induced Determination of Consistent Snapshots
IEEE Transactions on Parallel and Distributed Systems
Interval consistency of asynchronous distributed computations
Journal of Computer and System Sciences
An Index-Based Mobile Checkpointing and Recovery Algorithm
ICDCN '09 Proceedings of the 10th International Conference on Distributed Computing and Networking
Hi-index | 0.03 |
The paper presents an index based checkpointing algorithm for distributed systems with the aim of reducing the total number of checkpoints while ensuring that each checkpoint belongs to at least one consistent global checkpoint (or recovery line). The algorithm is based on an equivalence relation defined between pairs of successive checkpoints of a process which allows, in some cases, to advance the recovery line of the computation without forcing check points in other processes. This protocol shows good performance, especially in autonomous environments, where each process does not have any private information about other processes.