Implementing rollback-recovery coordinated checkpoints
ISSADS'05 Proceedings of the 5th international conference on Advanced Distributed Systems
Hi-index | 0.00 |
Communication induced checkpointing (CTC) is a style of rollback-recovery which allows processes in a distributed computation to take independent checkpoints without susceptibility to the domino effect. This style of recover has been subject to an increasing interest lately, but most of the work done is algorithmic in nature. This paper presents an analysis of CIC protocols based on a prototype implementation and validated simulations.