An Efficient Checkpointing Protocol for the Minimal Characterization of Operational Rollback-Dependency Trackability

  • Authors:
  • Islene C. Garcia;Luiz E. Buzato

  • Affiliations:
  • Universidade Estadual de Campinas, Brasil;Universidade Estadual de Campinas, Brasil

  • Venue:
  • SRDS '04 Proceedings of the 23rd IEEE International Symposium on Reliable Distributed Systems
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

A checkpointing protocol that enforces rollback-dependency trackability (RDT) during the progress of a distributed computation must induce processes to take forced checkpoints to avoid the formation of non-trackable rollback dependencies. A protocol based on the minimal characterization of RDT tests only the smallest set of non-trackable dependencies. The literature indicated that this approach would require the processes to maintain and propagate O(n^2) control information, where n is the number of processes in the computation. In this paper, we present a protocol that implements this approach using only O(n) control information.