Modularizing fault-tolerant protocols

  • Authors:
  • Matti A. Hiltunen;Richard D. Schlichting

  • Affiliations:
  • The University of Arizona, Tucson, AZ;The University of Arizona, Tucson, AZ

  • Venue:
  • EW 5 Proceedings of the 5th workshop on ACM SIGOPS European workshop: Models and paradigms for distributed systems structuring
  • Year:
  • 1992

Quantified Score

Hi-index 0.00

Visualization

Abstract

Fault-tolerant protocols such as group multicast and membership are important abstractions that help simplify the development of distributed operating systems with fault-tolerance requirements. Unfortunately, these protocols are often very complicated in their own right, which makes their design and implementation a non-trivial task. If the power of these abstractions is to be effectively utilized in future systems, further efforts to improve our understanding of these protocols and their fundamental properties are needed.Our current research is addressing these issues by applying modularization techniques to fault-tolerant protocols. Our approach is based on identifying orthogonal properties of a given protocol, and then realizing these properties as separate modules within a standard system framework. For example, group multicast protocols are often defined to be some combination of reliability, atomicity, and consistent ordering; reliability is the property that a given message is always delivered, atomicity that either all group members or no group member receives the message, and consistent ordering that all group members see the same (causal or total) order of messages. Our goal is to develop a new model for fault-tolerant protocols that will facilitate such modularization, and hence, make it easier to understand both the properties themselves and their inherent dependencies. This research builds on previous work involving Psync [Pete89] and Consul [Mish91, Mish92].