Distributed Fault Tolerance: Lessons from Delta-4

  • Authors:
  • David Powell

  • Affiliations:
  • -

  • Venue:
  • IEEE Micro
  • Year:
  • 1994

Abstract

Because they avoid extensive redesign of specialized hardware, software-implemented approaches to fault tolerance are very resilient to change. Europe's Delta-4 project argues persuasively for implementing fault tolerance in a distributed fashion. The Delta-4 approach achieves fault tolerance by replicating capsules (runtime representations of application objects) on distributed, LAN-interconnected nodes. Capsule groups can be configured to tolerate either stopping failures or arbitrary failures. Multipoint protocols coordinate the capsule groups and support error processing and fault treatment.
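The difference between tolerating stopping failures and tolerating arbitrary failures can be illustrated with a minimal sketch. It is a generic illustration of replicated execution, not Delta-4's actual protocols: the function names (`replicas_needed`, `resolve_stopping`, `resolve_arbitrary`) are hypothetical, a reply of `None` stands for a replica that has stopped, and the sketch assumes every replica received the same input. Under those assumptions, masking f stopped replicas takes f + 1 replicas, because any reply that arrives is correct, whereas masking f arbitrarily faulty replicas by voting takes 2f + 1, so that f + 1 matching replies must include at least one from a correct replica.

```python
from collections import Counter
from typing import Hashable, Optional, Sequence


def replicas_needed(f: int, arbitrary: bool) -> int:
    """Group size needed to mask f faulty replicas (generic rule of thumb)."""
    return 2 * f + 1 if arbitrary else f + 1


def resolve_stopping(replies: Sequence[Optional[Hashable]]) -> Hashable:
    """Stopping failures: a replica either answers correctly or not at all,
    so the first reply that is present can be trusted."""
    for reply in replies:
        if reply is not None:
            return reply
    raise RuntimeError("all replicas stopped")


def resolve_arbitrary(replies: Sequence[Optional[Hashable]], f: int) -> Hashable:
    """Arbitrary failures: a faulty replica may answer wrongly, so accept a
    value only once f + 1 replicas agree on it."""
    votes = Counter(reply for reply in replies if reply is not None)
    if votes:
        value, count = votes.most_common(1)[0]
        if count >= f + 1:
            return value
    raise RuntimeError("no value reached f + 1 matching replies")
```

For example, with f = 1, `resolve_stopping([None, 42])` returns 42 from a group of two, while `resolve_arbitrary([42, 7, 42], f=1)` needs a group of three to outvote the single wrong reply.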