A necessary and sufficient condition for transforming limited accuracy failure detectors

Authors:
E. Anceaume;A. Fernández;A. Mostefaoui;G. Neiger;M. Raynal
Affiliations:
IRISA, Campus de Beaulieu, Université de Rennes 1, 35042 Rennes Cedex, France;Universidad Rey Juan Carlos, 28933 Móstoles, Madrid, Spain;IRISA, Campus de Beaulieu, Université de Rennes 1, 35042 Rennes Cedex, France;Intel Corporation, JF3-332, 2111 NE 25th Avenue, Hillsboro, OR;IRISA, Campus de Beaulieu, Université de Rennes 1, 35042 Rennes Cedex, France
Venue:
Journal of Computer and System Sciences
Year:
2004

Citing 12
Cited 6

On the minimal synchronism needed for distributed consensus

Journal of the ACM (JACM)
Extended impossibility results for asynchronous complete networks

Information Processing Letters
Consensus in the presence of partial synchrony

Journal of the ACM (JACM)
More choices allow more faults: set consensus problems in totally asynchronous systems

Information and Computation
Impossibility of distributed consensus with one faulty process

Journal of the ACM (JACM)
Unreliable failure detectors for reliable distributed systems

Journal of the ACM (JACM)
Structured derivations of consensus algorithms for failure detectors

PODC '98 Proceedings of the seventeenth annual ACM symposium on Principles of distributed computing
Restricted failure detectors: definition and reduction protocols

Information Processing Letters
k-set agreement with limited accuracy failure detectors

Proceedings of the nineteenth annual ACM symposium on Principles of distributed computing
A Versatile Family of Consensus Protocols Based on Chandra-Toueg's Unreliable Failure Detectors

IEEE Transactions on Computers
"Gamma-Accurate" Failure Detectors

WDAG '96 Proceedings of the 10th International Workshop on Distributed Algorithms
A simple and fast asynchronous consensus protocol based on a weak failure detector

Distributed Computing

On the Implementation of Unreliable Failure Detectors in Partially Synchronous Systems

IEEE Transactions on Computers
Communication-efficient leader election and consensus with limited link synchrony

Proceedings of the twenty-third annual ACM symposium on Principles of distributed computing
Wait-free dining under eventual weak exclusion

ICDCN'08 Proceedings of the 9th international conference on Distributed computing and networking
The Asynchronous Bounded-Cycle model

Theoretical Computer Science
Building and using quorums despite any number of process of crashes

EDCC'05 Proceedings of the 5th European conference on Dependable Computing
Ω meets paxos: leader election and stability without eventual timely links

DISC'05 Proceedings of the 19th international conference on Distributed Computing

Quantified Score

Hi-index	0.00

Visualization

Abstract

Unreliable failure detectors are oracles that give information about process failures. Chandra and Toueg were first to study such failure detectors for distributed systems, and they identified a number that enabled the solution of the Consensus problem in asynchronous distributed systems. This paper focuses on two of these, denoted J (strong) and ♦J (eventually strong). The characteristics of a given unreliable failure detector are usually described by its completeness and accuracy properties. Completeness is a requirement on the actual detection of failures, while accuracy limits the mistakes a failure detector can make. Let the scope of the accuracy property of an unreliable failure detector be the minimum number (k) of processes that may not erroneously suspect a correct process to have crashed. Usual failure detectors implicitly consider a scope equal to n (the total number of processes). Accuracy properties with limited scope give rise to the classes of failure detectors that we call Jk and ♦Jk. This paper investigates the following question: "Given Jk and ♦Jk, under which condition is it possible to transform their failure detectors into their counterparts with unlimited accuracy, i.e., AP and J ♦J?". The paper answers this question in the following way. It first presents a particularly simple protocol that realizes such a transformation when f k (where f is the maximum number of processes that may crash). Then, it shows that there is no reduction protocol when f ≥ k.