The Impossibility of Boosting Distributed Service Resilience

Authors:
Paul Attie; Rachid Guerraoui; Petr Kouznetsov; Nancy Lynch; Sergio Rajsbaum
Affiliations:
Northeastern University and Massachusetts Institute of Technology; EPFL; EPFL; Massachusetts Institute of Technology; Universidad Nacional Autónoma de México
Venue:
ICDCS '05 Proceedings of the 25th IEEE International Conference on Distributed Computing Systems
Year:
2005

Abstract

We prove two theorems saying that no distributed system in which processes coordinate using reliable registers and f-resilient services can solve the consensus problem in the presence of f+1 undetectable process stopping failures. (A service is f-resilient if it is guaranteed to operate as long as no more than f of the processes connected to it fail.) Our first theorem assumes that the given services are atomic objects, and allows any connection pattern between processes and services. In contrast, we show that it is possible to boost the resilience of systems solving problems easier than consensus: the k-set consensus problem is solvable for 2k-1 failures using 1-resilient consensus services. The first theorem and its proof generalize to the larger class of failure-oblivious services. Our second theorem allows the system to contain failure-aware services, such as failure detectors, in addition to failure-oblivious services; however, it requires that each failure-aware service be connected to all processes. Thus, f+1 process failures overall can disable all the failure-aware services. In contrast, it is possible to boost the resilience of a system solving consensus if arbitrary patterns of connectivity are allowed between processes and failure-aware services: consensus is solvable for any number of failures using only 1-resilient 2-process perfect failure detectors.
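
The k-set consensus claim can be illustrated with a small sequential simulation. This is a minimal sketch of one plausible construction, not necessarily the one used in the paper: it assumes 2k processes split into k pairs, with each pair sharing its own 2-process consensus service that stays live if one of its two processes stops. The names `PairConsensus` and `k_set_consensus` are hypothetical, and real concurrency and registers are deliberately left out.

```python
class PairConsensus:
    """Toy 2-process consensus service (assumption): the first proposal wins."""

    def __init__(self):
        self._decided = False
        self._value = None

    def propose(self, value):
        # The first caller fixes the decision; later callers adopt it.
        if not self._decided:
            self._decided = True
            self._value = value
        return self._value


def k_set_consensus(proposals, crashed=frozenset()):
    """Simulate k-set consensus among 2k processes using k pair services.

    proposals: list of 2k input values, one per process id 0..2k-1.
    crashed:   ids of processes that stop before taking any step.
    Returns a dict mapping each surviving process id to its decision.
    """
    if len(proposals) % 2 != 0:
        raise ValueError("this sketch assumes an even number of processes (2k)")
    k = len(proposals) // 2
    services = [PairConsensus() for _ in range(k)]
    decisions = {}
    for pid, value in enumerate(proposals):
        if pid in crashed:
            continue  # a stopped process never proposes and never decides
        # Processes 2i and 2i+1 share service i.
        decisions[pid] = services[pid // 2].propose(value)
    return decisions


# k = 3, so 6 processes; 2k-1 = 5 of them stop and the survivor still decides.
inputs = ["a", "b", "c", "d", "e", "f"]
survivor_run = k_set_consensus(inputs, crashed={0, 1, 2, 4, 5})
full_run = k_set_consensus(inputs)
```

In this sketch every surviving process decides a proposed value after one call, and at most k distinct values are decided because there are only k services. Each service only has to tolerate the failure of one of its two processes, which is why the overall system keeps working even when 2k-1 processes stop.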