Fault-tolerant wait-free shared objects

Authors:
Prasad Jayanti;Tushar Deepak Chandra;Sam Toueg
Affiliations:
Dartmouth College, Hanover, NH;IBM T. J. Watson Research Center, Hawthorne, NY;Cornell Univ., Ithaca, NY
Venue:
Journal of the ACM (JACM)
Year:
1998

Abstract

Wait-free implementations of shared objects tolerate the failure of processes, but not the failure of the base objects from which they are implemented. We consider the problem of implementing shared objects that tolerate the failure of both processes and base objects.

We identify two classes of object failures: responsive and nonresponsive. With responsive failures, a faulty object responds to every operation, but its responses may be incorrect. With nonresponsive failures, a faulty object may also "hang" without responding. In each class, we define crash, omission, and arbitrary modes of failure.

We show that all responsive failure modes can be tolerated. More precisely, for all responsive failure modes F, object types T, and t ≥ 0, we show how to implement a shared object of type T which is t-tolerant for F. Such an object remains correct and wait-free even if up to t base objects fail according to F. In contrast to responsive failures, we show that even the most benign nonresponsive failure mode cannot be tolerated. We also show that randomization can be used to circumvent this impossibility result.

Graceful degradation is a desirable property of fault-tolerant implementations: the implemented object never fails more severely than the base objects it is derived from, even if all the base objects fail. For several failure modes, we show whether this property can be achieved, and, if so, how.
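
The notion of a t-tolerant object from the abstract can be illustrated with a small, single-threaded sketch. It is not the construction from the paper. It assumes a simplified responsive crash mode in which a failed base register answers every later operation with a distinguished failure value (here `BOTTOM`), and it ignores concurrency entirely. The names `BaseRegister`, `TolerantRegister`, `crash`, and `BOTTOM` are hypothetical and chosen only for this example.

```python
from typing import Optional

# Assumption: a crashed base object answers with a distinguished value.
# None plays that role here; stored values are ints, so there is no clash.
BOTTOM = None


class BaseRegister:
    """Hypothetical base register with a simplified responsive crash mode."""

    def __init__(self, initial: int = 0) -> None:
        self._value = initial
        self._crashed = False

    def crash(self) -> None:
        """Inject a failure: every later operation answers BOTTOM."""
        self._crashed = True

    def write(self, value: int) -> Optional[str]:
        if self._crashed:
            return BOTTOM  # still responds, but signals failure
        self._value = value
        return "ack"

    def read(self) -> Optional[int]:
        if self._crashed:
            return BOTTOM
        return self._value


class TolerantRegister:
    """Sequential register built from t + 1 base registers.

    Stays correct as long as at most t base registers have crashed,
    because at least one non-crashed copy holds the last written value.
    """

    def __init__(self, t: int, initial: int = 0) -> None:
        self.bases = [BaseRegister(initial) for _ in range(t + 1)]

    def write(self, value: int) -> None:
        # Write to every copy; crashed copies answer BOTTOM and are ignored.
        for base in self.bases:
            base.write(value)

    def read(self) -> int:
        # Return the answer of the first copy that has not crashed.
        for base in self.bases:
            result = base.read()
            if result is not BOTTOM:
                return result
        raise RuntimeError("more than t base registers have failed")


reg = TolerantRegister(t=2)
reg.write(7)
reg.bases[0].crash()
reg.bases[2].crash()
print(reg.read())
```

With t = 2 the sketch uses three base registers, so the read still succeeds after two of them crash. A nonresponsive failure would correspond to a base operation that never returns; the loop in `read` would then block on that copy, which gives some intuition for why the abstract treats the nonresponsive class separately.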