On implementing omega with weak reliability and synchrony assumptions

Authors:
Marcos K. Aguilera;Carole Delporte-Gallet;Hugues Fauconnier;Sam Toueg
Affiliations:
Hp Labs Systems Research Center, Palo Alto, CA;LIAFA, Université D. Diderot, 2 Place Jussieu, 75251, Paris Cedex, France;LIAFA, Université D. Diderot, 2 Place Jussieu, 75251, Paris Cedex, France;University of Toronto, Toronto, Canada
Venue:
Proceedings of the twenty-second annual symposium on Principles of distributed computing
Year:
2003

Citing 15
Cited 44

Consensus in the presence of partial synchrony

Journal of the ACM (JACM)
Unreliable failure detectors for reliable distributed systems

Journal of the ACM (JACM)
The weakest failure detector for solving consensus

Journal of the ACM (JACM)
The part-time parliament

ACM Transactions on Computer Systems (TOCS)
Reducing &OHgr; to ◊W

Information Processing Letters
Eventually consistent failure detectors

Proceedings of the thirteenth annual ACM symposium on Parallel algorithms and architectures
On the Quality of Service of Failure Detectors

IEEE Transactions on Computers
Practical byzantine fault tolerance and proactive recovery

ACM Transactions on Computer Systems (TOCS)
Revisiting the Paxos Algorithm

WDAG '97 Proceedings of the 11th International Workshop on Distributed Algorithms
Disk Paxos

DISC '00 Proceedings of the 14th International Conference on Distributed Computing
Stable Leader Election

DISC '01 Proceedings of the 15th International Conference on Distributed Computing
Implementation and Performance Evaluation of an Adaptable Failure Detector

DSN '02 Proceedings of the 2002 International Conference on Dependable Systems and Networks
Optimal Implementation of the Weakest Failure Detector for Solving Consensus

SRDS '00 Proceedings of the 19th IEEE Symposium on Reliable Distributed Systems
On implementing omega with weak reliability and synchrony assumptions

Proceedings of the twenty-second annual symposium on Principles of distributed computing
A gossip-style failure detection service

Middleware '98 Proceedings of the IFIP International Conference on Distributed Systems Platforms and Open Distributed Processing

On implementing omega with weak reliability and synchrony assumptions

Proceedings of the twenty-second annual symposium on Principles of distributed computing
Communication-efficient leader election and consensus with limited link synchrony

Proceedings of the twenty-third annual ACM symposium on Principles of distributed computing
A short introduction to failure detectors for asynchronous distributed systems

ACM SIGACT News
Eventually consistent failure detectors

Journal of Parallel and Distributed Computing
Timeliness, failure-detectors, and consensus performance

Proceedings of the twenty-fifth annual ACM symposium on Principles of distributed computing
Time-Free and Timer-Based Assumptions Can Be Combined to Obtain Eventual Leadership

IEEE Transactions on Parallel and Distributed Systems
Coordinated data aggregation in wireless sensor networks using the Omega failure detector

Proceedings of the 3rd ACM international workshop on Performance evaluation of wireless ad hoc, sensor and ubiquitous networks
Implementing unreliable failure detectors with unknown membership

Information Processing Letters
Adaptive timeliness of consensus in presence of crash and timing faults

Journal of Parallel and Distributed Computing
On the Respective Power of ◊P and ◊S to Solve One-Shot Agreement Problems

IEEE Transactions on Parallel and Distributed Systems
Timeliness-based wait-freedom: a gracefully degrading progress condition

Proceedings of the twenty-seventh ACM symposium on Principles of distributed computing
Agreement and consistency without knowing the number of processes

NOTERE '08 Proceedings of the 8th international conference on New technologies in distributed systems
Optimal message-driven implementations of omega with mute processes

ACM Transactions on Autonomous and Adaptive Systems (TAAS)
Review of DSN'08

ACM SIGACT News
Implementing the Omega failure detector in the crash-recovery failure model

Journal of Computer and System Sciences
Latency-aware leader election

Proceedings of the 2009 ACM symposium on Applied Computing
Two Consensus Algorithms with Atomic Registers and Failure Detector Ω

ICDCN '09 Proceedings of the 10th International Conference on Distributed Computing and Networking
Stabilizing leader election in partial synchronous systems with crash failures

Journal of Parallel and Distributed Computing
A simple and communication-efficient Omega algorithm in the crash-recovery model

Information Processing Letters
Brief Announcement: A Simple and Quiescent Omega Algorithm in the Crash-Recovery Model

SSS '09 Proceedings of the 11th International Symposium on Stabilization, Safety, and Security of Distributed Systems
Brief Announcement: Communication-Efficient Self-stabilizing Protocols for Spanning-Tree Construction

OPODIS '09 Proceedings of the 13th International Conference on Principles of Distributed Systems
Weak Synchrony Models and Failure Detectors for Message Passing (k-)Set Agreement

OPODIS '09 Proceedings of the 13th International Conference on Principles of Distributed Systems
Optimal message-driven implementation of omega with mute processes

SSS'06 Proceedings of the 8th international conference on Stabilization, safety, and security of distributed systems
From an intermittent rotating star to a leader

OPODIS'07 Proceedings of the 11th international conference on Principles of distributed systems
Relating stabilizing timing assumptions to stabilizing failure detectors regarding solvability and efficiency

SSS'07 Proceedings of the 9h international conference on Stabilization, safety, and security of distributed systems
Robust stabilizing leader election

SSS'07 Proceedings of the 9h international conference on Stabilization, safety, and security of distributed systems
Cooperative leader election algorithm for master/slave mobile ad hoc networks

WD'09 Proceedings of the 2nd IFIP conference on Wireless days
When consensus meets self-stabilization

Journal of Computer and System Sciences
The failure detector abstraction

ACM Computing Surveys (CSUR)
Communication-efficient and crash-quiescent Omega with unknown membership

Information Processing Letters
Approximation of δ-timeliness

SSS'10 Proceedings of the 12th international conference on Stabilization, safety, and security of distributed systems
Efficient fault tolerant consensus using preemptive token

ACAI '11 Proceedings of the International Conference on Advances in Computing and Artificial Intelligence
Communication-efficient leader election in crash-recovery systems

Journal of Systems and Software
When consensus meets self-stabilization

OPODIS'06 Proceedings of the 10th international conference on Principles of Distributed Systems
On the possibility and the impossibility of message-driven self-stabilizing failure detection

SSS'05 Proceedings of the 7th international conference on Self-Stabilizing Systems
Revisiting failure detection and consensus in omission failure environments

ICTAC'05 Proceedings of the Second international conference on Theoretical Aspects of Computing
Ω meets paxos: leader election and stability without eventual timely links

DISC'05 Proceedings of the 19th international conference on Distributed Computing
Implementing reliable distributed real-time systems with the Θ-model

OPODIS'05 Proceedings of the 9th international conference on Principles of Distributed Systems
Modular approach to replication for availability

Replication
Algorithms for extracting timeliness graphs

SIROCCO'10 Proceedings of the 17th international conference on Structural Information and Communication Complexity
Specifying and implementing an eventual leader service for dynamic systems

International Journal of Web and Grid Services
On the implementation of communication-optimal failure detectors

LADC'07 Proceedings of the Third Latin-American conference on Dependable Computing
Communication-Efficient self-stabilization in wireless networks

SSS'12 Proceedings of the 14th international conference on Stabilization, Safety, and Security of Distributed Systems
Tailoring consistency in group membership for mobile networks

Future Generation Computer Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

We study the feasibility and cost of implementing Ω---a fundamental failure detector at the core of many algorithms---in systems with weak reliability and synchrony assumptions. Intuitively, Ω allows processes to eventually elect a common leader. We first give an algorithm that implements Ω in a weak system S where processes are synchronous, but: (a) any number of them may crash, and (b) only the output links of an unknown correct process are eventually timely (all other links can be asynchronous and/or lossy). This is in contrast to previous implementations of Ω which assume that a quadratic number of links are eventually timely, or systems that are strong enough to implement the eventually perfect failure detector P. We next show that implementing Ω in S is expensive: even if we want an implementation that tolerates just one process crash, all correct processes (except possibly one) must send messages forever; moreover, a quadratic number of links must carry messages forever. We then show that with a small additional assumption---the existence of some unknown correct process whose asynchronous links are lossy but fair---we can implement Ω efficiently: we give an algorithm for Ω such that eventually only one process (the elected leader) sends messages.