Reaching approximate agreement in the presence of faults
Journal of the ACM (JACM)
Reliable communication in the presence of failures
ACM Transactions on Computer Systems (TOCS)
On the minimal synchronism needed for distributed consensus
Journal of the ACM (JACM)
Fault-tolerant decision making in totally asynchronous distributed systems
PODC '87 Proceedings of the sixth annual ACM Symposium on Principles of distributed computing
Consensus in the presence of partial synchrony
Journal of the ACM (JACM)
Reliable scheduling in a TMR database system
ACM Transactions on Computer Systems (TOCS)
PODC '88 Proceedings of the seventh annual ACM Symposium on Principles of distributed computing
Preserving and using context information in interprocess communication
ACM Transactions on Computer Systems (TOCS)
Knowledge and common knowledge in a distributed environment
Journal of the ACM (JACM)
Automatically increasing the fault-tolerance of distributed algorithms
Journal of Algorithms
Early-delivery atomic broadcast
PODC '90 Proceedings of the ninth annual ACM symposium on Principles of distributed computing
Implementing fault-tolerant services using the state machine approach: a tutorial
ACM Computing Surveys (CSUR)
Bounds on the time to reach agreement in the presence of timing uncertainty
STOC '91 Proceedings of the twenty-third annual ACM symposium on Theory of computing
Time and message efficient reliable broadcasts
Proceedings of the 4th international workshop on Distributed algorithms
Early—stopping distributed bidding and applications (preliminary version)
Proceedings of the 4th international workshop on Distributed algorithms
Using process groups to implement failure detection in asynchronous environments
PODC '91 Proceedings of the tenth annual ACM symposium on Principles of distributed computing
The weakest failure detector for solving consensus
PODC '92 Proceedings of the eleventh annual ACM symposium on Principles of distributed computing
Impossibility of distributed consensus with one faulty process
Journal of the ACM (JACM)
Asynchronous consensus and broadcast protocols
Journal of the ACM (JACM)
Failure detectors and the wait-free hierarchy (extended abstract)
Proceedings of the fourteenth annual ACM symposium on Principles of distributed computing
Fault-tolerant broadcasts and related problems
Distributed systems (2nd Ed.)
Reaching Agreement in the Presence of Faults
Journal of the ACM (JACM)
The Byzantine Generals Problem
ACM Transactions on Programming Languages and Systems (TOPLAS)
ACM Transactions on Computer Systems (TOCS)
Delta Four: A Generic Architecture for Dependable Distributed Computing
Delta Four: A Generic Architecture for Dependable Distributed Computing
Using Failure Detectors to Solve Consensus in Asynchronous Sharde-Memory Systems (Extended Abstract)
WDAG '94 Proceedings of the 8th International Workshop on Distributed Algorithms
Revistiting the Relationship Between Non-Blocking Atomic Commitment and Consensus
WDAG '95 Proceedings of the 9th International Workshop on Distributed Algorithms
Another advantage of free choice (Extended Abstract): Completely asynchronous agreement protocols
PODC '83 Proceedings of the second annual ACM symposium on Principles of distributed computing
A Modular Approach to Fault-Tolerant Broadcasts and Related Problems
A Modular Approach to Fault-Tolerant Broadcasts and Related Problems
On the Impossibility of Group Membership
On the Impossibility of Group Membership
Totem: a fault-tolerant multicast group communication system
Communications of the ACM
The weakest failure detector for solving consensus
Journal of the ACM (JACM)
ACM SIGOPS Operating Systems Review
Dynamic voting for consistent primary components
PODC '97 Proceedings of the sixteenth annual ACM symposium on Principles of distributed computing
Round-by-round fault detectors (extended abstract): unifying synchrony and asynchrony
PODC '98 Proceedings of the seventeenth annual ACM symposium on Principles of distributed computing
The message classification model
PODC '98 Proceedings of the seventeenth annual ACM symposium on Principles of distributed computing
Structured derivations of consensus algorithms for failure detectors
PODC '98 Proceedings of the seventeenth annual ACM symposium on Principles of distributed computing
A Configurable Membership Service
IEEE Transactions on Computers
System support for object groups
Proceedings of the 13th ACM SIGPLAN conference on Object-oriented programming, systems, languages, and applications
Coyote: a system for constructing fine-grain configurable communication services
ACM Transactions on Computer Systems (TOCS)
Client-Access Protocols for Replicated Services
IEEE Transactions on Software Engineering
A knowledge-theoretic analysis of uniform distributed coordination and failure detectors
Proceedings of the eighteenth annual ACM symposium on Principles of distributed computing
Fundamentals of fault-tolerant distributed computing in asynchronous environments
ACM Computing Surveys (CSUR)
Proceedings of the nineteenth annual ACM symposium on Principles of distributed computing
k-set agreement with limited accuracy failure detectors
Proceedings of the nineteenth annual ACM symposium on Principles of distributed computing
Efficient atomic broadcast using deterministic merge
Proceedings of the nineteenth annual ACM symposium on Principles of distributed computing
Indulgent algorithms (preliminary version)
Proceedings of the nineteenth annual ACM symposium on Principles of distributed computing
Computing Global Functions in Asynchronous Distributed Systems with Perfect Failure Detectors
IEEE Transactions on Parallel and Distributed Systems
IEEE Transactions on Software Engineering
Time and message-efficient S-based consensus (brief announcement)
Proceedings of the nineteenth annual ACM symposium on Principles of distributed computing
Optimal implementation of the weakest failure detector for solving consensus (brief announcement)
Proceedings of the nineteenth annual ACM symposium on Principles of distributed computing
Middleware for dependable network services in partitionable distributed systems
ACM SIGOPS Operating Systems Review
Computing in the RAIN: A Reliable Array of Independent Nodes
IEEE Transactions on Parallel and Distributed Systems
Implementing E-Transactions with Asynchronous Replication
IEEE Transactions on Parallel and Distributed Systems
Proceedings of the thirteenth annual ACM symposium on Parallel algorithms and architectures
Eventually consistent failure detectors
Proceedings of the thirteenth annual ACM symposium on Parallel algorithms and architectures
Conditions on input vectors for consensus solvability in asynchronous distributed systems
STOC '01 Proceedings of the thirty-third annual ACM symposium on Theory of computing
Group Communication in Partitionable Systems: Specification and Algorithms
IEEE Transactions on Software Engineering
Consensus-based fault-tolerant total order multicast
IEEE Transactions on Parallel and Distributed Systems
A hierarchy of conditions for consensus solvability
Proceedings of the twentieth annual ACM symposium on Principles of distributed computing
On scalable and efficient distributed failure detectors
Proceedings of the twentieth annual ACM symposium on Principles of distributed computing
Group communication specifications: a comprehensive study
ACM Computing Surveys (CSUR)
The SecureRing group communication system
ACM Transactions on Information and System Security (TISSEC)
EW 7 Proceedings of the 7th workshop on ACM SIGOPS European workshop: Systems support for worldwide applications
Fault Detection for Byzantine Quorum Systems
IEEE Transactions on Parallel and Distributed Systems
On the Quality of Service of Failure Detectors
IEEE Transactions on Computers
e-Transactions: End-to-End Reliability for Three-Tier Architectures
IEEE Transactions on Software Engineering
Moshe: A group membership service for WANs
ACM Transactions on Computer Systems (TOCS)
On the Quality of Service of Failure Detectors
IEEE Transactions on Computers
Active disk paxos with infinitely many processes
Proceedings of the twenty-first annual symposium on Principles of distributed computing
The inherent price of indulgence
Proceedings of the twenty-first annual symposium on Principles of distributed computing
Evaluating the running time of a communication round over the internet
Proceedings of the twenty-first annual symposium on Principles of distributed computing
Understanding perfect failure detectors
Proceedings of the twenty-first annual symposium on Principles of distributed computing
Early stopping in global data computation
Proceedings of the twenty-first annual symposium on Principles of distributed computing
Collaborative Group Membership
The Journal of Supercomputing - Special issue on computational issues in fluid dynamics optimization and simulation
A fault detection service for wide area distributed computations
Cluster Computing
Constructing Dependable Web Services
IEEE Internet Computing
Mastering Agreement Problems in Distributed Systems
IEEE Software
A Versatile Family of Consensus Protocols Based on Chandra-Toueg's Unreliable Failure Detectors
IEEE Transactions on Computers
Solving the Group Priority Inversion Problem in a Timed Asynchronous System
IEEE Transactions on Computers
The Timely Computing Base Model and Architecture
IEEE Transactions on Computers
Fast Asynchronous Uniform Consensus in Real-Time Distributed Systems
IEEE Transactions on Computers
Consensus-Based Fault-Tolerant Total Order Multicast
IEEE Transactions on Parallel and Distributed Systems
A Group Membership Algorithm with a Practical Specification
IEEE Transactions on Parallel and Distributed Systems
On the Asymptotical Optimality of Multilayered Decentralized Consensus Protocol
IEEE Transactions on Parallel and Distributed Systems
ACM SIGACT News
An introduction to oracles for asynchronous distributed systems
Future Generation Computer Systems - Parallel computing technologies (PaCT-2001)
The Database State Machine Approach
Distributed and Parallel Databases
Perfect Failure Detection in Timed Asynchronous Systems
IEEE Transactions on Computers
Semantically Reliable Multicast: Definition, Implementation, and Performance Evaluation
IEEE Transactions on Computers
Fault-Tolerant Mobile Agent Execution
IEEE Transactions on Computers
Muteness Failure Detectors: Specification and Implementation
EDCC-3 Proceedings of the Third European Dependable Computing Conference on Dependable Computing
Solving Agreement Problems with Weak Ordering Oracles
EDCC-4 Proceedings of the 4th European Dependable Computing Conference on Dependable Computing
An Efficient Solution to the k-Set Agreement Problem
EDCC-4 Proceedings of the 4th European Dependable Computing Conference on Dependable Computing
Fast Indulgent Consensus with Zero Degradation
EDCC-4 Proceedings of the 4th European Dependable Computing Conference on Dependable Computing
Trustless Grid Computing in ConCert
GRID '02 Proceedings of the Third International Workshop on Grid Computing
On Group Communication Systems: Insight, a Primer, and a Snapshot
ICCS '01 Proceedings of the International Conference on Computational Sciences-Part I
Consensus Based on Strong Failure Detectors: A Time and Message-Efficient Protocol
IPDPS '00 Proceedings of the 15 IPDPS 2000 Workshops on Parallel and Distributed Processing
Dynamically Scaling Computer Networks
IPDPS '01 Proceedings of the 15th International Parallel & Distributed Processing Symposium
A Condition for k-Set Agreement in Asynchronous Distributed Systems
IPDPS '01 Proceedings of the 15th International Parallel & Distributed Processing Symposium
Logically Instantaneous Communication on Top of Distributed Memory Parallel Machines
PaCT '999 Proceedings of the 5th International Conference on Parallel Computing Technologies
Quiescent Uniform Reliable Broadcast as an Introduction to Failure Detector Oracles
PaCT '01 Proceedings of the 6th International Conference on Parallel Computing Technologies
Consensus in One Communication Step
PaCT '01 Proceedings of the 6th International Conference on Parallel Computing Technologies
Failure Detection vs Group Membership in Fault-Tolerant Distributed Systems: Hidden Trade-Offs
PAPM-PROBMIV '02 Proceedings of the Second Joint International Workshop on Process Algebra and Probabilistic Methods, Performance Modeling and Verification
The Agreement Problem Protocol Verification Environment
Proceedings of the 9th International SPIN Workshop on Model Checking of Software
Asynchronous Group Membership with Oracles
Proceedings of the 13th International Symposium on Distributed Computing
Proceedings of the 13th International Symposium on Distributed Computing
Revising the Weakest Failure Detector for Uniform Reliable Broadcast
Proceedings of the 13th International Symposium on Distributed Computing
Efficient Algorithms to Implement Unreliable Failure Detectors in Partially Synchronous Systems
Proceedings of the 13th International Symposium on Distributed Computing
A Dynamic Primary Configuration Group Communication Service
Proceedings of the 13th International Symposium on Distributed Computing
A Low-Latency Non-blocking Commit Service
DISC '01 Proceedings of the 15th International Conference on Distributed Computing
DISC '01 Proceedings of the 15th International Conference on Distributed Computing
Distributed Agreement and Its Relation with Error-Correcting Codes
DISC '02 Proceedings of the 16th International Conference on Distributed Computing
Condition-Based Protocols for Set Agreement Problems
DISC '02 Proceedings of the 16th International Conference on Distributed Computing
Ruminations on Domain-Based Reliable Broadcast
DISC '02 Proceedings of the 16th International Conference on Distributed Computing
Ad Hoc Membership for Scalable Applications
DISC '02 Proceedings of the 16th International Conference on Distributed Computing
Wait-Free n-Set Consensus When Inputs Are Restricted
DISC '02 Proceedings of the 16th International Conference on Distributed Computing
Failure Detection Lower Bounds on Registers and Consensus
DISC '02 Proceedings of the 16th International Conference on Distributed Computing
DISC '02 Proceedings of the 16th International Conference on Distributed Computing
Analysis of an Election Problem for CSCW in Asynchronous Distributed Systems
EDCIS '02 Proceedings of the First International Conference on Engineering and Deployment of Cooperative Information Systems
A Hybrid Fault-Tolerant Scheme Based on Checkpointing in MASs
ICOIN '02 Revised Papers from the International Conference on Information Networking, Wireless Communications Technologies and Network Applications-Part II
The Weakest Failure Detector for Solving Election Problems in Asynchronous Distributed Systems
EurAsia-ICT '02 Proceedings of the First EurAsian Conference on Information and Communication Technology
Encapsulating Failure Detection: From Crash to Byzantine Failures
Ada-Europe '02 Proceedings of the 7th Ada-Europe International Conference on Reliable Software Technologies
Quorum-Based Replication in Asynchronous Crash-Recovery Distributed Systems (Research Note)
Euro-Par '00 Proceedings from the 6th International Euro-Par Conference on Parallel Processing
Integrating Optimistic Virtual Synchrony to a CORBA Object Group Service
On the Move to Meaningful Internet Systems, 2002 - DOA/CoopIS/ODBASE 2002 Confederated International Conferences DOA, CoopIS and ODBASE 2002
Secure and Efficient Asynchronous Broadcast Protocols
CRYPTO '01 Proceedings of the 21st Annual International Cryptology Conference on Advances in Cryptology
Unreliable Failure Detectors with Limited Scope Accuracy and an Application to Consensus
Proceedings of the 19th Conference on Foundations of Software Technology and Theoretical Computer Science
Agreement Problems in Fault-Tolerant Distributed Systems
SOFSEM '01 Proceedings of the 28th Conference on Current Trends in Theory and Practice of Informatics Piestany: Theory and Practice of Informatics
Cooperating Mobile Agents and Stabilization
WSS '01 Proceedings of the 5th International Workshop on Self-Stabilizing Systems
(Im)Possibilities of Predicate Detection in Crash-Affected Systems
WSS '01 Proceedings of the 5th International Workshop on Self-Stabilizing Systems
Constructing Dependable Web Services
Advances in Distributed Systems, Advanced Distributed Computing: From Algorithms to Systems
Topology-Aware Algorithms for Large-Scale Communication
Advances in Distributed Systems, Advanced Distributed Computing: From Algorithms to Systems
Integrating Group Communication with Transactions for Implementing Persistent Replicated Objects
Advances in Distributed Systems, Advanced Distributed Computing: From Algorithms to Systems
Time in Distributed System Models and Algorithms
Advances in Distributed Systems, Advanced Distributed Computing: From Algorithms to Systems
Advances in Distributed Systems, Advanced Distributed Computing: From Algorithms to Systems
Group Communication in Partitionable Distributed Systems
Advances in Distributed Systems, Advanced Distributed Computing: From Algorithms to Systems
Improving Scalability of Replicated Services in Mobile Agent Systems
MA '02 Proceedings of the 6th International Conference on Mobile Agents
Avoiding Priority Inversion on the Processing of Requests by Active Replicated Servers
DSN '01 Proceedings of the 2001 International Conference on Dependable Systems and Networks (formerly: FTCS)
Distributing Trust on the Internet
DSN '01 Proceedings of the 2001 International Conference on Dependable Systems and Networks (formerly: FTCS)
FATOMAS-A Fault-Tolerant Mobile Agent System Based on the Agent-Dependent Approach
DSN '01 Proceedings of the 2001 International Conference on Dependable Systems and Networks (formerly: FTCS)
Proceedings of the 13th International Symposium on Distributed Computing
On the Importance of Having an Identity or is Consensus Really Universal?
DISC '00 Proceedings of the 14th International Conference on Distributed Computing
Computing in the RAIN: A Reliable Array of Independent Nodes
IPDPS '00 Proceedings of the 15 IPDPS 2000 Workshops on Parallel and Distributed Processing
DISC '00 Proceedings of the 14th International Conference on Distributed Computing
Optimistic atomic broadcast: a pragmatic viewpoint
Theoretical Computer Science - Special issue: Distributed computing
Performance Evaluation of a Consensus Algorithm with Petri Nets
PNPM '97 Proceedings of the 6th International Workshop on Petri Nets and Performance Models
SRDS '96 Proceedings of the 15th Symposium on Reliable Distributed Systems
A General Framework to Solve Agreement Problems
SRDS '99 Proceedings of the 18th IEEE Symposium on Reliable Distributed Systems
Real-Time Fault-Tolerant Atomic Broadcast
SRDS '99 Proceedings of the 18th IEEE Symposium on Reliable Distributed Systems
Fault-Tolerant Replication Management in Large-Scale Distributed Storage Systems
SRDS '99 Proceedings of the 18th IEEE Symposium on Reliable Distributed Systems
Real-time dependable decisions in timed asynchronous distributed systems
WORDS '97 Proceedings of the 3rd Workshop on Object-Oriented Real-Time Dependable Systems - (WORDS '97)
Synchronous Consensus for Dependent Process Failures
ICDCS '03 Proceedings of the 23rd International Conference on Distributed Computing Systems
ICDCS '03 Proceedings of the 23rd International Conference on Distributed Computing Systems
A Generic Framework for Indulgent Consensus
ICDCS '03 Proceedings of the 23rd International Conference on Distributed Computing Systems
On implementing omega with weak reliability and synchrony assumptions
Proceedings of the twenty-second annual symposium on Principles of distributed computing
Three-tier replication for FT-CORBA infrastructures
Software—Practice & Experience
ICDCS '01 Proceedings of the The 21st International Conference on Distributed Computing Systems
Enforcing Perfect Failure Detection
ICDCS '01 Proceedings of the The 21st International Conference on Distributed Computing Systems
Asynchronous consensus protocol for the unreliable un-fully connected network
ACM SIGOPS Operating Systems Review
ACM SIGACT news distributed computing column 11
ACM SIGACT News
IEEE Transactions on Knowledge and Data Engineering
Fault-tolerant grid architecture and practice
Journal of Computer Science and Technology - Grid computing
Conditions on input vectors for consensus solvability in asynchronous distributed systems
Journal of the ACM (JACM)
A Timeout-Based Message Ordering Protocol for a Lightweight Software Implementation of TMR Systems
IEEE Transactions on Parallel and Distributed Systems
An Extended Multi-Agent Negotiation Protocol
Autonomous Agents and Multi-Agent Systems
Synthesis of fault-tolerant concurrent programs
ACM Transactions on Programming Languages and Systems (TOPLAS)
Dealing efficiently with data-center disasters
Journal of Parallel and Distributed Computing
Consensus in byzantine asynchronous systems
Journal of Discrete Algorithms
Distributed recovery with K-optimistic logging
Journal of Parallel and Distributed Computing
Non-blocking atomic commit in asynchronous distributed systems with failure detectors
Distributed Computing
Hundreds of impossibility results for distributed computing
Distributed Computing - Papers in celebration of the 20th anniversary of PODC
Randomized protocols for asynchronous consensus
Distributed Computing - Papers in celebration of the 20th anniversary of PODC
Appraising two decades of distributed computing theory research
Distributed Computing - Papers in celebration of the 20th anniversary of PODC
The Information Structure of Indulgent Consensus
IEEE Transactions on Computers
A necessary and sufficient condition for transforming limited accuracy failure detectors
Journal of Computer and System Sciences
Performance Analysis of Adaptive Consensus Protocols Based on Slowness Oracles
ICDCSW '04 Proceedings of the 24th International Conference on Distributed Computing Systems Workshops - W7: EC (ICDCSW'04) - Volume 7
A flexible formal framework for masking/demasking faults
Information Sciences—Informatics and Computer Science: An International Journal
A weakest failure detector-based asynchronous consensus protocol for f
Information Processing Letters
Uniform consensus is harder than consensus
Journal of Algorithms
On the Implementation of Unreliable Failure Detectors in Partially Synchronous Systems
IEEE Transactions on Computers
Communication-efficient leader election and consensus with limited link synchrony
Proceedings of the twenty-third annual ACM symposium on Principles of distributed computing
The weakest failure detectors to solve certain fundamental problems in distributed computing
Proceedings of the twenty-third annual ACM symposium on Principles of distributed computing
Group membership: a novel approach and the first single-round algorithm
Proceedings of the twenty-third annual ACM symposium on Principles of distributed computing
Asynchronous group key exchange with failures
Proceedings of the twenty-third annual ACM symposium on Principles of distributed computing
Condition-based consensus solvability: a hierarchy of conditions and efficient protocols
Distributed Computing
Information Systems - Special issue: Data quality in cooperative information systems
Failure Detection and Membership Management in Grid Environments
GRID '04 Proceedings of the 5th IEEE/ACM International Workshop on Grid Computing
Approaches to fault-tolerant and transactional mobile agent execution---an algorithmic view
ACM Computing Surveys (CSUR)
Failure detection and consensus in the crash-recovery model
Distributed Computing
Handling message semantics with Generic Broadcast protocols
Distributed Computing
Total order broadcast and multicast algorithms: Taxonomy and survey
ACM Computing Surveys (CSUR)
UbiCrawler: a scalable fully distributed web crawler
Software—Practice & Experience
RPC-V: Toward Fault-Tolerant RPC for Internet Connected Desktop Grids with Volatile Nodes
Proceedings of the 2004 ACM/IEEE conference on Supercomputing
Failure, connectivity and disconnection detectors
UbiMob '04 Proceedings of the 1st French-speaking conference on Mobility and ubiquity computing
IEEE Transactions on Computers
SCADA with Fault Tolerant CORBA on Fault Tolerant LANE ATM
IPDPS '05 Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Workshop 16 - Volume 17
ACM Computing Surveys (CSUR)
Unification of Transactions and Replication in Three-Tier Architectures Based on CORBA
IEEE Transactions on Dependable and Secure Computing
Simple and Efficient Oracle-Based Consensus Protocols for Asynchronous Byzantine Systems
IEEE Transactions on Dependable and Secure Computing
The mobile groups approach for the coordination of mobile agents
Journal of Parallel and Distributed Computing
Eventually consistent failure detectors
Journal of Parallel and Distributed Computing
Reliable and total order broadcast in the crash-recovery model
Journal of Parallel and Distributed Computing
Mutual exclusion in asynchronous systems with failure detectors
Journal of Parallel and Distributed Computing
The combined power of conditions and failure detectors to solve asynchronous set agreement
Proceedings of the twenty-fourth annual ACM symposium on Principles of distributed computing
The weakest failure detector to solve nonuniform consensus
Proceedings of the twenty-fourth annual ACM symposium on Principles of distributed computing
Consensus and collision detectors in wireless Ad Hoc networks
Proceedings of the twenty-fourth annual ACM symposium on Principles of distributed computing
Brief announcement: minimal system conditions to implement unreliable failure detectors
Proceedings of the twenty-fourth annual ACM symposium on Principles of distributed computing
Proceedings of the twenty-fourth annual ACM symposium on Principles of distributed computing
Toward a theory of transactional contention managers
Proceedings of the twenty-fourth annual ACM symposium on Principles of distributed computing
Solving Vector Consensus with a Wormhole
IEEE Transactions on Parallel and Distributed Systems
A theory of system behaviour in the presence of node and link failures
CONCUR 2005 - Concurrency Theory
Détection de partition pour la gestion de groupes en environnement mobile
UbiMob '05 Proceedings of the 2nd French-speaking conference on Mobility and ubiquity computing
Putting Detectors in Their Place
SEFM '05 Proceedings of the Third IEEE International Conference on Software Engineering and Formal Methods
EURASIP Journal on Wireless Communications and Networking
From Set Membership to Group Membership: A Separation of Concerns
IEEE Transactions on Dependable and Secure Computing
Illustrating the impossibility of crash-tolerant consensus in asynchronous systems
ACM SIGOPS Operating Systems Review
Proactive resilience through architectural hybridization
Proceedings of the 2006 ACM symposium on Applied computing
Service interface: a new abstraction for implementing and composing protocols
Proceedings of the 2006 ACM symposium on Applied computing
ALTER: first step towards dependable grids
Proceedings of the 2006 ACM symposium on Applied computing
Active disk Paxos with infinitely many processes
Distributed Computing - Special issue: PODC 02
The inherent price of indulgence
Distributed Computing - Special issue: PODC 02
Irreducibility and additivity of set agreement-oriented failure detector classes
Proceedings of the twenty-fifth annual ACM symposium on Principles of distributed computing
Timeliness, failure-detectors, and consensus performance
Proceedings of the twenty-fifth annual ACM symposium on Principles of distributed computing
Synchronizing without locks is inherently expensive
Proceedings of the twenty-fifth annual ACM symposium on Principles of distributed computing
A knowledge-theoretic analysis of uniform distributed coordination and failure detectors
Distributed Computing
Low complexity Byzantine-resilient consensus
Distributed Computing
Fully Distributed Three-Tier Active Software Replication
IEEE Transactions on Parallel and Distributed Systems
Time-Free and Timer-Based Assumptions Can Be Combined to Obtain Eventual Leadership
IEEE Transactions on Parallel and Distributed Systems
Condition Adaptation in Synchronous Consensus
IEEE Transactions on Computers
Detecting and Isolating Malicious Routers
IEEE Transactions on Dependable and Secure Computing
Tight bounds for k-set agreement with limited-scope failure detectors
Distributed Computing - Special issue: DISC 03
On the importance of having an identity or, is consensus really universal?
Distributed Computing - Special issue: DISC 04
Light-weight leases for storage-centric coordination
International Journal of Parallel Programming
Construction of a fault-tolerant wireless communication topology using distributed agreement
DIWANS '06 Proceedings of the 2006 workshop on Dependability issues in wireless ad hoc networks and sensor networks
Coordinated data aggregation in wireless sensor networks using the Omega failure detector
Proceedings of the 3rd ACM international workshop on Performance evaluation of wireless ad hoc, sensor and ubiquitous networks
Implementing unreliable failure detectors with unknown membership
Information Processing Letters
Implementing fault-tolerance in real-time systems by automatic program transformations
EMSOFT '06 Proceedings of the 6th ACM & IEEE International conference on Embedded software
Orchestrating fair exchanges between mutually distrustful web services
Proceedings of the 3rd ACM workshop on Secure web services
Asynchronous bounded lifetime failure detectors
Information Processing Letters
Global computing in a dynamic network of tuple spaces
Science of Computer Programming
Research note: From ♢W to Ω: A simple bounded quiescent reliable broadcast-based transformation
Journal of Parallel and Distributed Computing
Managed Agreement: Generalizing two fundamental distributed agreement problems
Information Processing Letters
Worm-IT - A wormhole-based intrusion-tolerant group communication system
Journal of Systems and Software
A weakly-adaptive condition-based consensus algorithm in asynchronous distributed systems
Information Processing Letters
Harmful dogmas in fault tolerant distributed computing
ACM SIGACT News
The perfectly synchronized round-based model of distributed computing
Information and Computation
On modeling and tolerating incorrect software
Journal of High Speed Networks - Self-Stabilizing Systems, Part 2
Design and implementation of a secure wide-area object middleware
Computer Networks: The International Journal of Computer and Telecommunications Networking
Adaptive timeliness of consensus in presence of crash and timing faults
Journal of Parallel and Distributed Computing
Evaluation of the QoS of crash-recovery failure detection
Proceedings of the 2007 ACM symposium on Applied computing
A new adaptive accrual failure detector for dependable distributed systems
Proceedings of the 2007 ACM symposium on Applied computing
The notion of a timed register and its application to indulgent synchronization
Proceedings of the nineteenth annual ACM symposium on Parallel algorithms and architectures
The case for Byzantine fault detection
HOTDEP'06 Proceedings of the 2nd conference on Hot Topics in System Dependability - Volume 2
FUSE: lightweight guaranteed distributed failure notification
OSDI'04 Proceedings of the 6th conference on Symposium on Opearting Systems Design & Implementation - Volume 6
Ensuring e-Transaction with Asynchronous and Uncoordinated Application Server Replicas
IEEE Transactions on Parallel and Distributed Systems
On the Respective Power of ◊P and ◊S to Solve One-Shot Agreement Problems
IEEE Transactions on Parallel and Distributed Systems
A Parsimonious Approach for Obtaining Resource-Efficient and Trustworthy Execution
IEEE Transactions on Dependable and Secure Computing
An Adaptive Programming Model for Fault-Tolerant Distributed Computing
IEEE Transactions on Dependable and Secure Computing
The election problem in asynchronous distributed systems with bounded faulty processes
The Journal of Supercomputing
A priority-based distributed group mutual exclusion algorithm when group access is non-uniform
Journal of Parallel and Distributed Computing
Proceedings of the conference on Design, automation and test in Europe
Using the strategy design pattern to compose reliable distributed protocols
COOTS'97 Proceedings of the 3rd conference on USENIX Conference on Object-Oriented Technologies (COOTS) - Volume 3
Asynchronous Agreement and Its Relation with Error-Correcting Codes
IEEE Transactions on Computers
Design and Performance Evaluation of Efficient Consensus Protocols for Mobile Ad Hoc Networks
IEEE Transactions on Computers
Latency and bandwidth-minimizing failure detectors
Proceedings of the 2nd ACM SIGOPS/EuroSys European Conference on Computer Systems 2007
Sprint: a middleware for high-performance transaction processing
Proceedings of the 2nd ACM SIGOPS/EuroSys European Conference on Computer Systems 2007
Proceedings of the twenty-sixth annual ACM symposium on Principles of distributed computing
On the weakest failure detector ever
Proceedings of the twenty-sixth annual ACM symposium on Principles of distributed computing
Failure detectors are schedulers
Proceedings of the twenty-sixth annual ACM symposium on Principles of distributed computing
Towards the minimal synchrony for byzantine consensus
Proceedings of the twenty-sixth annual ACM symposium on Principles of distributed computing
Sinfonia: a new paradigm for building scalable distributed systems
Proceedings of twenty-first ACM SIGOPS symposium on Operating systems principles
PeerReview: practical accountability for distributed systems
Proceedings of twenty-first ACM SIGOPS symposium on Operating systems principles
QoS management in distributed service oriented systems
PDCN'07 Proceedings of the 25th conference on Proceedings of the 25th IASTED International Multi-Conference: parallel and distributed computing and networks
Performance of memory reclamation for lockless synchronization
Journal of Parallel and Distributed Computing
Online Diagnosis and Recovery: On the Choice and Impact of Tuning Parameters
IEEE Transactions on Dependable and Secure Computing
Automated Rule-Based Diagnosis through a Distributed Monitor System
IEEE Transactions on Dependable and Secure Computing
Pronto: High availability for standard off-the-shelf databases
Journal of Parallel and Distributed Computing
Reliable directory service and message delivery for large-scale mobile agent systems
TELE-INFO'07 Proceedings of the 6th WSEAS Int. Conference on Telecommunications and Informatics
Adaptive clustering for scalable key management in dynamic group communications
International Journal of Security and Networks
Message and time efficient consensus protocols for synchronous distributed systems
Journal of Parallel and Distributed Computing
Scheduling distributable real-time threads in the presence of crash failures and message losses
Proceedings of the 2008 ACM symposium on Applied computing
Total order broadcast on pervasive systems
Proceedings of the 2008 ACM symposium on Applied computing
Providing dependability for web services
Proceedings of the 2008 ACM symposium on Applied computing
On termination detection in crash-prone distributed systems with failure detectors
Journal of Parallel and Distributed Computing
A theory of system behaviour in the presence of node and link failure
Information and Computation
Implementing fault-tolerance in real-time programs by automatic program transformations
ACM Transactions on Embedded Computing Systems (TECS)
Using asynchrony and zero degradation to speed up indulgent consensus protocols
Journal of Parallel and Distributed Computing
On obstruction-free transactions
Proceedings of the twentieth annual symposium on Parallelism in algorithms and architectures
FaTLease: scalable fault-tolerant lease negotiation with paxos
HPDC '08 Proceedings of the 17th international symposium on High performance distributed computing
Key-based consistency and availability in structured overlay networks
HPDC '08 Proceedings of the 17th international symposium on High performance distributed computing
A synchronization protocol for supporting peer-to-peer multiplayer online games in overlay networks
Proceedings of the second international conference on Distributed event-based systems
Jgroup-ARM: a distributed object group platform with autonomous replication management
Software—Practice & Experience
D3S: debugging deployed distributed systems
NSDI'08 Proceedings of the 5th USENIX Symposium on Networked Systems Design and Implementation
Modularity: a first class concept to address distributed systems
ACM SIGACT News
A methodology to design arbitrary failure detectors for distributed protocols
Journal of Systems Architecture: the EUROMICRO Journal
Failure detectors in loosely named systems
Proceedings of the twenty-seventh ACM symposium on Principles of distributed computing
Every problem has a weakest failure detector
Proceedings of the twenty-seventh ACM symposium on Principles of distributed computing
Sharing is harder than agreeing
Proceedings of the twenty-seventh ACM symposium on Principles of distributed computing
Timeliness-based wait-freedom: a gracefully degrading progress condition
Proceedings of the twenty-seventh ACM symposium on Principles of distributed computing
Optimal failure detection with low sporadic overhead and communication locality
Proceedings of the twenty-seventh ACM symposium on Principles of distributed computing
An impossibility about failure detectors in the iterated immediate snapshot model
Information Processing Letters
Effective service replication mechanisms exploiting agent mobility
SEPADS'08 Proceedings of the 7th WSEAS International Conference on Software Engineering, Parallel and Distributed Systems
Agreement and consistency without knowing the number of processes
NOTERE '08 Proceedings of the 8th international conference on New technologies in distributed systems
Agreement without knowing everybody: a first step to dynamicity
NOTERE '08 Proceedings of the 8th international conference on New technologies in distributed systems
A Fault Tolerant Agent Communication Language for Supporting Web Agent Interaction
Agent Communication II
The DHCP Failover Protocol: A Formal Perspective
FORTE '07 Proceedings of the 27th IFIP WG 6.1 international conference on Formal Techniques for Networked and Distributed Systems
The Iterated Restricted Immediate Snapshot Model
COCOON '08 Proceedings of the 14th annual international conference on Computing and Combinatorics
Design, Implementation and Deployment of State Machines Using a Generative Approach
Architecting Dependable Systems V
Fair Exchange Is Incomparable to Consensus
Proceedings of the 5th international colloquium on Theoretical Aspects of Computing
Deterministic Models of Communication Faults
MFCS '08 Proceedings of the 33rd international symposium on Mathematical Foundations of Computer Science
Local Terminations and Distributed Computability in Anonymous Networks
DISC '08 Proceedings of the 22nd international symposium on Distributed Computing
Local Maps: New Insights into Mobile Agent Algorithms
DISC '08 Proceedings of the 22nd international symposium on Distributed Computing
Using Bounded Model Checking to Verify Consensus Algorithms
DISC '08 Proceedings of the 22nd international symposium on Distributed Computing
Designing Fault-Tolerant Component Based Applications with a Model Driven Approach
SEUS '08 Proceedings of the 6th IFIP WG 10.2 international workshop on Software Technologies for Embedded and Ubiquitous Systems
Locks Considered Harmful: A Look at Non-traditional Synchronization
SEUS '08 Proceedings of the 6th IFIP WG 10.2 international workshop on Software Technologies for Embedded and Ubiquitous Systems
Uncertainty Management for the Retrieval of Economic Information from Distributed Markets
SUM '08 Proceedings of the 2nd international conference on Scalable Uncertainty Management
Disassembling real-time fault-tolerant programs
EMSOFT '08 Proceedings of the 8th ACM international conference on Embedded software
A general characterization of indulgence
ACM Transactions on Autonomous and Adaptive Systems (TAAS)
Key-based consistency and availability in structured overlay networks
Proceedings of the 3rd international conference on Scalable information systems
LIDeA: a distributed lightweight intrusion detection architecture for sensor networks
Proceedings of the 4th international conference on Security and privacy in communication netowrks
Optimal message-driven implementations of omega with mute processes
ACM Transactions on Autonomous and Adaptive Systems (TAAS)
Using eventually consistent compasses to gather memory-less mobile robots with limited visibility
ACM Transactions on Autonomous and Adaptive Systems (TAAS)
A group membership service for large-scale grids
Proceedings of the 6th international workshop on Middleware for grid computing
APNOMS '08 Proceedings of the 11th Asia-Pacific Symposium on Network Operations and Management: Challenges for Next Generation Network Operations and Service Management
Failure Detection Service for Large Scale Systems
KES-AMSTA '07 Proceedings of the 1st KES International Symposium on Agent and Multi-Agent Systems: Technologies and Applications
Correctness Criteria for Database Replication: Theoretical and Practical Aspects
OTM '08 Proceedings of the OTM 2008 Confederated International Conferences, CoopIS, DOA, GADA, IS, and ODBASE 2008. Part I on On the Move to Meaningful Internet Systems:
Universe Detectors for Sybil Defense in Ad Hoc Wireless Networks
SSS '08 Proceedings of the 10th International Symposium on Stabilization, Safety, and Security of Distributed Systems
Tiara: A Self-stabilizing Deterministic Skip List
SSS '08 Proceedings of the 10th International Symposium on Stabilization, Safety, and Security of Distributed Systems
The Asynchronous Bounded-Cycle Model
SSS '08 Proceedings of the 10th International Symposium on Stabilization, Safety, and Security of Distributed Systems
Grouping algorithms for scalable self-monitoring distributed systems
Autonomics '08 Proceedings of the 2nd International Conference on Autonomic Computing and Communication Systems
Byzantine Consensus with Unknown Participants
OPODIS '08 Proceedings of the 12th International Conference on Principles of Distributed Systems
With Finite Memory Consensus Is Easier Than Reliable Broadcast
OPODIS '08 Proceedings of the 12th International Conference on Principles of Distributed Systems
Fault-Tolerant Flocking in a k-Bounded Asynchronous System
OPODIS '08 Proceedings of the 12th International Conference on Principles of Distributed Systems
Solving Atomic Multicast When Groups Crash
OPODIS '08 Proceedings of the 12th International Conference on Principles of Distributed Systems
An Unreliable Failure Detector for Unknown and Mobile Networks
OPODIS '08 Proceedings of the 12th International Conference on Principles of Distributed Systems
Theoretical Computer Science
Implementing the Omega failure detector in the crash-recovery failure model
Journal of Computer and System Sciences
Semantic partitioning of peer-to-peer search space
Computer Communications
Failure detectors for wireless sensor-actuator systems
Ad Hoc Networks
A step towards a new generation of group communication systems
Proceedings of the ACM/IFIP/USENIX 2003 International Conference on Middleware
GenQA: automated addition of architectural quality attribute support for Java software?
Proceedings of the 2009 ACM symposium on Applied Computing
Two Consensus Algorithms with Atomic Registers and Failure Detector Ω
ICDCN '09 Proceedings of the 10th International Conference on Distributed Computing and Networking
Design of the notification system for failure detectors
International Journal of High Performance Computing and Networking
Global data computation in chordal rings
Journal of Parallel and Distributed Computing
A Generic Group Communication Approach for Hybrid Distributed Systems
DAIS '09 Proceedings of the 9th IFIP WG 6.1 International Conference on Distributed Applications and Interoperable Systems
On Process-Algebraic Proof Methods for Fault Tolerant Distributed Systems
FMOODS '09/FORTE '09 Proceedings of the Joint 11th IFIP WG 6.1 International Conference FMOODS '09 and 29th IFIP WG 6.1 International Conference FORTE '09 on Formal Techniques for Distributed Systems
A Reliable and Efficient Pedal Back Data Disseminating Scheme for Ad-Hoc WSNs
ISA '09 Proceedings of the 3rd International Conference and Workshops on Advances in Information Security and Assurance
Extracting quorum failure detectors
Proceedings of the 28th ACM symposium on Principles of distributed computing
The weakest failure detector for solving k-set agreement
Proceedings of the 28th ACM symposium on Principles of distributed computing
Partial synchrony based on set timeliness
Proceedings of the 28th ACM symposium on Principles of distributed computing
Brief announcement: weakest failure detectors via an egg-laying simulation
Proceedings of the 28th ACM symposium on Principles of distributed computing
The weakest failure detector for wait-free dining under eventual weak exclusion
Proceedings of the twenty-first annual symposium on Parallelism in algorithms and architectures
Formal Model--Driven Design of Distributed Algorithms
Electronic Notes in Theoretical Computer Science (ENTCS)
FaTLease: scalable fault-tolerant lease negotiation with Paxos
Cluster Computing
Sinfonia: A new paradigm for building scalable distributed systems
ACM Transactions on Computer Systems (TOCS)
On the round complexity of Byzantine agreement without initial set-up
Information and Computation
IEEE Transactions on Very Large Scale Integration (VLSI) Systems
IEEE Journal on Selected Areas in Communications - Special issue on wireless and pervasive communications for healthcare
A simple and communication-efficient Omega algorithm in the crash-recovery model
Information Processing Letters
SSS '09 Proceedings of the 11th International Symposium on Stabilization, Safety, and Security of Distributed Systems
A Stability Criteria Membership Protocol for Ad Hoc Networks
OTM '09 Proceedings of the Confederated International Conferences, CoopIS, DOA, IS, and ODBASE 2009 on On the Move to Meaningful Internet Systems: Part I
OPODIS '09 Proceedings of the 13th International Conference on Principles of Distributed Systems
The Minimum Information about Failures for Solving Non-local Tasks in Message-Passing Systems
OPODIS '09 Proceedings of the 13th International Conference on Principles of Distributed Systems
Enhanced Fault-Tolerance through Byzantine Failure Detection
OPODIS '09 Proceedings of the 13th International Conference on Principles of Distributed Systems
Weak Synchrony Models and Failure Detectors for Message Passing (k-)Set Agreement
OPODIS '09 Proceedings of the 13th International Conference on Principles of Distributed Systems
CCNC'09 Proceedings of the 6th IEEE Conference on Consumer Communications and Networking Conference
SAFE: scalable autonomous fault-tolerant Ethernet
ICACT'09 Proceedings of the 11th international conference on Advanced Communication Technology - Volume 1
Asynchronous bounded lifetime failure detectors
Information Processing Letters
ACM Transactions on Programming Languages and Systems (TOPLAS)
Tight failure detection bounds on atomic object implementations
Journal of the ACM (JACM)
International Journal of Parallel Programming
Semi-passive replication and Lazy Consensus
Journal of Parallel and Distributed Computing
Active replication of software components
SEM'02 Proceedings of the 3rd international conference on Software engineering and middleware
A general characterization of indulgence
SSS'06 Proceedings of the 8th international conference on Stabilization, safety, and security of distributed systems
Optimal message-driven implementation of omega with mute processes
SSS'06 Proceedings of the 8th international conference on Stabilization, safety, and security of distributed systems
A dependable intrusion detection architecture based on agreement services
SSS'06 Proceedings of the 8th international conference on Stabilization, safety, and security of distributed systems
Brief announcement: self-stabilizing spanning tree algorithm for large scale systems
SSS'06 Proceedings of the 8th international conference on Stabilization, safety, and security of distributed systems
Brief announcement: wait-free dining for eventual weak exclusion
SSS'06 Proceedings of the 8th international conference on Stabilization, safety, and security of distributed systems
GPC'07 Proceedings of the 2nd international conference on Advances in grid and pervasive computing
A framework of safe stabilization
SSS'03 Proceedings of the 6th international conference on Self-stabilizing systems
Using failure injection mechanisms to experiment and evaluate a grid failure detector
VECPAR'06 Proceedings of the 7th international conference on High performance computing for computational science
A fault tolerance bisimulation proof for consensus
ESOP'07 Proceedings of the 16th European conference on Programming
ARCS'07 Proceedings of the 20th international conference on Architecture of computing systems
ICCS'03 Proceedings of the 1st international conference on Computational science: PartI
DARX: a self-healing framework for agents
Proceedings of the 12th Monterey conference on Reliable systems on unreliable networked platforms
Dynamic system reconfiguration via service composition for dependable computing
Proceedings of the 12th Monterey conference on Reliable systems on unreliable networked platforms
A fault-tolerant software architecture for component-based systems
Architecting dependable systems
Design and performance of a generic consensus component for critical distributed applications
Ada-Europe'07 Proceedings of the 12th international conference on Reliable software technologies
Learning from the past for resolving dilemmas of asynchrony
ACM SIGOPS Operating Systems Review
Asynchronous Byzantine consensus with 2f+1 processes
Proceedings of the 2010 ACM Symposium on Applied Computing
On distributed real-time scheduling in networked embedded systems in the presence of crash failures
SEUS'07 Proceedings of the 5th IFIP WG 10.2 international conference on Software technologies for embedded and ubiquitous systems
Consensus-driven distributable thread scheduling in networked embedded systems
EUC'07 Proceedings of the 2007 international conference on Embedded and ubiquitous computing
Byzantine consensus with few synchronous links
OPODIS'07 Proceedings of the 11th international conference on Principles of distributed systems
From an intermittent rotating star to a leader
OPODIS'07 Proceedings of the 11th international conference on Principles of distributed systems
The circular two-phase commit protocol
DASFAA'07 Proceedings of the 12th international conference on Database systems for advanced applications
Towards timely ACID transactions in DBMS
DASFAA'07 Proceedings of the 12th international conference on Database systems for advanced applications
SSS'07 Proceedings of the 9h international conference on Stabilization, safety, and security of distributed systems
Decentralized, connectivity-preserving, and cost-effective structured overlay maintenance
SSS'07 Proceedings of the 9h international conference on Stabilization, safety, and security of distributed systems
Secure failure detection in TrustedPals
SSS'07 Proceedings of the 9h international conference on Stabilization, safety, and security of distributed systems
Global predicate detection in distributed systems with small faults
SSS'07 Proceedings of the 9h international conference on Stabilization, safety, and security of distributed systems
The truth system: can a system of lying processes stabilize?
SSS'07 Proceedings of the 9h international conference on Stabilization, safety, and security of distributed systems
On the probabilistic omission adversary
SSS'07 Proceedings of the 9h international conference on Stabilization, safety, and security of distributed systems
Implementation and performance evaluation of an adaptable failure detector in iSCSI
APPT'07 Proceedings of the 7th international conference on Advanced parallel processing technologies
The building blocks of consensus
ICDCN'08 Proceedings of the 9th international conference on Distributed computing and networking
On optimal probabilistic asynchronous Byzantine agreement
ICDCN'08 Proceedings of the 9th international conference on Distributed computing and networking
Wait-free dining under eventual weak exclusion
ICDCN'08 Proceedings of the 9th international conference on Distributed computing and networking
Skip ring topology in fast failure detection service
PPAM'07 Proceedings of the 7th international conference on Parallel processing and applied mathematics
Issues on the design of efficient fail-safe fault tolerance
ISSRE'09 Proceedings of the 20th IEEE international conference on software reliability engineering
Algorithms for sensor and ad hoc networks: advanced lectures
Algorithms for sensor and ad hoc networks: advanced lectures
Future directions in distributed computing
Comparing the atomic commitment and consensus problems
Future directions in distributed computing
Open questions on consensus performance in well-behaved runs
Future directions in distributed computing
Challenges in evaluating distributed algorithms
Future directions in distributed computing
Dissecting distributed computations
Future directions in distributed computing
Uncertainty and predictability: can they be reconciled?
Future directions in distributed computing
A data-centric approach for scalable state machine replication
Future directions in distributed computing
Model-driven construction of embedded applications based on reusable building blocks: an example
SDL'09 Proceedings of the 14th international SDL conference on Design for motes and mobiles
DISC'09 Proceedings of the 23rd international conference on Distributed computing
Randomization can be a healer: consensus with dynamic omission failures
DISC'09 Proceedings of the 23rd international conference on Distributed computing
On the existence of weakest failure detectors for mutual exclusion and k-exclusion
DISC'09 Proceedings of the 23rd international conference on Distributed computing
Crash-quiescent failure detection
DISC'09 Proceedings of the 23rd international conference on Distributed computing
The price of anonymity: optimal consensus despite asynchrony, crash and anonymity
DISC'09 Proceedings of the 23rd international conference on Distributed computing
Brief announcement: the minimum failure detector for non-local tasks in message-passing systems
DISC'09 Proceedings of the 23rd international conference on Distributed computing
Throughput optimal total order broadcast for cluster environments
ACM Transactions on Computer Systems (TOCS)
Eventually linearizable shared objects
Proceedings of the 29th ACM SIGACT-SIGOPS symposium on Principles of distributed computing
Enhanced Paxos Commit for Transactions on DHTs
CCGRID '10 Proceedings of the 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing
Information Processing Letters
When consensus meets self-stabilization
Journal of Computer and System Sciences
HotOS'09 Proceedings of the 12th conference on Hot topics in operating systems
Network imprecision: a new consistency metric for scalable monitoring
OSDI'08 Proceedings of the 8th USENIX conference on Operating systems design and implementation
Mencius: building efficient replicated state machines for WANs
OSDI'08 Proceedings of the 8th USENIX conference on Operating systems design and implementation
Weak consistency as a last resort
Proceedings of the 4th International Workshop on Large Scale Distributed Systems and Middleware
Available and safe message freshness detection algorithm
International Journal of Critical Computer-Based Systems
Distributed computing: a Glimmer of a theory
Algorithms and theory of computation handbook
The failure detector abstraction
ACM Computing Surveys (CSUR)
Fast asynchronous consensus with optimal resilience
DISC'10 Proceedings of the 24th international conference on Distributed computing
Anonymous asynchronous systems: the case of failure detectors
DISC'10 Proceedings of the 24th international conference on Distributed computing
Brief announcement: failure detectors encapsulate fairness
DISC'10 Proceedings of the 24th international conference on Distributed computing
Fault-tolerant flocking for a group of autonomous mobile robots
Journal of Systems and Software
An approach for designing and assessing detectors for dependable component-based systems
HASE'04 Proceedings of the Eighth IEEE international conference on High assurance systems engineering
Eventually consistent failure detectors
EUROMICRO-PDP'02 Proceedings of the 10th Euromicro conference on Parallel, distributed and network-based processing
On the impossibility of implementing perpetual failure detectors in partially synchronous systems
EUROMICRO-PDP'02 Proceedings of the 10th Euromicro conference on Parallel, distributed and network-based processing
Structural and algorithmic issues of dynamic protocol update
IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
OPEN EDEN: a portable fault tolerant CORBA architecture
ISPDC'03 Proceedings of the Second international conference on Parallel and distributed computing
Communication-efficient and crash-quiescent Omega with unknown membership
Information Processing Letters
Autonomous and scalable failure detection in distributed systems
International Journal of Autonomous and Adaptive Communications Systems
SSS'10 Proceedings of the 12th international conference on Stabilization, safety, and security of distributed systems
Safe flocking in spite of actuator faults
SSS'10 Proceedings of the 12th international conference on Stabilization, safety, and security of distributed systems
Communication-efficient failure detection and consensus in omission environments
Information Processing Letters
Signature-free broadcast-based intrusion tolerance: never decide a Byzantine value
OPODIS'10 Proceedings of the 14th international conference on Principles of distributed systems
Failure detectors encapsulate fairness
OPODIS'10 Proceedings of the 14th international conference on Principles of distributed systems
(anti-Ωx × Σz)-based k-set agreement algorithms
OPODIS'10 Proceedings of the 14th international conference on Principles of distributed systems
Turning adversaries into friends: simplified, made constructive, and extended
OPODIS'10 Proceedings of the 14th international conference on Principles of distributed systems
Architecture and protocol support for providing consensus as a fault-tolerant virtualised service
Proceedings of the 8th International Conference on Frontiers of Information Technology
A necessary and sufficient synchrony condition for solving Byzantine consensus in symmetric networks
ICDCN'11 Proceedings of the 12th international conference on Distributed computing and networking
Proceedings of the Third International Workshop on Reliability, Availability, and Security
The impossibility of boosting distributed service resilience
Information and Computation
Rewriting: sleeping to get there faster
HotDep'05 Proceedings of the First conference on Hot topics in system dependability
The case for byzantine fault detection
HotDep'06 Proceedings of the Second conference on Hot topics in system dependability
Ensuring content integrity for untrusted peer-to-peer content distribution networks
NSDI'07 Proceedings of the 4th USENIX conference on Networked systems design & implementation
EWDC '11 Proceedings of the 13th European Workshop on Dependable Computing
A new approach to fault-tolerant mobile agent execution in distributed systems
EC'05 Proceedings of the 6th WSEAS international conference on Evolutionary computing
Modeling fault-tolerant and reliable mobile agent execution in distributed systems
EC'05 Proceedings of the 6th WSEAS international conference on Evolutionary computing
A new approach for evaluation fault-tolerant mobile agent execution in distributed systems
EC'05 Proceedings of the 6th WSEAS international conference on Evolutionary computing
A new approach for evaluation fault-tolerant mobile agent execution in distributed systems
EC'05 Proceedings of the 6th WSEAS international conference on Evolutionary computing
Structuring unreliable radio networks
Proceedings of the 30th annual ACM SIGACT-SIGOPS symposium on Principles of distributed computing
The universe of symmetry breaking tasks
Proceedings of the 30th annual ACM SIGACT-SIGOPS symposium on Principles of distributed computing
Rectifying orphan components using group-failover in distributed real-time and embedded systems
Proceedings of the 14th international ACM Sigsoft symposium on Component based software engineering
Efficient fault tolerant consensus using preemptive token
ACAI '11 Proceedings of the International Conference on Advances in Computing and Artificial Intelligence
The Price of Anonymity: Optimal Consensus Despite Asynchrony, Crash, and Anonymity
ACM Transactions on Autonomous and Adaptive Systems (TAAS)
Proceedings of the 11th IFIP WG 6.1 international conference on Distributed applications and interoperable systems
The Asynchronous Bounded-Cycle model
Theoretical Computer Science
Resource-aware junction trees for efficient multi-agent coordination
The 10th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Multicast with aggregated deliveries
Proceedings of the First International Workshop on Algorithms and Models for Distributed Event Processing
The universe of symmetry breaking tasks
SIROCCO'11 Proceedings of the 18th international conference on Structural information and communication complexity
A failure detector for wireless networks with unknown membership
Euro-Par'11 Proceedings of the 17th international conference on Parallel processing - Volume Part II
What model and what conditions to implement unreliable failure detectors in dynamic networks?
Proceedings of the 3rd International Workshop on Theoretical Aspects of Dynamic Distributed Systems
Communication-efficient leader election in crash-recovery systems
Journal of Systems and Software
FaDe: RESTful service for failure detection in SOA environment
PaCT'11 Proceedings of the 11th international conference on Parallel computing technologies
A log-scaling fault tolerant agreement algorithm for a fault tolerant MPI
EuroMPI'11 Proceedings of the 18th European MPI Users' Group conference on Recent advances in the message passing interface
Fault tolerance in an industrial seismic processing application for multicore clusters
EuroMPI'11 Proceedings of the 18th European MPI Users' Group conference on Recent advances in the message passing interface
Run-through stabilization: an MPI proposal for process fault tolerance
EuroMPI'11 Proceedings of the 18th European MPI Users' Group conference on Recent advances in the message passing interface
Detecting failures in distributed systems with the Falcon spy network
SOSP '11 Proceedings of the Twenty-Third ACM Symposium on Operating Systems Principles
Semantic communication for simple goals is equivalent to on-line learning
ALT'11 Proceedings of the 22nd international conference on Algorithmic learning theory
SSS'11 Proceedings of the 13th international conference on Stabilization, safety, and security of distributed systems
Relations linking failure detectors associated with k-set agreement in message-passing systems
SSS'11 Proceedings of the 13th international conference on Stabilization, safety, and security of distributed systems
Brief announcement: leaderless byzantine paxos
DISC'11 Proceedings of the 25th international conference on Distributed computing
Brief announcement: on the meaning of solving a task with a failure detector
DISC'11 Proceedings of the 25th international conference on Distributed computing
Experimental evaluation of a failure detection service based on a gossip strategy
ICA3PP'11 Proceedings of the 11th international conference on Algorithms and architectures for parallel processing - Volume Part II
Multiwriter Consistency Conditions for Shared Memory Registers
SIAM Journal on Computing
An ACL for specifying fault-tolerant protocols
AI*IA'05 Proceedings of the 9th conference on Advances in Artificial Intelligence
In search of the holy grail: looking for the weakest failure detector for wait-free set agreement
OPODIS'06 Proceedings of the 10th international conference on Principles of Distributed Systems
When consensus meets self-stabilization
OPODIS'06 Proceedings of the 10th international conference on Principles of Distributed Systems
Optimistic algorithms for partial database replication
OPODIS'06 Proceedings of the 10th international conference on Principles of Distributed Systems
OPODIS'06 Proceedings of the 10th international conference on Principles of Distributed Systems
Self-stabilizing leader election in networks of finite-state anonymous agents
OPODIS'06 Proceedings of the 10th international conference on Principles of Distributed Systems
Using agreement services in grid computing
ISPA'06 Proceedings of the 2006 international conference on Frontiers of High Performance Computing and Networking
TransMAN: a group communication system for MANETs
ICDCN'06 Proceedings of the 8th international conference on Distributed Computing and Networking
Analyzing fault aware collective performance in a process fault tolerant MPI
Parallel Computing
A hybrid fault tolerance scheme for EasyGrid MPI applications
Proceedings of the 9th International Workshop on Middleware for Grids, Clouds and e-Science
A topological condition for solving fair exchange in byzantine environments
ICICS'06 Proceedings of the 8th international conference on Information and Communications Security
Revisiting the election problem in asynchronous distributed systems
APPT'05 Proceedings of the 6th international conference on Advanced Parallel Processing Technologies
On the possibility and the impossibility of message-driven self-stabilizing failure detection
SSS'05 Proceedings of the 7th international conference on Self-Stabilizing Systems
TCP-ABC: from multiple TCP connections to atomic broadcasting
NPC'05 Proceedings of the 2005 IFIP international conference on Network and Parallel Computing
Stabilizing consensus in mobile networks
DCOSS'06 Proceedings of the Second IEEE international conference on Distributed Computing in Sensor Systems
The notion of veto number for distributed agreement problems
IWDC'04 Proceedings of the 6th international conference on Distributed Computing
An efficient reliable architecture for application layer anycast service
ICA3PP'05 Proceedings of the 6th international conference on Algorithms and Architectures for Parallel Processing
Group communication: from practice to theory
SOFSEM'06 Proceedings of the 32nd conference on Current Trends in Theory and Practice of Computer Science
Solving election problem in asynchronous distributed systems
ICCS'06 Proceedings of the 6th international conference on Computational Science - Volume Part I
LATIN'10 Proceedings of the 9th Latin American conference on Theoretical Informatics
The election problem in asynchronous distributed systems with bounded faulty processes
ICCSA'06 Proceedings of the 2006 international conference on Computational Science and Its Applications - Volume Part V
Optimal and practical WAB-based consensus algorithms
Euro-Par'06 Proceedings of the 12th international conference on Parallel Processing
Run-time switching between total order algorithms
Euro-Par'06 Proceedings of the 12th international conference on Parallel Processing
One-step consensus solvability
DISC'06 Proceedings of the 20th international conference on Distributed Computing
The weakest failure detectors to boost obstruction-freedom
DISC'06 Proceedings of the 20th international conference on Distributed Computing
Robust network supercomputing with malicious processes
DISC'06 Proceedings of the 20th international conference on Distributed Computing
Low-latency atomic broadcast in the presence of contention
DISC'06 Proceedings of the 20th international conference on Distributed Computing
Brief announcement: communication-optimal implementation of failure detector class ⋄P
DISC'06 Proceedings of the 20th international conference on Distributed Computing
Extended membership problem for open groups: specification and solution
VECPAR'04 Proceedings of the 6th international conference on High Performance Computing for Computational Science
Replication predicates for dependent-failure algorithms
Euro-Par'05 Proceedings of the 11th international Euro-Par conference on Parallel Processing
A hybrid message Logging-CIC protocol for constrained checkpointability
Euro-Par'05 Proceedings of the 11th international Euro-Par conference on Parallel Processing
On correctness of dynamic protocol update
FMOODS'05 Proceedings of the 7th IFIP WG 6.1 international conference on Formal Methods for Open Object-Based Distributed Systems
A secure checkpointing protocol for survivable server design
ICDCIT'04 Proceedings of the First international conference on Distributed Computing and Internet Technology
SecondSite: disaster tolerance as a service
VEE '12 Proceedings of the 8th ACM SIGPLAN/SIGOPS conference on Virtual Execution Environments
Adaptive fault monitoring in fault tolerant CORBA
ICCS'05 Proceedings of the 5th international conference on Computational Science - Volume Part I
Global computing in a dynamic network of tuple spaces
COORDINATION'05 Proceedings of the 7th international conference on Coordination Models and Languages
Building and using quorums despite any number of process of crashes
EDCC'05 Proceedings of the 5th European conference on Dependable Computing
Failure detection with booting in partially synchronous systems
EDCC'05 Proceedings of the 5th European conference on Dependable Computing
An architectural framework for detecting process hangs/crashes
EDCC'05 Proceedings of the 5th European conference on Dependable Computing
Novel generic middleware building blocks for dependable modular avionics systems
EDCC'05 Proceedings of the 5th European conference on Dependable Computing
An improved algorithm for adaptive condition-based consensus
SIROCCO'05 Proceedings of the 12th international conference on Structural Information and Communication Complexity
Proof-based system engineering using a virtual system model
ISAS'05 Proceedings of the Second international conference on Service Availability
A formal model for fault-tolerance in distributed systems
SAFECOMP'05 Proceedings of the 24th international conference on Computer Safety, Reliability, and Security
Performance tuning of failure detectors in wireless ad-hoc networks: modelling and experiments
EPEW'05/WS-FM'05 Proceedings of the 2005 international conference on European Performance Engineering, and Web Services and Formal Methods, international conference on Formal Techniques for Computer Systems and Business Processes
A practical distributed mutual exclusion protocol in dynamic peer-to-peer systems
IPTPS'04 Proceedings of the Third international conference on Peer-to-Peer Systems
Revisiting failure detection and consensus in omission failure environments
ICTAC'05 Proceedings of the Second international conference on Theoretical Aspects of Computing
On conspiracies and hyperfairness in distributed computing
DISC'05 Proceedings of the 19th international conference on Distributed Computing
Obstruction-Free algorithms can be practically wait-free
DISC'05 Proceedings of the 19th international conference on Distributed Computing
Efficient reduction for wait-free termination detection in a crash-prone distributed system
DISC'05 Proceedings of the 19th international conference on Distributed Computing
Computing with reads and writes in the absence of step contention
DISC'05 Proceedings of the 19th international conference on Distributed Computing
(Almost) all objects are universal in message passing systems
DISC'05 Proceedings of the 19th international conference on Distributed Computing
Ω meets paxos: leader election and stability without eventual timely links
DISC'05 Proceedings of the 19th international conference on Distributed Computing
DISC'05 Proceedings of the 19th international conference on Distributed Computing
Communication-efficient implementation of failure detector classes ♦;Q and ♦;P
DISC'05 Proceedings of the 19th international conference on Distributed Computing
Quantitative evaluation of distributed algorithms using the neko framework: the nekostat extension
LADC'05 Proceedings of the Second Latin-American conference on Dependable Computing
LADC'05 Proceedings of the Second Latin-American conference on Dependable Computing
Generating fast atomic commit from hyperfast consensus
LADC'05 Proceedings of the Second Latin-American conference on Dependable Computing
QoS self-configuring failure detectors for distributed systems
DAIS'10 Proceedings of the 10th IFIP WG 6.1 international conference on Distributed Applications and Interoperable Systems
Two abstractions for implementing atomic objects in dynamic systems
OPODIS'05 Proceedings of the 9th international conference on Principles of Distributed Systems
Implementing reliable distributed real-time systems with the Θ-model
OPODIS'05 Proceedings of the 9th international conference on Principles of Distributed Systems
Architecting Dependable Systems III
Dependable Systems
Advances in the design and implementation of group communication middleware
Dependable Systems
Modular approach to replication for availability
Replication
Algorithms for extracting timeliness graphs
SIROCCO'10 Proceedings of the 17th international conference on Structural Information and Communication Complexity
Divide and concur: employing chandra and toueg's consensus algorithm in a multi-level setting
ICDCIT'05 Proceedings of the Second international conference on Distributed Computing and Internet Technology
Ramos: Concurrent writing and reconfiguration for collaborative systems
Journal of Parallel and Distributed Computing
A behavioral model for software containers
FASE'06 Proceedings of the 9th international conference on Fundamental Approaches to Software Engineering
Anonymous agreement: the janus algorithm
OPODIS'11 Proceedings of the 15th international conference on Principles of Distributed Systems
Easy impossibility proofs for k-set agreement in message passing systems
OPODIS'11 Proceedings of the 15th international conference on Principles of Distributed Systems
Byzantine fault-tolerance with commutative commands
OPODIS'11 Proceedings of the 15th international conference on Principles of Distributed Systems
On the implementation of concurrent objects
Dependable and Historic Computing
Leader election for replicated services using application scores
Middleware'11 Proceedings of the 12th ACM/IFIP/USENIX international conference on Middleware
Asynchronous failed sensor node detection method for sensor networks
International Journal of Network Management
A simple distributed algorithm for the maintenance of a spanning tree
VECoS'07 Proceedings of the First international conference on Verification and Evaluation of Computer and Communication Systems
The renaming problem in shared memory systems: An introduction
Computer Science Review
PODC '12 Proceedings of the 2012 ACM symposium on Principles of distributed computing
Asynchronous failure detectors
PODC '12 Proceedings of the 2012 ACM symposium on Principles of distributed computing
On the (limited) power of non-equivocation
PODC '12 Proceedings of the 2012 ACM symposium on Principles of distributed computing
PODC '12 Proceedings of the 2012 ACM symposium on Principles of distributed computing
Failure detection in a RESTful way
PPAM'11 Proceedings of the 9th international conference on Parallel Processing and Applied Mathematics - Volume Part II
Scoped synchronization constraints for large scale actor systems
COORDINATION'12 Proceedings of the 14th international conference on Coordination Models and Languages
Increasing the power of the iterated immediate snapshot model with failure detectors
SIROCCO'12 Proceedings of the 19th international conference on Structural Information and Communication Complexity
Specifying and implementing an eventual leader service for dynamic systems
International Journal of Web and Grid Services
Modeling and validating the performance of atomic broadcast algorithms in high latency networks
Euro-Par'07 Proceedings of the 13th international Euro-Par conference on Parallel Processing
On detecting termination in the crash-recovery model
Euro-Par'07 Proceedings of the 13th international Euro-Par conference on Parallel Processing
Looking for a definition of dynamic distributed systems
PaCT'07 Proceedings of the 9th international conference on Parallel Computing Technologies
From unreliable objects to reliable objects: the case of atomic registers and consensus
PaCT'07 Proceedings of the 9th international conference on Parallel Computing Technologies
Weakening failure detectors for k-set agreement via the partition approach
DISC'07 Proceedings of the 21st international conference on Distributed Computing
From crash-stop to permanent omission: automatic transformation and weakest failure detectors
DISC'07 Proceedings of the 21st international conference on Distributed Computing
On the message complexity of indulgent consensus
DISC'07 Proceedings of the 21st international conference on Distributed Computing
Efficient transformations of obstruction-free algorithms into non-blocking algorithms
DISC'07 Proceedings of the 21st international conference on Distributed Computing
Eventually perfect failure detectors using ADD channels
ISPA'07 Proceedings of the 5th international conference on Parallel and Distributed Processing and Applications
On the implementation of communication-optimal failure detectors
LADC'07 Proceedings of the Third Latin-American conference on Dependable Computing
Connectivity in eventually quiescent dynamic distributed systems
LADC'07 Proceedings of the Third Latin-American conference on Dependable Computing
Exploiting partitioned synchrony to implement accurate failure detectors
International Journal of Critical Computer-Based Systems
From a store-collect object and Ω to efficient asynchronous consensus
Euro-Par'12 Proceedings of the 18th international conference on Parallel Processing
Non-blocking atomic commitment in asynchronous distributed systems with faulty processes
ICA3PP'12 Proceedings of the 12th international conference on Algorithms and Architectures for Parallel Processing - Volume Part I
Formal verification of distributed algorithms: from pseudo code to checked proofs
TCS'12 Proceedings of the 7th IFIP TC 1/WG 202 international conference on Theoretical Computer Science
Leader election for replicated services using application scores
Proceedings of the 12th International Middleware Conference
Distributed algorithms for the creation of a new distributed IDS in MANETs
IDCS'12 Proceedings of the 5th international conference on Internet and Distributed Computing Systems
Brief announcement: anonymity, failures, detectors and consensus
DISC'12 Proceedings of the 26th international conference on Distributed Computing
ACM SIGOPS Operating Systems Review
A message omission failure approach to detect the quality of links in WSN
UCAmI'12 Proceedings of the 6th international conference on Ubiquitous Computing and Ambient Intelligence
A Failure Detection System for Large Scale Distributed Systems
International Journal of Distributed Systems and Technologies
Enhancing group communication with self-manageable behavior
Journal of Parallel and Distributed Computing
Multicasting in the presence of aggregated deliveries
Journal of Parallel and Distributed Computing
MoSQL: an elastic storage engine for MySQL
Proceedings of the 28th Annual ACM Symposium on Applied Computing
Identifying incompatible service implementations using pooled decision trees
Proceedings of the 28th Annual ACM Symposium on Applied Computing
Improving availability in distributed systems with failure informers
nsdi'13 Proceedings of the 10th USENIX conference on Networked Systems Design and Implementation
Synchrony weakened by message adversaries vs asynchrony restricted by failure detectors
Proceedings of the 2013 ACM symposium on Principles of distributed computing
Avoiding disruptive failovers in transaction processing systems with multiple active nodes
Journal of Parallel and Distributed Computing
Towards a complexity theory for local distributed computing
Journal of the ACM (JACM)
Post-failure recovery of MPI communication capability: Design and rationale
International Journal of High Performance Computing Applications
Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles
ACM SIGOPS 24th Symposium on Operating Systems Principles
There is more consensus in Egalitarian parliaments
Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles
LibRe: a consistency protocol for modern storage systems
Proceedings of the 6th ACM India Computing Convention
On the scalability of snapshot isolation
Euro-Par'13 Proceedings of the 19th international conference on Parallel Processing
Corona: A stabilizing deterministic message-passing skip list
Theoretical Computer Science
Theoretical Computer Science
Algorithms for a distributed IDS in MANETs
Journal of Computer and System Sciences
Quorum-based mutual exclusion in asynchronous distributed systems with unreliable failure detectors
The Journal of Supercomputing
Hi-index | 0.08 |
We introduce the concept of unreliable failure detectors and study how they can be used to solve Consensus in asynchronous systems with crash failures. We characterise unreliable failure detectors in terms of two properties—completeness and accuracy. We show that Consensus can be solved even with unreliable failure detectors that make an infinite number of mistakes, and determine which ones can be used to solve Consensus despite any number of crashes, and which ones require a majority of correct processes. We prove that Consensus and Atomic Broadcast are reducible to each other in asynchronous systems with crash failures; thus, the above results also apply to Atomic Broadcast. A companion paper shows that one of the failure detectors introduced here is the weakest failure detector for solving Consensus [Chandra et al. 1992].