Crash failure detection is a key topic in fault tolerance, and it is important to be able to assess the quality of service (QoS) of failure detection services. Most previous work on crash failure detectors has been based on the crash-stop or fail-free assumption. In this paper we study and model a crash-recovery service, that is, a service that can recover from the crash state. We analyse the QoS bounds for a failure detection service that monitors such a crash-recovery service. Our results show that the dependability metrics of the monitored service affect the QoS of the failure detection service. Simulation results corroborate the analysis and illustrate the bounds on the QoS.