In this paper we explore the minimal system requirements to implement unreliable failure detectors. We first consider systems formed by lossy asynchronous and eventually timely links. For these systems we define two properties, the Weak Property and the Strong Property, depending on whether all correct processes can be reached through links that are not lossy asynchronous from one correct process or from every correct process, respectively. We present necessary conditions based on these properties. We show that no algorithm implements ◇S, Ω, or S (resp. ◇P or P) if we allow a single failure in a system that, when all processes are correct, does not satisfy the Weak (resp. Strong) Property. Then, we propose an algorithm that implements ◇P if the Strong Property is satisfied, and ◇S (and Ω with an additional assumption) if only the Weak Property is satisfied. For systems formed by synchronous and lossy asynchronous links only, we propose another algorithm that implements detector class P4 if the Strong Property is satisfied, and implements a new detector class S0 (and Ω with an additional assumption) if only the Weak Property is satisfied.
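As an informal illustration of the two properties, the following Python sketch checks them on a small link graph. It is only one possible reading of the abstract, not the paper's formal definitions: it assumes that links are directed, that each link carries a single type label, and that a path may relay only through correct processes. All names (`LOSSY`, `TIMELY`, `reachable`, `weak_property`, `strong_property`) are hypothetical.

```python
from collections import deque

# Hypothetical link-type labels; the abstract names the link types but
# gives no encoding for them.
LOSSY = "lossy_asynchronous"
TIMELY = "eventually_timely"


def reachable(source, links, correct):
    """Correct processes reachable from `source` over links that are not
    lossy asynchronous, relaying only through correct processes
    (an assumption of this sketch)."""
    seen = {source}
    queue = deque([source])
    while queue:
        p = queue.popleft()
        for (src, dst), kind in links.items():
            if src == p and kind != LOSSY and dst in correct and dst not in seen:
                seen.add(dst)
                queue.append(dst)
    return seen


def weak_property(links, correct):
    """Some correct process reaches all correct processes."""
    return any(reachable(p, links, correct) == set(correct) for p in correct)


def strong_property(links, correct):
    """Every correct process reaches all correct processes."""
    return all(reachable(p, links, correct) == set(correct) for p in correct)


# Directed links between three processes; only a -> b -> c is non-lossy.
links = {
    ("a", "b"): TIMELY,
    ("b", "c"): TIMELY,
    ("c", "a"): LOSSY,
    ("b", "a"): LOSSY,
    ("c", "b"): LOSSY,
}
correct = {"a", "b", "c"}

weak_only = (weak_property(links, correct), strong_property(links, correct))

# Closing the ring with a non-lossy link c -> a lets every process reach all.
ring = dict(links)
ring[("c", "a")] = TIMELY
ring_result = (weak_property(ring, correct), strong_property(ring, correct))
```

In the first graph only process `a` reaches everyone, so under these assumptions the Weak Property holds and the Strong Property does not. Once the link from `c` to `a` is non-lossy, both hold.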