The ability to adapt dynamically to changing runtime conditions is important when designing distributed systems in which the negotiated quality of service (QoS) cannot always be delivered between processes. Providing fault tolerance in such dynamic environments is challenging. In this context, this paper proposes an adaptive programming model for fault-tolerant distributed computing that provides upper-layer applications with process state information according to the current system synchrony (or QoS). The underlying system model is hybrid, composed of a synchronous part (where there are time bounds on processing speed and message delay) and an asynchronous part (where there is no time bound). This composition can vary over time; in particular, the system may become totally asynchronous (e.g., when the underlying system QoS degrades) or totally synchronous. Moreover, processes are not required to share the same view of the system synchrony at a given time. To illustrate what can be done in this programming model and how to use it, the consensus problem is taken as a benchmark. The paper also presents an implementation of the model that relies on negotiated QoS for communication channels.
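To make the hybrid model more concrete, the following is a minimal Python sketch of how a single process might track which peers currently fall in the synchronous part and which in the asynchronous part. It is not the paper's interface: the class `LocalView`, its methods, and the labels `"not_suspected"`, `"crashed"`, and `"uncertain"` are hypothetical names introduced only for illustration. The sketch relies on one standard observation: a silent peer can be declared crashed only when a time bound holds for it, whereas without a bound silence may just mean slowness.

```python
from dataclasses import dataclass, field


@dataclass
class LocalView:
    """Hypothetical per-process view of the hybrid system (not the paper's API)."""
    timely: set = field(default_factory=set)    # peers whose time bounds currently hold
    untimely: set = field(default_factory=set)  # peers with no time bound right now

    def degrade(self, peer):
        """The QoS towards `peer` was lost: move it to the asynchronous part."""
        self.timely.discard(peer)
        self.untimely.add(peer)

    def restore(self, peer):
        """The QoS towards `peer` holds again: move it to the synchronous part."""
        self.untimely.discard(peer)
        self.timely.add(peer)

    def mode(self):
        """Summarize how this process currently sees the system."""
        if not self.untimely:
            return "synchronous"
        if not self.timely:
            return "asynchronous"
        return "hybrid"

    def classify(self, peer, silent_for, bound):
        """Classify `peer` after `silent_for` seconds without a message."""
        if silent_for <= bound:
            return "not_suspected"
        # Silence beyond the bound is conclusive only if the bound really holds.
        return "crashed" if peer in self.timely else "uncertain"


view = LocalView(timely={"p1", "p2"})
view.degrade("p2")  # the channel to p2 lost its negotiated QoS
print(view.mode(), view.classify("p1", 5.0, 2.0), view.classify("p2", 5.0, 2.0))
```

Because each process would hold its own `LocalView`, two processes can disagree about the current synchrony, which mirrors the statement that processes need not share the same view at a given time.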