An Adaptive Algorithm for Tolerating Value Faults and Crash Failures
IEEE Transactions on Parallel and Distributed Systems
AQuA: An Adaptive Architecture that Provides Dependable Distributed Objects
IEEE Transactions on Computers
A Dynamic Replica Selection Algorithm for Tolerating Timing Faults
DSN '01 Proceedings of the 2001 International Conference on Dependable Systems and Networks (formerly: FTCS)
A taxonomy of software architecture-based reliability efforts
Proceedings of the 2010 ICSE Workshop on Sharing and Reusing Architectural Knowledge
Hi-index | 0.00 |
Building dependable distributed systems from commercial off-the-shelf components is of growing practical importance. For both cost and produc-tion reasons, there is interest in approaches and architectures that facilitate building such systems. The AQuA architecture is one such approach; its goal is to provide adaptive fault tolerance to CORBA applications by repli-cating objects, providing a high-level method for applications to specify their desired dependability, and providing a dependability manager that attempts to reconfigure a system at runtime so that dependability requests are satisfied. This paper describes how dependability is provided in AQuA. In particular, we describe Proteus, the part of AQuA that dynamically man-ages replicated distributed objects to make them dependable. Given a de-pendability request, Proteus chooses a fault tolerance approach and recon-figures the system to try to meet the request. The infrastructure of Proteus is described in this paper, along with its use in implementing active replica-tion and a simple dependability policy.