AQuA: An Adaptive Architecture that Provides Dependable Distributed Objects

Authors:
Yansong (Jennifer) Ren;David E. Bakken;Tod Courtney;Michel Cukier;David A. Karr;Paul Rubel;Chetan Sabnis;William H. Sanders;Richard E. Schantz;Mouna Seri
Affiliations:
-;-;-;-;-;-;-;-;-;-
Venue:
IEEE Transactions on Computers
Year:
2003

Citing 25
Cited 17

A Metaobject Architecture for Fault-Tolerant Distributed Systems: The FRIENDS Approach

IEEE Transactions on Computers
The Maestro approach to building reliable interoperable distributed applications with multiple execution styles

Theory and Practice of Object Systems - Special issue high availability in CORBA
Consistent object replication in the eternal system

Theory and Practice of Object Systems - Special issue high availability in CORBA
Gateways for accessing fault tolerance domains

IFIP/ACM International Conference on Distributed systems platforms
An Adaptive Algorithm for Tolerating Value Faults and Crash Failures

IEEE Transactions on Parallel and Distributed Systems
Building Secure and Reliable Network Applications

Building Secure and Reliable Network Applications
Reliable Distributed Computing with the ISIS Toolkit

Reliable Distributed Computing with the ISIS Toolkit
Piranha: A CORBA Tool For High Availability

Computer
Using Interceptors to Enhance CORBA

Computer
Distributed Fault-Tolerant Real-Time Systems: The Mars Approach

IEEE Micro
The Rampart Toolkit for Building High-Integrity Services

Selected Papers from the International Workshop on Theory and Practice in Distributed Systems
State Synchronization and Recovery for Strongly Consistent Replicated CORBA Objects

DSN '01 Proceedings of the 2001 International Conference on Dependable Systems and Networks (formerly: FTCS)
Design and implemantation of a CORBA fault-tolerant object group service

Proceedings of the IFIP WG 6.1 International Working Conference on Distributed Applications and Interoperable Systems II
Proteus: A Flexible Infrastructure to Implement Adaptive Fault Tolerance in AQuA

DCCA '99 Proceedings of the conference on Dependable Computing for Critical Applications
DOORS: Towards High-Performance Fault Tolerant CORBA

DOA '00 Proceedings of the International Symposium on Distributed Objects and Applications
The design of a CORBA group communication service

SRDS '96 Proceedings of the 15th Symposium on Reliable Distributed Systems
The Chameleon Infrastructure for Adaptive, Software Implemented Fault Tolerance

SRDS '98 Proceedings of the The 17th IEEE Symposium on Reliable Distributed Systems
AQuA: An Adaptive Architecture that Provides Dependable Distributed Objects

SRDS '98 Proceedings of the The 17th IEEE Symposium on Reliable Distributed Systems
Specifying and Measuring Quality of Service in Distributed Object Systems

ISORC '98 Proceedings of the The 1st IEEE International Symposium on Object-Oriented Real-Time Distributed Computing
An Object-level Gateway Supporting Integrated-Property Quality of Service

ISORC '99 Proceedings of the 2nd IEEE International Symposium on Object-Oriented Real-Time Distributed Computing
The Totem System

FTCS '95 Proceedings of the Twenty-Fifth International Symposium on Fault-Tolerant Computing
Newtop: a fault-tolerant group communication protocol

ICDCS '95 Proceedings of the 15th International Conference on Distributed Computing Systems
Providing Support for Survivable CORBA Applications with the Immune System

ICDCS '99 Proceedings of the 19th IEEE International Conference on Distributed Computing Systems
Coyote: A System for Constructing Fine-Grain Configurable CommunicationServices

Coyote: A System for Constructing Fine-Grain Configurable CommunicationServices
The ensemble system

The ensemble system

Implementing a replicated service with group communication

Journal of Systems Architecture: the EUROMICRO Journal
Unification of Transactions and Replication in Three-Tier Architectures Based on CORBA

IEEE Transactions on Dependable and Secure Computing
MIDDLE-R: Consistent database replication at the middleware level

ACM Transactions on Computer Systems (TOCS)
End-to-end latency of a fault-tolerant CORBA infrastructure

Performance Evaluation
Resource management using multiple feedback loops in soft real-time distributed object systems

Journal of Systems and Software
FLARe: a Fault-tolerant Lightweight Adaptive Real-time middleware for distributed real-time and embedded systems

Proceedings of the 4th on Middleware doctoral symposium
Replica placement for high availability in distributed stream processing systems

Proceedings of the second international conference on Distributed event-based systems
Jgroup-ARM: a distributed object group platform with autonomous replication management

Software—Practice & Experience
Annotation Markers for Runtime Replication Protocol Selection

ATC '08 Proceedings of the 5th international conference on Autonomic and Trusted Computing
Dynamic planning and weaving of dependability concerns for self-adaptive ubiquitous services

Proceedings of the 2009 ACM symposium on Applied Computing
Using allopoietic agents in replicated software to respond to errors, faults, and attacks

Proceedings of the 48th Annual Southeast Regional Conference
Supporting component-based failover units in middleware for distributed real-time and embedded systems

Journal of Systems Architecture: the EUROMICRO Journal
Adaptive voting for balancing data integrity with availability

OTM'06 Proceedings of the 2006 international conference on On the Move to Meaningful Internet Systems: AWeSOMe, CAMS, COMINF, IS, KSinBIT, MIOS-CIAO, MONET - Volume Part II
Integrating fault tolerance and load balancing in distributed systems based on CORBA

EDCC'05 Proceedings of the 5th European conference on Dependable Computing
An approach to experimentally obtain service dependability characteristics of the Jgroup/ARM system

EDCC'05 Proceedings of the 5th European conference on Dependable Computing
The role of agreements in IT management software

Architecting Dependable Systems III
An architecture for self-healing autonomous object groups

ATC'07 Proceedings of the 4th international conference on Autonomic and Trusted Computing

Quantified Score

Hi-index	14.98

Visualization

Abstract

Building dependable distributed systems from commercial off-the-shelf components is of growing practical importance. For both cost and production reasons, there is interest in approaches and architectures that facilitate building such systems. The AQuA architecture is one such approach; its goal is to provide adaptive fault tolerance to CORBA applications by replicating objects. The AQuA architecture allows application programmers to request desired levels of dependability during applications' runtimes. It provides fault tolerance mechanisms to ensure that a CORBA client can always obtain reliable services, even if the CORBA server object that provides the desired services suffers from crash failures and value faults. AQuA includes a replicated dependability manager that provides dependability management by configuring the system in response to applications' requests and changes in system resources due to faults. It uses Maestro/Ensemble to provide group communication services. It contains a gateway to intercept standard CORBA IIOP messages to allow any standard CORBA application to use AQuA. It provides different types of replication schemes to forward messages reliably to the remote replicated objects. All of the replication schemes ensure strong data consistency among replicas. This paper describes the AQuA architecture and presents, in detail, the active replication pass-first scheme. In addition, the interface to the dependability manager and the design of the dependability manager replication are also described. Finally, we describe performance measurements that were conducted for the active replication pass-first scheme, and we present results from our study of fault detection, recovery, and blocking times.