Building dependable distributed systems from commercial off-the-shelf components is of growing practical importance. For both cost and production reasons, there is interest in approaches and architectures that facilitate building such systems. The AQuA architecture is one such approach; its goal is to provide adaptive fault tolerance to CORBA applications by replicating objects. AQuA allows application programmers to request desired levels of dependability at run time. It provides fault tolerance mechanisms to ensure that a CORBA client can always obtain reliable services, even if the CORBA server object that provides the desired services suffers from crash failures and value faults. AQuA includes a replicated dependability manager that configures the system in response to applications' requests and to changes in system resources caused by faults. It uses Maestro/Ensemble to provide group communication services. It contains a gateway that intercepts standard CORBA IIOP messages, so that any standard CORBA application can use AQuA. It provides several replication schemes for forwarding messages reliably to the remote replicated objects, all of which ensure strong data consistency among replicas. This paper describes the AQuA architecture and presents, in detail, the active replication pass-first scheme. We also describe the interface to the dependability manager and the design of the dependability manager's replication. Finally, we describe performance measurements that were conducted for the active replication pass-first scheme, and we present results from our study of fault detection, recovery, and blocking times.
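As a rough illustration of how a gateway might mask crash failures under active replication, the following minimal Python sketch sends each request to every replica and forwards only the first reply. It rests on an assumption that is not stated in the abstract: that "pass-first" means the first reply to arrive is passed to the client and later duplicates are suppressed. The names `PassFirstGateway`, `invoke`, `on_reply`, `healthy`, and `crashed` are hypothetical and are not part of AQuA, Maestro/Ensemble, or CORBA.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class PassFirstGateway:
    """Hypothetical gateway: sends a request to every replica and
    forwards only the first reply recorded for each request id."""
    replicas: List[Callable[[str], str]]
    delivered: Dict[int, str] = field(default_factory=dict)
    next_id: int = 0

    def on_reply(self, request_id: int, reply: str) -> bool:
        """Record a reply; return True only for the first one per request."""
        if request_id in self.delivered:
            return False  # duplicate from another replica, suppressed
        self.delivered[request_id] = reply
        return True

    def invoke(self, request: str) -> str:
        """Send the request to all replicas and return the first reply."""
        request_id = self.next_id
        self.next_id += 1
        for replica in self.replicas:
            try:
                reply = replica(request)
            except ConnectionError:
                continue  # assumption: a crashed replica raises ConnectionError
            self.on_reply(request_id, reply)
        if request_id not in self.delivered:
            raise RuntimeError("no replica replied")
        return self.delivered[request_id]


def healthy(request: str) -> str:
    """Stand-in for a working server replica."""
    return request.upper()


def crashed(request: str) -> str:
    """Stand-in for a replica that has suffered a crash failure."""
    raise ConnectionError("replica down")


gateway = PassFirstGateway(replicas=[crashed, healthy, healthy])
result = gateway.invoke("balance")
```

The sketch calls replicas sequentially in one process, whereas a real deployment would rely on a group communication layer for multicast and ordering. It also masks only crash failures: it does no voting on reply values, so it would not catch a value fault.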