An architecture for self-healing autonomous object groups

Authors:
Hein Meling
Affiliations:
Department of Electrical Engineering and Computer Science, University of Stavanger, Stavanger, Norway
Venue:
ATC'07 Proceedings of the 4th international conference on Autonomic and Trusted Computing
Year:
2007

Citing 12
Cited 0

The implementation of a CORBA object group service

Theory and Practice of Object Systems - Special issue high availability in CORBA
Group communication specifications: a comprehensive study

ACM Computing Surveys (CSUR)
Distributed Fault Tolerance: Lessons from Delta-4

IEEE Micro
AQuA: An Adaptive Architecture that Provides Dependable Distributed Objects

IEEE Transactions on Computers
Towards Upgrading Actively Replicated Servers On-the-Fly

COMPSAC '02 Proceedings of the 26th International Computer Software and Applications Conference on Prolonging Software Life: Development and Redevelopment
A Low Latency, Loss Tolerant Architecture and Protocol for Wide Area Group Communication

DSN '00 Proceedings of the 2000 International Conference on Dependable Systems and Networks (formerly FTCS-30 and DCCA-8)
Eternal: a component-based framework for transparent fault-tolerant CORBA

Software—Practice & Experience - Special issue: Enterprise frameworks
DOORS: Towards High-Performance Fault Tolerant CORBA

DOA '00 Proceedings of the International Symposium on Distributed Objects and Applications
System Support for Programming Object-Oriented Dependable Applications in Partitionable Systems (Ph.D. Thesis)

System Support for Programming Object-Oriented Dependable Applications in Partitionable Systems (Ph.D. Thesis)
Autonomic Computing

Autonomic Computing
An approach to experimentally obtain service dependability characteristics of the Jgroup/ARM system

EDCC'05 Proceedings of the 5th European conference on Dependable Computing
Policy-based management of networked computing systems

IEEE Communications Magazine

Quantified Score

Hi-index	0.00

Visualization

Abstract

Jgroup/ARM is a middleware for developing and operating dependable distributed Java applications. Jgroup integrates the distributed object model of Java RMI with the object group paradigm, enabling construction of replicated servers that offer dependable services to clients. ARM aims to improve the dependability characteristics of systems through fault treatment, focusing on operational aspects where the gain in terms of improved dependability is likely to be the greatest. ARM offers two core mechanisms: recovery from node, object and network failures and distribution of replicas. ARM identifies failures and reconfigures the system according to its dependability requirements. This paper proposes an enhancement of the ARM framework in which replica placement is performed in a distributed manner, eliminating the need for a centralized manager with global information about all object groups. Instead each autonomous object group handles their own replica placement based on information from nodes. Assuming that multiple objects groups are deployed in the system, this constitutes a distributed replica placement scheme. This scheme enables the implementation of self-healing object groups that can perform fault treatment on themselves. Advantages of the approach: (a) no need to maintain global information about all object groups which is costly and limits scalability, (b) reduced infrastructure complexity, and (c) less communication overhead.