Towards context-aware adaptive fault tolerance in SOA applications

Authors:
Jonas Buys;Vincenzo De Florio;Chris Blondia
Affiliations:
University of Antwerp, Antwerp, Belgium;University of Antwerp, Antwerp, Belgium;University of Antwerp, Antwerp, Belgium
Venue:
Proceedings of the 5th ACM international conference on Distributed event-based system
Year:
2011

Citing 13
Cited 3

Design & analysis of fault tolerant digital systems

Design & analysis of fault tolerant digital systems
Understanding fault-tolerant distributed systems

Communications of the ACM
High-Availability Computer Systems

Computer
Handbook of software reliability engineering

Handbook of software reliability engineering
Dependable Computing Systems: Paradigms, Performance Issues, and Applications (Wiley Series on Parallel and Distributed Computing)

Dependable Computing Systems: Paradigms, Performance Issues, and Applications (Wiley Series on Parallel and Distributed Computing)
Service-Oriented Architecture: Concepts, Technology, and Design

Service-Oriented Architecture: Concepts, Technology, and Design
The N-Version Approach to Fault-Tolerant Software

IEEE Transactions on Software Engineering
Towards fault tolerance in web services compositions

Proceedings of the 2007 workshop on Engineering fault tolerant systems
Iterative Service Orchestration based on Dependability Attributes

SEAA '08 Proceedings of the 2008 34th Euromicro Conference Software Engineering and Advanced Applications
Application-Layer Fault-Tolerance Protocols

Application-Layer Fault-Tolerance Protocols
SOA Design Patterns

SOA Design Patterns
An adaptive QoS-aware fault tolerance strategy for web services

Empirical Software Engineering
Software assumptions failure tolerance: role, strategies, and visions

Architecting dependable systems VII

Robust-and-evolvable resilient software systems: open problems and lessons learned

Proceedings of the 8th workshop on Assurances for self-adaptive systems
WSDM-enabled autonomic augmentation of classical multi-version software fault-tolerance mechanisms

SAFECOMP'12 Proceedings of the 2012 international conference on Computer Safety, Reliability, and Security
A systematic review of design diversity-based solutions for fault-tolerant SOAs

Proceedings of the 17th International Conference on Evaluation and Assessment in Software Engineering

Quantified Score

Hi-index	0.00

Visualization

Abstract

Software components are expected to exhibit highly dependable characteristics in mission-critical applications, particularly in the areas of reliability and timeliness. Redundancy-based fault-tolerant strategies have long been used as a means to avoid a disruption in the service provided by the system in spite of the occurrence of failures in the underlying components. Adopting these fault-tolerance strategies in highly dynamic distributed computing systems, in which components often suffer from long response times or temporary unavailability, does not necessarily result in the anticipated improvement in dependability. In fact, as these dependability strategies are usually statically predefined and immutable, a change in the operational status (context) of any of the components involved may very well jeopardise the schemes' overall effectiveness. In this paper, a novel dependability strategy is introduced supporting advanced redundancy management, aiming to autonomously tune its internal configuration in view of changes in context. It is apparent from our preliminary experimentation that this strategy can effectively achieve an optimal trade-off between service reliability and performance-related factors such as timeliness and the degree of redundancy employed. A prototypical service-oriented implementation of the proposed adaptive fault tolerant strategy is presented thereafter, leveraging WS-* specifications to gather and disseminate contextual information.