Design & analysis of fault tolerant digital systems
Design & analysis of fault tolerant digital systems
Understanding fault-tolerant distributed systems
Communications of the ACM
High-Availability Computer Systems
Computer
Handbook of software reliability engineering
Handbook of software reliability engineering
Dependable Computing Systems: Paradigms, Performance Issues, and Applications (Wiley Series on Parallel and Distributed Computing)
Service-Oriented Architecture: Concepts, Technology, and Design
Service-Oriented Architecture: Concepts, Technology, and Design
The N-Version Approach to Fault-Tolerant Software
IEEE Transactions on Software Engineering
Towards fault tolerance in web services compositions
Proceedings of the 2007 workshop on Engineering fault tolerant systems
Iterative Service Orchestration based on Dependability Attributes
SEAA '08 Proceedings of the 2008 34th Euromicro Conference Software Engineering and Advanced Applications
Application-Layer Fault-Tolerance Protocols
Application-Layer Fault-Tolerance Protocols
SOA Design Patterns
An adaptive QoS-aware fault tolerance strategy for web services
Empirical Software Engineering
Software assumptions failure tolerance: role, strategies, and visions
Architecting dependable systems VII
Robust-and-evolvable resilient software systems: open problems and lessons learned
Proceedings of the 8th workshop on Assurances for self-adaptive systems
WSDM-enabled autonomic augmentation of classical multi-version software fault-tolerance mechanisms
SAFECOMP'12 Proceedings of the 2012 international conference on Computer Safety, Reliability, and Security
A systematic review of design diversity-based solutions for fault-tolerant SOAs
Proceedings of the 17th International Conference on Evaluation and Assessment in Software Engineering
Hi-index | 0.00 |
Software components are expected to exhibit highly dependable characteristics in mission-critical applications, particularly in the areas of reliability and timeliness. Redundancy-based fault-tolerant strategies have long been used as a means to avoid a disruption in the service provided by the system in spite of the occurrence of failures in the underlying components. Adopting these fault-tolerance strategies in highly dynamic distributed computing systems, in which components often suffer from long response times or temporary unavailability, does not necessarily result in the anticipated improvement in dependability. In fact, as these dependability strategies are usually statically predefined and immutable, a change in the operational status (context) of any of the components involved may very well jeopardise the schemes' overall effectiveness. In this paper, a novel dependability strategy is introduced supporting advanced redundancy management, aiming to autonomously tune its internal configuration in view of changes in context. It is apparent from our preliminary experimentation that this strategy can effectively achieve an optimal trade-off between service reliability and performance-related factors such as timeliness and the degree of redundancy employed. A prototypical service-oriented implementation of the proposed adaptive fault tolerant strategy is presented thereafter, leveraging WS-* specifications to gather and disseminate contextual information.