The Evolving Philosophers Problem: Dynamic Change Management
IEEE Transactions on Software Engineering
A Classification and Comparison Framework for Software Architecture Description Languages
IEEE Transactions on Software Engineering
A graph based architectural (Re)configuration language
Proceedings of the 8th European software engineering conference held jointly with 9th ACM SIGSOFT international symposium on Foundations of software engineering
Model-based adaptation for self-healing systems
WOSS '02 Proceedings of the first workshop on Self-healing systems
xADL: Enabling Architecture-Centric Tool Integration with XML
HICSS '01 Proceedings of the 34th Annual Hawaii International Conference on System Sciences ( HICSS-34)-Volume 9 - Volume 9
Deployment and Dynamic Reconfiguration Planning for Distributed Software Systems
ICTAI '03 Proceedings of the 15th IEEE International Conference on Tools with Artificial Intelligence
Automated Planning: Theory & Practice
Automated Planning: Theory & Practice
Mae---a system model and environment for managing architectural evolution
ACM Transactions on Software Engineering and Methodology (TOSEM)
PDDL2.1: an extension to PDDL for expressing temporal planning domains
Journal of Artificial Intelligence Research
Dealing with failures during failure recovery of distributed systems
DEAS '05 Proceedings of the 2005 workshop on Design and evolution of autonomic application software
Computing optimal self-repair actions: damage minimization versus repair time
WADS '05 Proceedings of the 2005 workshop on Architecting dependable systems
From goals to components: a combined approach to self-management
Proceedings of the 2008 international workshop on Software engineering for adaptive and self-managing systems
An Architecture for Supporting Network Fault Recovery Management
AIMS '08 Proceedings of the 2nd international conference on Autonomous Infrastructure, Management and Security: Resilient Networks and Services
A Case Study in Goal-Driven Architectural Adaptation
Software Engineering for Self-Adaptive Systems
Failure resilient real-time data federation system
SpringSim '09 Proceedings of the 2009 Spring Simulation Multiconference
PLASMA: a plan-based layered architecture for software model-driven adaptation
Proceedings of the IEEE/ACM international conference on Automated software engineering
Case-based reasoning for autonomous service failure diagnosis and remediation in software systems
ECCBR'06 Proceedings of the 8th European conference on Advances in Case-Based Reasoning
Automatic undo for cloud management via AI planning
HotDep'12 Proceedings of the Eighth USENIX conference on Hot Topics in System Dependability
Supporting undoability in systems operations
LISA'13 Proceedings of the 27th international conference on Large Installation System Administration
Hi-index | 0.00 |
Failure recovery in distributed systems poses a difficult challenge because of the requirement for high availability. Failure scenarios are usually unpredictable so they can not easily be foreseen. In this research we propose a planning based approach to failure recovery. This approach automates failure recovery by capturing the state after failure, defining an acceptable recovered state as a goal and applying planning to get from the initial state to the goal state. By using planning, this approach can recover from a variety of failed states and reach any of several acceptable states: from minimal functionality to complete recovery.