Model-based adaptation for self-healing systems
WOSS '02 Proceedings of the first workshop on Self-healing systems
A biologically inspired programming model for self-healing systems
WOSS '02 Proceedings of the first workshop on Self-healing systems
Lessons from Giant-Scale Services
IEEE Internet Computing
Pinpoint: Problem Determination in Large, Dynamic Internet Services
DSN '02 Proceedings of the 2002 International Conference on Dependable Systems and Networks
Managing Dependencies in Component-Based Distributed Applications
FIDJI '01 Revised Papers from the International Workshop on Scientific Engineering for Distributed Java Applications
Static vs. Dynamic Recovery Models for Survivable Distributed Systems
HICSS '04 Proceedings of the Proceedings of the 37th Annual Hawaii International Conference on System Sciences (HICSS'04) - Track 2 - Volume 2
A planning based approach to failure recovery in distributed systems
WOSS '04 Proceedings of the 1st ACM SIGSOFT workshop on Self-managed systems
Robust reconfigurations of component assemblies
Proceedings of the 2013 International Conference on Software Engineering
Hi-index | 0.00 |
One of the characteristics of autonomic systems is self recovery from failures. Self recovery can be achieved through sensing failures, planning for recovery and executing the recovery plan to bring the system back to a normal state. For various reasons, however, additional failures are possible during the process of recovering from the initial failure. Handling such secondary failures is important because they can cause the original recovery plan to fail and can leave the system in a complicated state that is worse than before. In this paper techniques are identified to preserve consistency while dealing with such failures that occur during failure recovery.