Proactive Recovery in Distributed CORBA Applications
DSN '04 Proceedings of the 2004 International Conference on Dependable Systems and Networks
SRDS '04 Proceedings of the 23rd IEEE International Symposium on Reliable Distributed Systems
The MEAD system that we are developing employs a synergistic combination of a reactive and a proactive fault-tolerance approach in order to address unanticipated events and hazards in real-time, fault-tolerant distributed systems. The reactive fault-tolerance approach involves active monitoring of the system to adapt the provided QoS and to allocate resources based on current conditions in the system. The proactive approach involves monitoring both the distributed applications and the network to seek precursors to imminent failures, and then triggering fault-recovery mechanisms in advance of the occurrence of the failure. The underlying ideas of the MEAD system have demonstrated initial promise through our enhanced capabilities to handle failures and unanticipated events, and to reduce jitter under faulty conditions.
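The proactive approach described above can be illustrated with a minimal sketch. This is not MEAD's implementation: the class `ProactiveMonitor`, the choice of a resource-usage fraction as the precursor signal, the 0.8 threshold, and the `on_precursor` recovery hook are all hypothetical names and values chosen for illustration. The sketch only shows the general pattern of watching a metric and invoking recovery before the failure itself occurs.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class ProactiveMonitor:
    """Hypothetical monitor that fires a recovery hook when a precursor is seen."""
    threshold: float                       # assumed precursor level (0.8 = 80% usage)
    on_precursor: Callable[[float], None]  # assumed recovery hook, e.g. migrate clients
    triggered: bool = False

    def observe(self, usage: float) -> bool:
        """Record one sample; trigger recovery at most once, ahead of the failure."""
        if not self.triggered and usage >= self.threshold:
            self.triggered = True
            self.on_precursor(usage)
        return self.triggered


# Usage: the hook here just records the sample that caused the trigger.
events: List[float] = []
monitor = ProactiveMonitor(threshold=0.8, on_precursor=events.append)
for sample in (0.35, 0.62, 0.81, 0.93):
    monitor.observe(sample)
```

In this sketch the hook runs once, on the first sample at or above the threshold, so later samples do not restart recovery. A real system would replace the hook with an actual recovery action and would likely combine several application and network signals.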