Unanticipated runtime events, such as faults, can lead to missed deadlines in real-time systems. While it is not always possible to know when a fault will occur, we can sometimes exploit pre-fault "symptoms" to initiate proactive (rather than reactive) fault-recovery. In this paper, we describe the design and implementation of a proactive recovery strategy for distributed CORBA applications in the presence of resource-exhaustion faults. We analyze the effect of different proactive recovery schemes on client/server response times, and we demonstrate a significant reduction, both in jitter and in the number of client-side failures.
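
The idea of acting on a pre-fault symptom can be illustrated with a minimal sketch. It is not the paper's implementation: the `Replica` and `ProactiveMonitor` classes, the use of memory consumption as the symptom, and the 80% threshold are all assumptions chosen for illustration, and no CORBA machinery is modeled.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Replica:
    """Hypothetical server replica with a fixed memory budget."""
    name: str
    memory_limit: int      # total memory available (arbitrary units)
    memory_used: int = 0   # current consumption; grows as a resource leaks
    accepting: bool = True # False once clients have been moved away

    def usage(self) -> float:
        return self.memory_used / self.memory_limit


@dataclass
class ProactiveMonitor:
    """Moves clients to a standby before the active replica runs out of memory."""
    replicas: List[Replica]
    threshold: float = 0.8  # assumed symptom threshold, not taken from the paper
    log: List[str] = field(default_factory=list)

    def primary(self) -> Replica:
        # The first replica still accepting requests serves the clients.
        for replica in self.replicas:
            if replica.accepting:
                return replica
        raise RuntimeError("no healthy replica left")

    def check(self) -> Replica:
        """Return the replica clients should use after inspecting the symptom."""
        current = self.primary()
        if current.usage() >= self.threshold:
            standbys = [r for r in self.replicas
                        if r.accepting and r is not current]
            if standbys:
                # Proactive step: retire the replica while it still works.
                current.accepting = False
                self.log.append(f"migrating clients away from {current.name}")
                return standbys[0]
        return current
```

In this sketch a caller would invoke `check()` periodically or before each request. A reactive scheme would instead wait for the request to fail before switching replicas, which is where the client-side failures and added jitter would come from.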