Polymorphic self-* agents for stigmergic fault mitigation in large-scale real-time embedded systems
Proceedings of the fourth international joint conference on Autonomous agents and multiagent systems
The RTES project: BTeV, and beyond
RTC'05 Proceedings of the 14th IEEE-NPSS conference on Real time
Environment organization of roles using polymorphism
E4MAS'05 Proceedings of the 2nd international conference on Environments for Multi-Agent Systems
Hi-index | 0.00 |
This paper describes a comprehensive prototype of large-scale fault adaptive embedded software developed for the proposed Fermilab BTeV high energy physics experiment. Lightweight self-optimizing agents embedded within Level 1 of the prototype are responsible for proactive and reactive monitoring and mitigation based on specified layers of competence. The agents are self-protecting, detecting cascading failures using a distributed approach. Adaptive, reconfigurable, and mobile objects for reliablility are designed to be self-configuring to adapt automatically to dynamically changing environments. These objects provide a self-healing layer with the ability to discover, diagnose, and react to discontinuities in real-time processing. A generic modeling environment was developed to facilitate design and implementation of hardware resource specifications, application data flow, and failure mitigation strategies. Level 1 of the planned BTeV trigger system alone will consist of 2500 DSPs, so the number of components and intractable fault scenarios involved make it impossible to design an ýexpert systemý that applies traditional centralized mitigative strategies based on rules capturing every possible system state. Instead, a distributed reactive approach is implemented using the tools and methodologies developed by theReal-Time Embedded Systems group.