Towards a fault-tolerant multi-agent system architecture
AGENTS '00 Proceedings of the fourth international conference on Autonomous agents
Distributed Systems: Principles and Paradigms
Distributed Systems: Principles and Paradigms
FANTOMAS: Fault Tolerance for Mobile Agents in Clusters
IPDPS '00 Proceedings of the 15 IPDPS 2000 Workshops on Parallel and Distributed Processing
Exploiting Non-Determinism for Reliability of Mobile Agent Systems
DSN '00 Proceedings of the 2000 International Conference on Dependable Systems and Networks (formerly FTCS-30 and DCCA-8)
Fault Tolerance in Scalable Agent Support Systems: Integrating DARX in the AgentScape Framework
CCGRID '03 Proceedings of the 3st International Symposium on Cluster Computing and the Grid
Emergence versus self-organisation: different concepts but promising when combined
Engineering Self-Organising Systems
Analysis and design of physical and social contexts in multi-agent systems
Software Engineering for Multi-Agent Systems IV
Grounding social interactions in the environment
E4MAS'05 Proceedings of the 2nd international conference on Environments for Multi-Agent Systems
Computing the fault tolerance of multi-agent deployment
Artificial Intelligence
Using allopoietic agents in replicated software to respond to errors, faults, and attacks
Proceedings of the 48th Annual Southeast Regional Conference
IMS 10-Validation of a co-evolving diagnostic algorithm for evolvable production systems
Engineering Applications of Artificial Intelligence
Hi-index | 0.00 |
High reliability and availability are of utmost importance to the majority of today's software-based systems. Typically, some kind of redundancy is used in order to achieve high reliability and availability in case of faults. Dynamic redundancy setups employ backup systems which take over as soon as the primary system fails. Additionally, central fault tolerance management systems are often used to guide the replication process. In this paper, we present an approach which relies on emergent fault-tolerance. The desired behavior is the result of local reconfigurations of self-organizing agents. This allows us to abandon the classical distinction of primary and backup systems as well as the need for a central management system. Our experimental results show the benefit of our approach.