DARX: a self-healing framework for agents

Authors:
Olivier Marin;Marin Bertier;Pierre Sens;Zahia Guessoum;Jean-Pierre Briot
Affiliations:
Laboratoire d'Informatique de Paris 6, University Paris 6, CNRS, Paris Cedex 05, France and INRIA-Rocquencourt, Rocquencourt, France;Unité de recherche INRIA Rennes, IRISA, Rennes Cedex, France;Laboratoire d'Informatique de Paris 6, University Paris 6, CNRS, Paris Cedex 05, France and INRIA-Rocquencourt, Rocquencourt, France;Laboratoire d'Informatique de Paris 6, University Paris 6, CNRS, Paris Cedex 05, France and INRIA-Rocquencourt, Rocquencourt, France;Laboratoire d'Informatique de Paris 6, University Paris 6, CNRS, Paris Cedex 05, France
Venue:
Proceedings of the 12th Monterey conference on Reliable systems on unreliable networked platforms
Year:
2005

Citing 14
Cited 0

Communicating sequential processes

Communicating sequential processes
Unreliable failure detectors for reliable distributed systems

Journal of the ACM (JACM)
Horus: a flexible group communication system

Communications of the ACM
Chameleon: A Software Infrastructure for Adaptive Fault Tolerance

IEEE Transactions on Parallel and Distributed Systems
Improving fault-tolerance by replicating agents

Proceedings of the first international joint conference on Autonomous agents and multiagent systems: part 2
From Active Objects to Autonomous Agents

IEEE Concurrency
Software-Based Replication for Fault Tolerance

Computer
MASIF: The OMG Mobile Agent System Interoperability Facility

MA '98 Proceedings of the Second International Workshop on Mobile Agents
Fault-Tolerant Execution of Mobile Agents

DSN '00 Proceedings of the 2000 International Conference on Dependable Systems and Networks (formerly FTCS-30 and DCCA-8)
FATOMAS-A Fault-Tolerant Mobile Agent System Based on the Agent-Dependent Approach

DSN '01 Proceedings of the 2001 International Conference on Dependable Systems and Networks (formerly: FTCS)
Implementation and Performance Evaluation of an Adaptable Failure Detector

DSN '02 Proceedings of the 2002 International Conference on Dependable Systems and Networks
A Sentinel Approach to Fault Handling in Multi-Agent Systems

Revised Papers from the Second Australian Workshop on Distributed Artificial Intelligence: Multi-Agent Systems: Methodologies and Applications
The Adaptive Agent Architecture: Achieving Fault-Tolerance Using Persistent Broker Teams

ICMAS '00 Proceedings of the Fourth International Conference on MultiAgent Systems (ICMAS-2000)
Adaptive replication of large-scale multi-agent systems: towards a fault-tolerant multi-agent platform

SELMAS '05 Proceedings of the fourth international workshop on Software engineering for large-scale multi-agent systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper presents DARX, our framework for building failure-resilient applications through adaptive fault tolerance. It relies on the fact that multi-agent platforms constitute a very strong basis for decentralized software that is both flexible and scalable, and makes the assumption that the relative importance of each agent varies during the course of the computation. DARX regroups solutions which facilitate the creation of multi-agent applications in a large-scale context. Its most important feature is adaptive replication: replication strategies are applied on a per-agent basis with respect to transient environment characteristics such as the importance of the agent for the computation, the network load or the mean time between failures. Firstly, the interwoven concerns of multi-agent systems and fault-tolerant solutions are put forward. An overview of the DARX architecture follows, as well as an evaluation of its performances. We conclude, after outlining the promising outcomes, by presenting prospective work.