Computing the fault tolerance of multi-agent deployment

  • Authors:
  • Yingqian Zhang;Efrat Manisterski;Sarit Kraus;V. S. Subrahmanian;David Peleg

  • Affiliations:
  • Faculty of Electrical Engineering, Mathematics, and Computer Science, Delft University of Technology, 2628 CD Delft, The Netherlands;Department of Computer Science, Bar-Ilan University, Ramat Gan, 52900 Israel;Department of Computer Science, Bar-Ilan University, Ramat Gan, 52900 Israel and Department of Computer Science & UMIACS, University of Maryland, College Park, MD 20742, USA;Department of Computer Science & UMIACS, University of Maryland, College Park, MD 20742, USA;Department of Computer Science and Applied Mathematics, Weizmann Institute of Science, Rehovot 76100, Israel

  • Venue:
  • Artificial Intelligence
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

A deployment of a multi-agent system on a network refers to the placement of one or more copies of each agent on network hosts, in such a manner that the memory constraints of each node are satisfied. Finding the deployment that is most likely to tolerate faults (i.e. have at least one copy of each agent functioning and in communication with other agents) is a challenge. In this paper, we address the problem of finding the probability of survival of a deployment (i.e. the probability that a deployment will tolerate faults), under the assumption that node failures are independent. We show that the problem of computing the survival probability of a deployment is at least NP-hard. Moreover, it is hard to approximate. We produce two algorithms to accurately compute the probability of survival of a deployment-these algorithms are expectedly exponential. We also produce five heuristic algorithms to estimate survival probabilities-these algorithms work in acceptable time frames. We report on a detailed set of experiments to determine the conditions under which some of these algorithms perform better than the others.