Interval-based markov decision processes for regulating interactions between two agents in multi-agent systems

  • Authors:
  • Graçaliz P. Dimuro;Antônio C. R. Costa

  • Affiliations:
  • Escola de Informática, Universidade Católica de Pelotas, Pelotas, Brazil;,Escola de Informática, Universidade Católica de Pelotas, Pelotas, Brazil

  • Venue:
  • PARA'04 Proceedings of the 7th international conference on Applied Parallel Computing: state of the Art in Scientific Computing
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

This work presents a model for Markov Decision Processes applied to the problem of keeping two agents in equilibrium with respect to the values they exchange when they interact. Interval mathematics is used to model the qualitative values involved in interactions. The optimal policy is constrained by the adopted model of social interactions. The MDP is assigned to a supervisor, that monitors the agents' actions and makes recommendations to keep them in equilibrium. The agents are autonomous and allowed to not follow the recommendations. Due to the qualitative nature of the exchange values, even when agents follow the recommendations, the decision process is non-trivial.