Distributed learning of best response behaviors in concurrent iterated many-object negotiations

  • Authors:
  • Jan Ole Berndt;Otthein Herzog

  • Affiliations:
  • Center for Computing and Communication Technologies (TZI), Universität Bremen, Germany;Center for Computing and Communication Technologies (TZI), Universität Bremen, Germany

  • Venue:
  • MATES'12 Proceedings of the 10th German conference on Multiagent System Technologies
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Iterated negotiations are a well-established method for coordinating distributed activities in multiagent systems. However, if several of these take place concurrently, the participants' activities can mutually influence each other. In order to cope with the problem of interrelated interaction outcomes in partially observable environments, we apply distributed reinforcement learning to concurrent many-object negotiations. To this end, we discuss iterated negotiations from the perspective of repeated games, specify the agents' learning behavior, and introduce decentral decision-making criteria for terminating a negotiation. Furthermore, we empirically evaluate the approach in a multiagent resource allocation scenario. The results show that our method enables the agents to successfully learn mutual best response behaviors which approximate Nash equilibrium allocations. Additionally, the learning constrains the required interaction effort for attaining these results.