Coordination in multiagent reinforcement learning systems by virtual reinforcement signals

  • Authors:
  • M. A. S. Kamal;Junichi Murata

  • Affiliations:
  • (Correspd. Tel.: +603 6196 4417/ Fax: +603 6196 4433/ E-mail: maskamal@ieee.org) Department of Mechatronics Engineering, International Islamic University Malaysia, PO Box 10, 50728 Kuala Lumpur, M ...;Department of Electrical and Electronic Systems Engineering, Kyushu University, 744 Motooka, Nishi-Ku, Fukuoka 819-0395, Japan

  • Venue:
  • International Journal of Knowledge-based and Intelligent Engineering Systems
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents a novel method for on-line coordination in multiagent reinforcement learning systems. In this method a reinforcement-learning agent learns to select its action estimating system dynamics in terms of both the natural reward for task achievement and the virtual reward for cooperation. The virtual reward for cooperation is ascertained dynamically by a coordinating agent who estimates it from the change in degree of cooperation of all agents using a separate reinforcement learning. This technique provides adaptive coordination, requires less communication and ensures agents to be cooperative. The validity of virtual rewards for convergence in learning is verified, and the proposed method is tested on two different simulated domains to illustrate its significance. The empirical performance of the coordinated system compared to the uncoordinated system illustrates its advantages for multiagent systems.