A multiagent cooperative learning algorithm

Authors:
Fei Liu;Guangzhou Zeng
Affiliations:
School of Computer Science and Technology, Shandong University, Jinan, P.R. China;School of Computer Science and Technology, Shandong University, Jinan, P.R. China
Venue:
CSCWD'06 Proceedings of the 10th international conference on Computer supported cooperative work in design III
Year:
2006

Citing 7
Cited 0

Evolutionary learning of communicating agents

Information Sciences—Informatics and Computer Science: An International Journal
Multi-agent Robot Learning by Means of Genetic Programming: Solving an Escape Problem

ICES '01 Proceedings of the 4th International Conference on Evolvable Systems: From Biology to Hardware
Convergence Problems of General-Sum Multiagent Reinforcement Learning

ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
Multi-Agent Reinforcement Learning for Planning and Scheduling Multiple Goals

ICMAS '00 Proceedings of the Fourth International Conference on MultiAgent Systems (ICMAS-2000)
When evolving populations is better than coevolving individuals: the blind mice problem

IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Reinforcement learning in distributed domains: beyond team games

IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2
Expertness based cooperative Q-learning

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics

Quantified Score

Hi-index	0.00

Visualization

Abstract

Some multiagent learning methods simply extend reinforcement learning to multiple agents. In these methods, large state and action spaces are the most difficult problems. Moreover, previous proposals for using learning techniques to coordinate multiple agents have mostly relied on explicit or implicit information sharing, which makes cooperation affected by communication delays and the reliability of the information received. A Multiagent Cooperative Learning Algorithm (MCLA) is presented to solve these problems. In MCLA, an evaluating strategy based on long-time reward is proposed. Thus each agent acts independently and autonomously by perceiving and estimating each other. It also considers the learning process from the holistic point of view to obtain the optimum associated action strategy in order to reduce the state and action spaces. A series of simulations are provided to demonstrate the performance of the proposed algorithm.