Exploiting structure and utilizing agent-centric rewards to promote coordination in large multiagent systems

  • Authors:
  • Chris HolmesParker, Adrian Agogino, Kagan Tumer

  • Affiliations:
  • Oregon State University, Corvallis, Oregon, USA; NASA Ames Research Center, Moffett Field, California, USA; Oregon State University, Corvallis, Oregon, USA

  • Venue:
  • Proceedings of the 2013 International Conference on Autonomous Agents and Multi-agent Systems
  • Year:
  • 2013


Abstract

A goal within the field of multiagent systems is to scale to large systems involving hundreds or thousands of agents. In such systems, both the agents' communication requirements and the individual agents' ability to make good decisions play critical roles in performance. We take an incremental step toward improving scalability in such systems by introducing a novel algorithm that combines three well-known existing techniques to address both agent communication requirements and decision making in large multiagent systems. In particular, we couple a Factored-Action Factored Markov Decision Process (FA-FMDP) framework, which exploits problem structure and establishes localized rewards for agents (reducing communication requirements), with reinforcement learning using agent-centric difference rewards, which addresses agent decision making and promotes coordination by tackling the structural credit assignment problem. We demonstrate our algorithm's performance against two other popular reward structures (global and local rewards) in systems with up to 10,000 agents.
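
The agent-centric difference reward mentioned in the abstract is the standard counterfactual construction D_i(z) = G(z) - G(z_{-i}): agent i is rewarded by how much the global objective G changes when agent i's action is removed (or replaced by a fixed default action). The Python sketch below is a minimal illustration under assumed names; the coverage-style objective global_reward is a toy stand-in, not the paper's actual domain or implementation.

    def global_reward(actions):
        # Hypothetical team objective G(z): number of distinct resources
        # covered by the joint action (a toy stand-in for the real task).
        return len(set(actions))

    def difference_reward(actions, i, default_action=None):
        # D_i(z) = G(z) - G(z_{-i}): compare the global reward to the
        # counterfactual joint action in which agent i's action is
        # removed or replaced by a fixed default.
        counterfactual = list(actions)
        if default_action is None:
            counterfactual.pop(i)               # remove agent i entirely
        else:
            counterfactual[i] = default_action  # replace with a null action
        return global_reward(actions) - global_reward(counterfactual)

    # Example: agents 0 and 1 redundantly pick resource 'a', so neither is
    # credited for it; agent 2 uniquely covers 'b' and gets full credit.
    joint_action = ['a', 'a', 'b']
    print([difference_reward(joint_action, i) for i in range(3)])  # [0, 0, 1]

Because D_i isolates each agent's marginal contribution to G, it directly targets the structural credit assignment problem the abstract describes while remaining aligned with the global objective: an action that improves D_i also improves G.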