A Comprehensive Survey of Multiagent Reinforcement Learning

Authors:
L. Busoniu;R. Babuska;B. De Schutter
Affiliations:
Delft Univ. of Technol., Delft;-;-
Venue:
IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews
Year:
2008

Citing 0
Cited 65

Q-Learning Based on Dynamical Structure Neural Network for Robot Navigation in Unknown Environment

ISNN 2009 Proceedings of the 6th International Symposium on Neural Networks: Advances in Neural Networks - Part III
Globally Optimal Multi-agent Reinforcement Learning Parameters in Distributed Task Assignment

WI-IAT '09 Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 02
Swarm Reinforcement Learning Algorithm Based on Particle Swarm Optimization Whose Personal Bests Have Lifespans

ICONIP '09 Proceedings of the 16th International Conference on Neural Information Processing: Part II
Fuzzy ant colony optimization for optimal control

ACC'09 Proceedings of the 2009 conference on American Control Conference
Multi-agent Q-learning of channel selection in multi-user cognitive radio systems: a two by two case

SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
A novel technique to design a fuzzy logic controller using Q(λ)-learning and genetic algorithms in the pursuit-evasion game

SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
A novel hybrid learning technique applied to a self-learning multi-robot system

SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
Probability Collectives: A multi-agent approach for solving combinatorial optimization problems

Applied Soft Computing
A multi-agent reinforcement learning approach to path selection in optical burst switching networks

ICC'09 Proceedings of the 2009 IEEE international conference on Communications
Distributed, heterogeneous, multi-agent social coordination via reinforcement learning

ROBIO'09 Proceedings of the 2009 international conference on Robotics and biomimetics
Evolving policy geometry for scalable multiagent learning

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Intelligent negotiation behaviour model for an open railway access market

Expert Systems with Applications: An International Journal
An adaptive Q-learning algorithm developed for agent-based computational modeling of electricity market

IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews
Self-learning fuzzy logic controllers for pursuit-evasion differential games

Robotics and Autonomous Systems
Online planning for multi-agent systems with bounded communication

Artificial Intelligence
Multiagent Q-learning for aloha-like spectrum access in cognitive radio systems

EURASIP Journal on Wireless Communications and Networking
A note on the learning effect in multi-agent optimization

Expert Systems with Applications: An International Journal
Swarm reinforcement learning method based on an actor-critic method

SEAL'10 Proceedings of the 8th international conference on Simulated evolution and learning
Speeding up learning automata based multi agent systems using the concepts of stigmergy and entropy

Expert Systems with Applications: An International Journal
The world of independent learners is not markovian

International Journal of Knowledge-based and Intelligent Engineering Systems
Learning chasing behaviours of non-player characters in games using SARSA

EvoApplications'11 Proceedings of the 2011 international conference on Applications of evolutionary computation - Volume Part I
Concurrent modular Q-learning with local rewards on linked multi-component robotic systems

IWINAC'11 Proceedings of the 4th international conference on Interplay between natural and artificial computation - Volume Part I
Learning from experience to generate new regulations

COIN@AAMAS'10 Proceedings of the 6th international conference on Coordination, organizations, institutions, and norms in agent systems
Towards concurrent Q-learning on linked multi-component robotic systems

HAIS'11 Proceedings of the 6th international conference on Hybrid artificial intelligent systems - Volume Part II
Machine learning and agents

KES-AMSTA'11 Proceedings of the 5th KES international conference on Agent and multi-agent systems: technologies and applications
Theoretical considerations of potential-based reward shaping for multi-agent systems

The 10th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Modeling agents and agent systems

Transactions on computational collective intelligence V
Evaluation of an automated mechanism for generating new regulations

CAEPIA'11 Proceedings of the 14th international conference on Advances in artificial intelligence: spanish association for artificial intelligence
Market self-organization under limited information

CAEPIA'11 Proceedings of the 14th international conference on Advances in artificial intelligence: spanish association for artificial intelligence
Coordination of cooperation policies in a peer-to-peer system using swarm-based RL

Journal of Network and Computer Applications
Reconciling strategic and tactical decision making in agent-oriented simulation of vehicles in urban traffic

Proceedings of the 4th International ICST Conference on Simulation Tools and Techniques
Learning-Based spectrum selection in cognitive radio ad hoc networks

WWIC'10 Proceedings of the 8th international conference on Wired/Wireless Internet Communications
Multi-agent reinforcement learning for simulating pedestrian navigation

ALA'11 Proceedings of the 11th international conference on Adaptive and Learning Agents
The single processor total weighted completion time scheduling problem with the sum-of-processing-time based learning model

Information Sciences: an International Journal
A probability collectives approach with a feasibility-based rule for constrained optimization

Applied Computational Intelligence and Soft Computing
Dyna-H: A heuristic planning reinforcement learning algorithm applied to role-playing game strategy decision systems

Knowledge-Based Systems
Using experience to generate new regulations

IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume One
Rewards for pairs of Q-learning agents conducive to turn-taking in medium-access games

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Multi-agent differential graphical games: Online adaptive learning solution for synchronization with optimality

Automatica (Journal of IFAC)
Dynamic potential-based reward shaping

Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Partially decentralized reinforcement learning in finite, multi-agent Markov decision processes

AI Communications
Levels of realism for cooperative multi-agent reinforcement learning

ICSI'12 Proceedings of the Third international conference on Advances in Swarm Intelligence - Volume Part I
Distributed self-organizing bandwidth allocation for priority-based bus communication

Concurrency and Computation: Practice & Experience
Machine learning in agent-based stochastic simulation: Inferential theory and evaluation in transportation logistics

Computers & Mathematics with Applications
Exploiting independent relationships in multiagent systems for coordinated learning

PRICAI'12 Proceedings of the 12th Pacific Rim international conference on Trends in Artificial Intelligence
Multirobot behavior synchronization through direct neural network communication

ICIRA'12 Proceedings of the 5th international conference on Intelligent Robotics and Applications - Volume Part II
Review: A survey on interference management techniques in femtocell self-organizing networks

Journal of Network and Computer Applications
Continuous strategy replicator dynamics for multi-agent Q-learning

Autonomous Agents and Multi-Agent Systems
Multi-agent learning and the reinforcement gradient

EUMAS'11 Proceedings of the 9th European conference on Multi-Agent Systems
Game designers training first person shooter bots

AI'12 Proceedings of the 25th Australasian joint conference on Advances in Artificial Intelligence
Undesired state-action prediction in multi-agent reinforcement learning for linked multi-component robotic system control

Information Sciences: an International Journal
Performance of distributed multi-agent multi-state reinforcement spectrum management using different exploration schemes

Expert Systems with Applications: An International Journal
Makespan minimization flowshop with position dependent job processing times-computational complexity and solution algorithms

Computers and Operations Research
Machine learning for interactive systems and robots: a brief introduction

Proceedings of the 2nd Workshop on Machine Learning for Interactive Systems: Bridging the Gap Between Perception, Action and Communication
Incremental clustering and expansion for faster optimal planning in decentralized POMDPs

Journal of Artificial Intelligence Research
A reinforcement learning-based routing for delay tolerant networks

Engineering Applications of Artificial Intelligence
A personalized QoE-aware handover decision based on distributed reinforcement learning

Wireless Networks
Collaborative multi-agent reinforcement learning based on a novel coordination tree frame with dynamic partition

Engineering Applications of Artificial Intelligence
An actor-critic algorithm for multi-agent learning in queue-based stochastic games

Neurocomputing
Radigost: Interoperable web-based multi-agent platform

Journal of Systems and Software
Distributed Learning for Planning Under Uncertainty Problems with Heterogeneous Teams

Journal of Intelligent and Robotic Systems
Multiagent meta-level control for radar coordination

Web Intelligence and Agent Systems
A survey of multi-objective sequential decision-making

Journal of Artificial Intelligence Research
Exact and parallel metaheuristic algorithms for the single processor total weighted completion time scheduling problem with the sum-of-processing-time based models

Computers and Operations Research
Exploration strategies in n-Person general-sum multiagent reinforcement learning with sequential action selection

Intelligent Data Analysis

Quantified Score

Hi-index	0.01

Visualization

Abstract

Multiagent systems are rapidly finding applications in a variety of domains, including robotics, distributed control, telecommunications, and economics. The complexity of many tasks arising in these domains makes them difficult to solve with preprogrammed agent behaviors. The agents must, instead, discover a solution on their own, using learning. A significant part of the research on multiagent learning concerns reinforcement learning techniques. This paper provides a comprehensive survey of multiagent reinforcement learning (MARL). A central issue in the field is the formal statement of the multiagent learning goal. Different viewpoints on this issue have led to the proposal of many different goals, among which two focal points can be distinguished: stability of the agents' learning dynamics, and adaptation to the changing behavior of the other agents. The MARL algorithms described in the literature aim---either explicitly or implicitly---at one of these two goals or at a combination of both, in a fully cooperative, fully competitive, or more general setting. A representative selection of these algorithms is discussed in detail in this paper, together with the specific issues that arise in each category. Additionally, the benefits and challenges of MARL are described along with some of the problem domains where the MARL techniques have been applied. Finally, an outlook for the field is provided.