Multiagent systems are rapidly finding applications in a variety of domains, including robotics, distributed control, telecommunications, and economics. The complexity of many tasks arising in these domains makes them difficult to solve with preprogrammed agent behaviors. The agents must instead discover a solution on their own, through learning. A significant part of the research on multiagent learning concerns reinforcement learning techniques. This paper provides a comprehensive survey of multiagent reinforcement learning (MARL). A central issue in the field is the formal statement of the multiagent learning goal. Different viewpoints on this issue have led to the proposal of many different goals, among which two focal points can be distinguished: stability of the agents' learning dynamics, and adaptation to the changing behavior of the other agents. The MARL algorithms described in the literature aim, either explicitly or implicitly, at one of these two goals or at a combination of both, in a fully cooperative, fully competitive, or more general setting. A representative selection of these algorithms is discussed in detail in this paper, together with the specific issues that arise in each category. Additionally, the benefits and challenges of MARL are described, along with some of the problem domains where MARL techniques have been applied. Finally, an outlook for the field is provided.