The dynamics of reinforcement learning in cooperative multiagent systems

Authors:
Caroline Claus;Craig Boutilier
Affiliations:
-;-
Venue:
AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Year:
1998

Citing 8
Cited 170

Technical Note: \cal Q-Learning

Machine Learning
Learning to coordinate without sharing information

AAAI '94 Proceedings of the twelfth national conference on Artificial intelligence (vol. 1)
The dynamics of reinforcement learning in cooperative multiagent systems

AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Convergence Results for Single-Step On-PolicyReinforcement-Learning Algorithms

Machine Learning
Planning, learning and coordination in multiagent decision processes

TARK '96 Proceedings of the 6th conference on Theoretical aspects of rationality and knowledge
Reinforcement learning: a survey

Journal of Artificial Intelligence Research
Learning to coordinate actions in multi-agent systems

IJCAI'93 Proceedings of the 13th international joint conference on Artifical intelligence - Volume 1
Learning conventions in multiagent stochastic domains using likelihood estimates

UAI'96 Proceedings of the Twelfth international conference on Uncertainty in artificial intelligence

The dynamics of reinforcement learning in cooperative multiagent systems

AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
General principles of learning-based multi-agent systems

Proceedings of the third annual conference on Autonomous Agents
Conjectural Equilibrium in Multiagent Learning

Machine Learning
Adaptivity in agent-based routing for data networks

AGENTS '00 Proceedings of the fourth international conference on Autonomous agents
Multiagent learning using a variable learning rate

Artificial Intelligence
A multiagent reinforcement learning algorithm using extended optimal response

Proceedings of the first international joint conference on Autonomous agents and multiagent systems: part 1
Learning sequences of actions in collectives of autonomous agents

Proceedings of the first international joint conference on Autonomous agents and multiagent systems: part 1
Designing agent collectives for systems with markovian dynamics

Proceedings of the first international joint conference on Autonomous agents and multiagent systems: part 3
Learning to select a coordination mechanism

Proceedings of the first international joint conference on Autonomous agents and multiagent systems: part 3
Learning and decision: making for intention reconciliation

Proceedings of the first international joint conference on Autonomous agents and multiagent systems: part 3
A Model of Partially Observable State Game and its Optimality

Applied Intelligence
DQL: A New Updating Strategy for Reinforcement Learning Based on Q-Learning

EMCL '01 Proceedings of the 12th European Conference on Machine Learning
Social Agents Playing a Periodical Policy

EMCL '01 Proceedings of the 12th European Conference on Machine Learning
Convergent Gradient Ascent in General-Sum Games

ECML '02 Proceedings of the 13th European Conference on Machine Learning
Learning to Reach the Pareto Optimal Nash Equilibrium as a Team

AI '02 Proceedings of the 15th Australian Joint Conference on Artificial Intelligence: Advances in Artificial Intelligence
Sequential Strategy for Learning Multi-stage Multi-agent Collaborative Games

ICANN '01 Proceedings of the International Conference on Artificial Neural Networks
Learning Multi-agent Strategies in Multi-stage Collaborative Games

IDEAL '02 Proceedings of the Third International Conference on Intelligent Data Engineering and Automated Learning
Rationality Assumptions and Optimality of Co-learning

PRIMA '00 Proceedings of the Third Pacific Rim International Workshop on Multi-Agents: Design and Applications of Intelligent Agents
Karlsruhe Brainstormers - A Reinforcement Learning Approach to Robotic Soccer

RoboCup 2001: Robot Soccer World Cup V
Learning Mutual Trust

Proceedings of the workshop on Deception, Fraud, and Trust in Agent Societies held during the Autonomous Agents Conference: Trust in Cyber-societies, Integrating the Human and Artificial Perspectives
Implicit Negotiation in Repeated Games

ATAL '01 Revised Papers from the 8th International Workshop on Intelligent Agents VIII
Reinforcement learning of coordination in cooperative multi-agent systems

Eighteenth national conference on Artificial intelligence
Dispersion games: general definitions and some specific learning results

Eighteenth national conference on Artificial intelligence
Optimizing information exchange in cooperative multi-agent systems

AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems
Resource allocation games with changing resource capacities

AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems
Towards a pareto-optimal solution in general-sum games

AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems
Adaptive policy gradient in multiagent learning

AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems
Coordination in multiagent reinforcement learning: a Bayesian approach

AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems
Nash q-learning for general-sum stochastic games

The Journal of Machine Learning Research
Sparse cooperative Q-learning

ICML '04 Proceedings of the twenty-first international conference on Machine learning
A multi-agent system integrating reinforcement learning, bidding and genetic algorithms

Web Intelligence and Agent Systems
Learning when and how to coordinate

Web Intelligence and Agent Systems
Best-Response Multiagent Learning in Non-Stationary Environments

AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 2
The Role of Reactivity in Multiagent Learning

AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 2
Coordination through Mutual Notification in Cooperative Multiagent Reinforcement Learning

AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 3
Reinforcement Learning of Coordination in Heterogeneous Cooperative Multi-Agent Systems

AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 3
Reinforcement Learning for Stochastic Cooperative Multi-Agent Systems

AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 3
Efficient learning equilibrium

Artificial Intelligence
Asymmetric multiagent reinforcement learning

Web Intelligence and Agent Systems
A Model of Adaptation in Collaborative Multi-Agent Systems

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Theory of moves learners: towards non-myopic equilibria

Proceedings of the fourth international joint conference on Autonomous agents and multiagent systems
Cooperative Multi-Agent Learning: The State of the Art

Autonomous Agents and Multi-Agent Systems
An Evolutionary Dynamical Analysis of Multi-Agent Learning in Iterated Games

Autonomous Agents and Multi-Agent Systems
Learning against multiple opponents

AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
Selecting informative actions improves cooperative multiagent learning

AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
Multi-agent reinforcement learning algorithm to handle beliefs of other agents' policies and embedded beliefs

AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
Lenient learners in cooperative multiagent systems

AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
Can good learners always compensate for poor learners?

AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
Learning the task allocation game

AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
Multi-agent learning model with bargaining

Proceedings of the 38th conference on Winter simulation
A general criterion and an algorithmic framework for learning in multi-agent systems

Machine Learning
AWESOME: A general multiagent learning algorithm that converges in self-play and learns a best response against stationary opponents

Machine Learning
Gradient descent for symmetric and asymmetric multiagent reinforcement learning

Web Intelligence and Agent Systems
If multi-agent learning is the answer, what is the question?

Artificial Intelligence
Perspectives on multiagent learning

Artificial Intelligence
Collaborative Multiagent Reinforcement Learning by Payoff Propagation

The Journal of Machine Learning Research
Exploring selfish reinforcement learning in repeated games with stochastic rewards

Autonomous Agents and Multi-Agent Systems
Reactivity and Safe Learning in Multi-Agent Systems

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
A layered approach to learning coordination knowledge in multiagent environments

Applied Intelligence
Generalized multiagent learning with performance bound

Autonomous Agents and Multi-Agent Systems
Model-Based Reinforcement Learning for Partially Observable Games with Sampling-Based State Estimation

Neural Computation
Theoretical advantages of lenient Q-learners: an evolutionary game theoretic perspective

Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Multiagent learning in adaptive dynamic systems

Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Theoretical Advantages of Lenient Learners: An Evolutionary Game Theoretic Perspective

The Journal of Machine Learning Research
A few good agents: multi-agent social learning

Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 1
Using the Simulated Annealing Algorithm for Multiagent Decision Making

RoboCup 2006: Robot Soccer World Cup X
Competition and Coordination in Stochastic Games

CAI '07 Proceedings of the 20th conference of the Canadian Society for Computational Studies of Intelligence on Advances in Artificial Intelligence
Multi-agent Learning Dynamics: A Survey

CIA '07 Proceedings of the 11th international workshop on Cooperative Information Agents XI
VWM: An Improvement to Multiagent Coordination in Highly Dynamic Environments

MATES '07 Proceedings of the 5th German conference on Multiagent System Technologies
A Learning Automata Approach to Multi-agent Policy Gradient Learning

KES '08 Proceedings of the 12th international conference on Knowledge-Based Intelligent Information and Engineering Systems, Part II
Online Multiagent Learning against Memory Bounded Adversaries

ECML PKDD '08 Proceedings of the 2008 European Conference on Machine Learning and Knowledge Discovery in Databases - Part I
Optimistic-Pessimistic Q-Learning Algorithm for Multi-Agent Systems

MATES '08 Proceedings of the 6th German conference on Multiagent System Technologies
An adaptive policy gradient in learning Nash equilibria

Neurocomputing
Individual and Social Behaviour in the IPA Market with RL

SBIA '08 Proceedings of the 19th Brazilian Symposium on Artificial Intelligence: Advances in Artificial Intelligence
Opportunities for multiagent systems and multiagent reinforcement learning in traffic control

Autonomous Agents and Multi-Agent Systems
Learning the IPA market with individual and social rewards

Web Intelligence and Agent Systems
Dynamic analysis of multiagent Q-learning with ε-greedy exploration

ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
Multiagent reinforcement learning: algorithm converging to Nash equilibrium in general-sum discounted stochastic games

Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
Multiagent learning in large anonymous games

Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
Learning of coordination: exploiting sparse interactions in multiagent systems

Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
Multi-Agent Reinforcement Learning Algorithm with Variable Optimistic-Pessimistic Criterion

Proceedings of the 2008 conference on ECAI 2008: 18th European Conference on Artificial Intelligence
Performance bounded reinforcement learning in strategic interactions

AAAI'04 Proceedings of the 19th national conference on Artifical intelligence
Point-based dynamic programming for DEC-POMDPs

AAAI'06 proceedings of the 21st national conference on Artificial intelligence - Volume 2
Utility based Q-learning to facilitate cooperation in Prisoner's Dilemma games

Web Intelligence and Agent Systems
Efficient no-regret multiagent learning

AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 1
Coordination and adaptation in impromptu teams

AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 1
Optimal efficient learning equilibrium: imperfect monitoring in symmetric games

AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 2
Towards an adaptive approach for distributed resource allocation in a multi-agent system for solving dynamic vehicle routing problems

AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 2
Collective intelligence, data routing and braess' paradox

Journal of Artificial Intelligence Research
Learning to Coordinate Efficiently: a model-based approach

Journal of Artificial Intelligence Research
Decentralized control of cooperative systems: categorization and complexity analysis

Journal of Artificial Intelligence Research
Existence of multiagent equilibria with limited agents

Journal of Artificial Intelligence Research
Reinforcement learning for agents with many sensors and actuators acting in categorizable environments

Journal of Artificial Intelligence Research
A multiagent reinforcement learning algorithm with non-linear dynamics

Journal of Artificial Intelligence Research
Predicting and preventing coordination problems in cooperative Q-learning systems

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Improving coevolutionary search for optimal multiagent behaviors

IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Simultaneous adversarial multi-robot learning

IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Reinforcement learning in distributed domains: beyond team games

IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2
Rational and convergent learning in stochastic games

IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2
Learning against opponents with bounded memory

IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
Effective learning in the presence of adaptive counterparts

Journal of Algorithms
A multi-agent learning approach to online distributed resource allocation

IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
Multiagent Reinforcement Learning with Spiking and Non-Spiking Agents in the Iterated Prisoner's Dilemma

ICANN '09 Proceedings of the 19th International Conference on Artificial Neural Networks: Part I
Anytime Self-play Learning to Satisfy Functional Optimality Criteria

ADT '09 Proceedings of the 1st International Conference on Algorithmic Decision Theory
Hybrid Q-learning algorithm about cooperation in MAS

CCDC'09 Proceedings of the 21st annual international conference on Chinese control and decision conference
Design of semi-decentralized control laws for distributed-air-jet micromanipulators by reinforcement learning

IROS'09 Proceedings of the 2009 IEEE/RSJ international conference on Intelligent robots and systems
Cooperative multi-robot reinforcement learning: a framework in hybrid state space

IROS'09 Proceedings of the 2009 IEEE/RSJ international conference on Intelligent robots and systems
Learning in groups of traffic signals

Engineering Applications of Artificial Intelligence
Optimal convergence in multi-agent MDPs

KES'07/WIRN'07 Proceedings of the 11th international conference, KES 2007 and XVII Italian workshop on neural networks conference on Knowledge-based intelligent information and engineering systems: Part III
Adaptation in games with many co-evolving agents

EPIA'07 Proceedings of the aritficial intelligence 13th Portuguese conference on Progress in artificial intelligence
Convergence of independent adaptive learners

EPIA'07 Proceedings of the aritficial intelligence 13th Portuguese conference on Progress in artificial intelligence
Towards a taxonomy of decision making problems in multi-agent systems

MATES'09 Proceedings of the 7th German conference on Multiagent system technologies
Reinforcement learning approaches to coordination in cooperative multi-agent systems

Adaptive agents and multi-agent systems
Cooperative learning using advice exchange

Adaptive agents and multi-agent systems
Coevolution of heterogeneous multi-robot teams

Proceedings of the 12th annual conference on Genetic and evolutionary computation
To teach or not to teach?: decision making under uncertainty in ad hoc teams

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Learning multi-agent state space representations

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Evolving policy geometry for scalable multiagent learning

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Action discovery for reinforcement learning

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Coordinated learning in multiagent MDPs with infinite state-space

Autonomous Agents and Multi-Agent Systems
From cognition to docition: The teaching radio paradigm for distributed & autonomous deployments

Computer Communications
Multi-policy optimization in self-organizing systems

SOAR'09 Proceedings of the First international conference on Self-organizing architectures
Social conformity and its convergence for reinforcement learning

MATES'10 Proceedings of the 8th German conference on Multiagent system technologies
Evolutionary dynamics of regret minimization

ECML PKDD'10 Proceedings of the 2010 European conference on Machine learning and knowledge discovery in databases: Part II
Generalized learning automata for multi-agent reinforcement learning

AI Communications - European Workshop on Multi-Agent Systems (EUMAS) 2009
Solving multi-stage games with hierarchical learning automata that bootstrap

ALAMAS'05/ALAMAS'06/ALAMAS'07 Proceedings of the 5th , 6th and 7th European conference on Adaptive and learning agents and multi-agent systems: adaptation and multi-agent learning
Improving space representation in multiagent learning via tile coding

SBIA'10 Proceedings of the 20th Brazilian conference on Advances in artificial intelligence
Theoretical convergence guarantees for cooperative coevolutionary algorithms

Evolutionary Computation
A novel multi-agent reinforcement learning approach for job scheduling in Grid computing

Future Generation Computer Systems
Speeding up learning automata based multi agent systems using the concepts of stigmergy and entropy

Expert Systems with Applications: An International Journal
The world of independent learners is not markovian

International Journal of Knowledge-based and Intelligent Engineering Systems
A Multi-agent-based voltage control in power systems using distributed reinforcement learning

Simulation
Decentralized MDPs with sparse interactions

Artificial Intelligence
Multiagent learning in large anonymous games

Journal of Artificial Intelligence Research
Theoretical considerations of potential-based reward shaping for multi-agent systems

The 10th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Learning to cooperate via policy search

UAI'00 Proceedings of the Sixteenth conference on Uncertainty in artificial intelligence
A momentum-based approach to learning nash equilibria

PRIMA'06 Proceedings of the 9th Pacific Rim international conference on Agent Computing and Multi-Agent Systems
Feature extraction for decision-theoretic planning in partially observable environments

ICANN'06 Proceedings of the 16th international conference on Artificial Neural Networks - Volume Part I
Meta-game equilibrium for multi-agent reinforcement learning

AI'04 Proceedings of the 17th Australian joint conference on Advances in Artificial Intelligence
Multi-agent case-based reasoning for cooperative reinforcement learners

ECCBR'06 Proceedings of the 8th European conference on Advances in Case-Based Reasoning
Coordinating learning agents for multiple resource job scheduling

ALA'09 Proceedings of the Second international conference on Adaptive and Learning Agents
Learning automata as a basis for multi agent reinforcement learning

LAMAS'05 Proceedings of the First international conference on Learning and Adaption in Multi-Agent Systems
Learning pareto-optimal solutions in 2x2 conflict games

LAMAS'05 Proceedings of the First international conference on Learning and Adaption in Multi-Agent Systems
The success and failure of tag-mediated evolution of cooperation

LAMAS'05 Proceedings of the First international conference on Learning and Adaption in Multi-Agent Systems
Multi-agent relational reinforcement learning

LAMAS'05 Proceedings of the First international conference on Learning and Adaption in Multi-Agent Systems
A convergent multiagent reinforcement learning approach for a subclass of cooperative stochastic games

ALA'11 Proceedings of the 11th international conference on Adaptive and Learning Agents
Multi-agent reinforcement learning for simulating pedestrian navigation

ALA'11 Proceedings of the 11th international conference on Adaptive and Learning Agents
Heterogeneous populations of learning agents in the minority game

ALA'11 Proceedings of the 11th international conference on Adaptive and Learning Agents
Solving sparse delayed coordination problems in multi-agent reinforcement learning

ALA'11 Proceedings of the 11th international conference on Adaptive and Learning Agents
A brief introduction to agent mining

Autonomous Agents and Multi-Agent Systems
Transfer learning in multi-agent reinforcement learning domains

EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Coordination guided reinforcement learning

Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Comparative evaluation of MAL algorithms in a diverse set of ad hoc team problems

Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Dynamic potential-based reward shaping

Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Combining independent and joint learning: a negotiation based approach

Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 3
GRiDA: GReen Distributed Algorithm for energy-efficient IP backbone networks

Computer Networks: The International Journal of Computer and Telecommunications Networking
Distributed learning of best response behaviors in concurrent iterated many-object negotiations

MATES'12 Proceedings of the 10th German conference on Multiagent System Technologies
Continuous strategy replicator dynamics for multi-agent Q-learning

Autonomous Agents and Multi-Agent Systems
Local coordination in online distributed constraint optimization problems

EUMAS'11 Proceedings of the 9th European conference on Multi-Agent Systems
Orchestrating multiagent learning of penalty games

SBIA'12 Proceedings of the 21st Brazilian conference on Advances in Artificial Intelligence
Performance of distributed multi-agent multi-state reinforcement spectrum management using different exploration schemes

Expert Systems with Applications: An International Journal
Addressing the policy-bias of q-learning by repeating updates

Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Distributed relational temporal difference learning

Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Coordinating multi-agent reinforcement learning with limited communication

Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Reinforcement learning for decentralized planning under uncertainty

Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Achieving Socially Optimal Outcomes in Multiagent Systems with Reinforcement Social Learning

ACM Transactions on Autonomous and Adaptive Systems (TAAS)
Teaching and leading an ad hoc teammate: Collaboration without pre-coordination

Artificial Intelligence
The dynamics of reinforcement social learning in cooperative multiagent systems

IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Multiagent learning in the presence of memory-bounded agents

Autonomous Agents and Multi-Agent Systems
Exploration strategies in n-Person general-sum multiagent reinforcement learning with sequential action selection

Intelligent Data Analysis

Quantified Score

Hi-index	0.00

Visualization

Abstract

Reinforcement learning can provide a robust and natural means for agents to learn how to coordinate their action choices in multi agent systems. We examine some of the factors that can influence the dynamics of the learning process in such a setting. We first distinguish reinforcement learners that are unaware of (or ignore) the presence of other agents from those that explicitly attempt to learn the value of joint actions and the strategies of their counterparts. We study (a simple form of) Q-leaming in cooperative multi agent systems under these two perspectives, focusing on the influence of that game structure and exploration strategies on convergence to (optimal and suboptimal) Nash equilibria. We then propose alternative optimistic exploration strategies that increase the likelihood of convergence to an optimal equilibrium.