Value-function reinforcement learning in Markov games

Authors: Michael L. Littman
Affiliations: AT&T Labs Research, 180 Park Avenue, Florham Park, NJ 07932-0971, USA
Venue: Cognitive Systems Research
Year: 2001

Abstract

Markov games are a model of multiagent environments that is convenient for studying multiagent reinforcement learning. This paper describes a set of reinforcement-learning algorithms based on estimating value functions and presents convergence theorems for these algorithms. Its main contribution is to present the convergence theorems in a form that makes it easy to reason about the behavior of simultaneous learners in a shared environment.
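
As an illustration of what a value-function algorithm for a Markov game can look like, the update below sketches minimax-Q for a two-player zero-sum game. The abstract does not name the algorithms the paper covers, so this is an example of the general family and not a statement of the paper's contents. The notation is assumed here and does not come from the abstract: s and s' are the current and next states, a and o are the actions of the learner and its opponent, r is the learner's reward, α is a learning rate, γ is a discount factor, and Π(A) is the set of probability distributions over the learner's actions.

```latex
% Minimax-Q sketch for a two-player zero-sum Markov game (notation assumed).
% After observing the transition (s, a, o, r, s'):
\begin{align}
  Q(s, a, o) &\leftarrow (1 - \alpha)\, Q(s, a, o)
               + \alpha \bigl[\, r + \gamma\, V(s') \,\bigr], \\
  V(s')      &= \max_{\pi \in \Pi(A)} \; \min_{o' \in O} \;
               \sum_{a' \in A} \pi(a')\, Q(s', a', o').
\end{align}
```

The inner max-min is the value of a matrix game and is typically computed with a linear program. When there is only one agent, the minimization over o' disappears and the update reduces to ordinary Q-learning.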