Gradient descent for symmetric and asymmetric multiagent reinforcement learning

Authors:
Ville Kö/nö/nen
Affiliations:
Neural Networks Research Centre, Helsinki University of Technology, P.O. Box 5400, FI-02015 HUT, Finland. Tel.: +358 9 451 5024/ Fax: +358 9 451 3277/ E-mail: ville.kononen@hut.fi
Venue:
Web Intelligence and Agent Systems
Year:
2005

Citing 19
Cited 1

Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning

Machine Learning
The dynamics of reinforcement learning in cooperative multiagent systems

AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Adaptivity in agent-based routing for data networks

AGENTS '00 Proceedings of the fourth international conference on Autonomous agents
Using collective intelligence to route Internet traffic

Proceedings of the 1998 conference on Advances in neural information processing systems II
Gradient descent for general reinforcement learning

Proceedings of the 1998 conference on Advances in neural information processing systems II
Multiagent learning using a variable learning rate

Artificial Intelligence
Machine Learning

Machine Learning
Introduction to Reinforcement Learning

Introduction to Reinforcement Learning
Friend-or-Foe Q-learning in General-Sum Games

ICML '01 Proceedings of the Eighteenth International Conference on Machine Learning
Pricing in Agent Economies Using Neural Networks and Multi-agent Q-Learning

Sequence Learning - Paradigms, Algorithms, and Applications
Learning to Cooperate via Policy Search

UAI '00 Proceedings of the 16th Conference on Uncertainty in Artificial Intelligence
Nash Convergence of Gradient Dynamics in General-Sum Games

UAI '00 Proceedings of the 16th Conference on Uncertainty in Artificial Intelligence
The Optimal Reward Baseline for Gradient-Based Reinforcement Learning

UAI '01 Proceedings of the 17th Conference in Uncertainty in Artificial Intelligence
Reinforcement learning of coordination in cooperative multi-agent systems

Eighteenth national conference on Artificial intelligence
Asymmetric Multiagent Reinforcement Learning

IAT '03 Proceedings of the IEEE/WIC International Conference on Intelligent Agent Technology
Nash q-learning for general-sum stochastic games

The Journal of Machine Learning Research
Practical Bilevel Optimization: Algorithms and Applications (Nonconvex Optimization and Its Applications)

Practical Bilevel Optimization: Algorithms and Applications (Nonconvex Optimization and Its Applications)
AWESOME: A general multiagent learning algorithm that converges in self-play and learns a best response against stationary opponents

Machine Learning
Complexity results about Nash equilibria

IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence

A Learning Automata Approach to Multi-agent Policy Gradient Learning

KES '08 Proceedings of the 12th international conference on Knowledge-Based Intelligent Information and Engineering Systems, Part II

Quantified Score

Hi-index	0.00

Visualization

Abstract

A gradient-based method for both symmetric and asymmetric multiagent reinforcement learning is introduced in this paper. Symmetric multiagent reinforcement learning addresses the problem with agents involved in the learning task having equal information states. Respectively, in asymmetric multiagent reinforcement learning, the information states are not equal, i.e. some agents (leaders) try to encourage agents with less information (followers) to select actions that lead to improved overall utility values for the leaders. In both cases, there are a huge number of parameters to learn and we thus need to use some parametric function approximation methods to represent the value functions of the agents. The method proposed in this paper is based on the VAPS framework that is extended to utilize the theory of Markov games, which is a natural basis of multiagent reinforcement learning.