Multiagent learning using a variable learning rate

Authors:
Affiliations:
Venue:
Artificial Intelligence
Year:
2002

Citing 16
Cited 129

Learning to coordinate without sharing information

AAAI '94 Proceedings of the twelfth national conference on Artificial intelligence (vol. 1)
Competitive Markov decision processes

Competitive Markov decision processes
On-line learning and the metrical task system problem

COLT '97 Proceedings of the tenth annual conference on Computational learning theory
The dynamics of reinforcement learning in cooperative multiagent systems

AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Gradient descent for general reinforcement learning

Proceedings of the 1998 conference on Advances in neural information processing systems II
Convergence Results for Single-Step On-PolicyReinforcement-Learning Algorithms

Machine Learning
Introduction to Reinforcement Learning

Introduction to Reinforcement Learning
Convergence of Gradient Dynamics with a Variable Learning Rate

ICML '01 Proceedings of the Eighteenth International Conference on Machine Learning
Multiagent Reinforcement Learning: Theoretical Framework and an Algorithm

ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
Convergence Problems of General-Sum Multiagent Reinforcement Learning

ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
Nash Convergence of Gradient Dynamics in General-Sum Games

UAI '00 Proceedings of the 16th Conference on Uncertainty in Artificial Intelligence
Reinforcement Learning in POMDP's via Direct Gradient Ascent

ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
Dynamic Programming

Dynamic Programming
Learning in dynamic noncooperative multiagent systems

Learning in dynamic noncooperative multiagent systems
Reinforcement learning: a survey

Journal of Artificial Intelligence Research
Rational and convergent learning in stochastic games

IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2

Convergent Gradient Ascent in General-Sum Games

ECML '02 Proceedings of the 13th European Conference on Machine Learning
Approximation Techniques in Multiagent Learning

Proceedings of the 5th International Symposium on Abstraction, Reformulation and Approximation
Online oblivious routing

Proceedings of the fifteenth annual ACM symposium on Parallel algorithms and architectures
Adaptive policy gradient in multiagent learning

AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems
Multi-agent learning in extensive games with complete information

AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems
Nash q-learning for general-sum stochastic games

The Journal of Machine Learning Research
Communication complexity as a lower bound for learning in games

ICML '04 Proceedings of the twenty-first international conference on Machine learning
The Role of Reactivity in Multiagent Learning

AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 2
Learning to Communicate and Act Using Hierarchical Reinforcement Learning

AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 3
Asymmetric multiagent reinforcement learning

Web Intelligence and Agent Systems
Efficient learning of multi-step best response

Proceedings of the fourth international joint conference on Autonomous agents and multiagent systems
Rapid on-line temporal sequence prediction by an adaptive agent

Proceedings of the fourth international joint conference on Autonomous agents and multiagent systems
Theory of moves learners: towards non-myopic equilibria

Proceedings of the fourth international joint conference on Autonomous agents and multiagent systems
Cooperative Multi-Agent Learning: The State of the Art

Autonomous Agents and Multi-Agent Systems
Learning to compete, compromise, and cooperate in repeated general-sum games

ICML '05 Proceedings of the 22nd international conference on Machine learning
Learning from induced changes in opponent (re)actions in multi-agent games

AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
Learning against multiple opponents

AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
Learning to cooperate in multi-agent social dilemmas

AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
RVσ(t): a unifying approach to performance and convergence in online multiagent learning

AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
Can good learners always compensate for poor learners?

AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
Learning the task allocation game

AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
Learning to commit in repeated games

AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
Resolution-Based Policy Search for Imperfect Information Differential Games

IAT '06 Proceedings of the IEEE/WIC/ACM international conference on Intelligent Agent Technology
Fuzzy Policy Reinforcement Learning in Cooperative Multi-robot Systems

Journal of Intelligent and Robotic Systems
Multi-agent learning model with bargaining

Proceedings of the 38th conference on Winter simulation
Dimensions of complexity of intelligent agents

PCAR '06 Proceedings of the 2006 international symposium on Practical cognitive agents and robots
A general criterion and an algorithmic framework for learning in multi-agent systems

Machine Learning
AWESOME: A general multiagent learning algorithm that converges in self-play and learns a best response against stationary opponents

Machine Learning
Introduction to the special issue on learning and computational game theory

Machine Learning
Gradient descent for symmetric and asymmetric multiagent reinforcement learning

Web Intelligence and Agent Systems
Perspectives on multiagent learning

Artificial Intelligence
Reaching pareto-optimality in prisoner's dilemma using conditional joint action learning

Autonomous Agents and Multi-Agent Systems
Reactivity and Safe Learning in Multi-Agent Systems

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
A layered approach to learning coordination knowledge in multiagent environments

Applied Intelligence
Generalized multiagent learning with performance bound

Autonomous Agents and Multi-Agent Systems
Multiagent reinforcement learning and self-organization in a network of agents

Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Multiagent learning in adaptive dynamic systems

Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Reinforcement learning in extensive form games with incomplete information: the bargaining case study

Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Advice taking in multiagent reinforcement learning

Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Emergence of Norms with Biased Interactions in Heterogeneous Agent Societies

WI-IATW '07 Proceedings of the 2007 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology - Workshops
A fuzzy constraint-based agent negotiation with opponent learning

ACOS'07 Proceedings of the 6th Conference on WSEAS International Conference on Applied Computer Science - Volume 6
Fairness in multi-agent systems

The Knowledge Engineering Review
Norm emergence under constrained interactions in diverse societies

Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 2
Artificial agents learning human fairness

Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 2
A few good agents: multi-agent social learning

Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 1
Non-linear dynamics in multiagent reinforcement learning algorithms

Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 3
Using adaptive consultation of experts to improve convergence rates in multiagent learning

Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 3
A Novel Method of Constructing ANN

ISNN '07 Proceedings of the 4th international symposium on Neural Networks: Part II--Advances in Neural Networks
Competition and Coordination in Stochastic Games

CAI '07 Proceedings of the 20th conference of the Canadian Society for Computational Studies of Intelligence on Advances in Artificial Intelligence
A Learning Automata Approach to Multi-agent Policy Gradient Learning

KES '08 Proceedings of the 12th international conference on Knowledge-Based Intelligent Information and Engineering Systems, Part II
Recognizing the Enemy: Combining Reinforcement Learning with Strategy Selection Using Case-Based Reasoning

ECCBR '08 Proceedings of the 9th European conference on Advances in Case-Based Reasoning
Optimistic-Pessimistic Q-Learning Algorithm for Multi-Agent Systems

MATES '08 Proceedings of the 6th German conference on Multiagent System Technologies
An adaptive policy gradient in learning Nash equilibria

Neurocomputing
Individual and Social Behaviour in the IPA Market with RL

SBIA '08 Proceedings of the 19th Brazilian Symposium on Artificial Intelligence: Advances in Artificial Intelligence
Meta-level Control of Multiagent Learning in Dynamic Repeated Resource Sharing Problems

PRICAI '08 Proceedings of the 10th Pacific Rim International Conference on Artificial Intelligence: Trends in Artificial Intelligence
Stability of learning dynamics in two-agent, imperfect-information games

Proceedings of the tenth ACM SIGEVO workshop on Foundations of genetic algorithms
COOPERATIVE LEARNING BY POLICY-SHARING IN MULTIPLE AGENTS

Cybernetics and Systems
Reinforcement Learning: A Tutorial Survey and Recent Advances

INFORMS Journal on Computing
Learning the IPA market with individual and social rewards

Web Intelligence and Agent Systems
Multiagent reinforcement learning: algorithm converging to Nash equilibrium in general-sum discounted stochastic games

Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
Multi-Agent Reinforcement Learning Algorithm with Variable Optimistic-Pessimistic Criterion

Proceedings of the 2008 conference on ECAI 2008: 18th European Conference on Artificial Intelligence
Learning-Rate Adjusting Q-Learning for Two-Person Two-Action Symmetric Games

KES-AMSTA '09 Proceedings of the Third KES International Symposium on Agent and Multi-Agent Systems: Technologies and Applications
Tentative Exploration on Reinforcement Learning Algorithms for Stochastic Rewards

HAIS '09 Proceedings of the 4th International Conference on Hybrid Artificial Intelligence Systems
Performance bounded reinforcement learning in strategic interactions

AAAI'04 Proceedings of the 19th national conference on Artifical intelligence
Utility based Q-learning to facilitate cooperation in Prisoner's Dilemma games

Web Intelligence and Agent Systems
Efficient no-regret multiagent learning

AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 1
Non-stationary policy learning in 2-player zero sum games

AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 2
RETALIATE: learning winning policies in first-person shooter games

IAAI'07 Proceedings of the 19th national conference on Innovative applications of artificial intelligence - Volume 2
Existence of multiagent equilibria with limited agents

Journal of Artificial Intelligence Research
A multiagent reinforcement learning algorithm with non-linear dynamics

Journal of Artificial Intelligence Research
Emergence of norms through social learning

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Simultaneous adversarial multi-robot learning

IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Learning against opponents with bounded memory

IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
Two-sided bandits and the dating market

IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
Effective learning in the presence of adaptive counterparts

Journal of Algorithms
Anytime Self-play Learning to Satisfy Functional Optimality Criteria

ADT '09 Proceedings of the 1st International Conference on Algorithmic Decision Theory
Adaptive Learning in Systems of Interacting Agents

WINE '09 Proceedings of the 5th International Workshop on Internet and Network Economics
Review article: Synergizing reinforcement learning and game theory-A new direction for control

Applied Soft Computing
Probability Collectives: A multi-agent approach for solving combinatorial optimization problems

Applied Soft Computing
Modeling opponent's beliefs via fuzzy constraint-directed approach in agent negotiation

ICIC'07 Proceedings of the intelligent computing 3rd international conference on Advanced intelligent computing theories and applications
Cooperation between multiple agents based on partially sharing policy

ICIC'07 Proceedings of the intelligent computing 3rd international conference on Advanced intelligent computing theories and applications
Approximation guarantees for fictitious play

Allerton'09 Proceedings of the 47th annual Allerton conference on Communication, control, and computing
Multi-agent reinforcement learning and chimpanzee hunting

ROBIO'09 Proceedings of the 2009 international conference on Robotics and biomimetics
Distributed, heterogeneous, multi-agent social coordination via reinforcement learning

ROBIO'09 Proceedings of the 2009 international conference on Robotics and biomimetics
Frequency adjusted multi-agent Q-learning

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Using graph analysis to study networks of adaptive agent

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Evolving policy geometry for scalable multiagent learning

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Learning hybridization strategies in evolutionary algorithms

Intelligent Data Analysis
Coordinated learning in multiagent MDPs with infinite state-space

Autonomous Agents and Multi-Agent Systems
The Dynamics of Multi-Agent Reinforcement Learning

Proceedings of the 2010 conference on ECAI 2010: 19th European Conference on Artificial Intelligence
Learning opponent's beliefs via fuzzy constraint-directed approach to make effective agent negotiation

Applied Intelligence
Convergence of probability collectives with adaptive choice of temperature parameters

LION'10 Proceedings of the 4th international conference on Learning and intelligent optimization
The world of independent learners is not markovian

International Journal of Knowledge-based and Intelligent Engineering Systems
Sequential targeted optimality as a new criterion for teaching and following in repeated games

The 10th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
Evolving equilibrium policies for a multiagent reinforcement learning problem with state attractors

ICCCI'11 Proceedings of the Third international conference on Computational collective intelligence: technologies and applications - Volume Part II
Social welfare for automatic innovation

MATES'11 Proceedings of the 9th German conference on Multiagent system technologies
Market self-organization under limited information

CAEPIA'11 Proceedings of the 14th international conference on Advances in artificial intelligence: spanish association for artificial intelligence
Trust model architecture: defining prejudice by learning

TrustBus'06 Proceedings of the Third international conference on Trust, Privacy, and Security in Digital Business
Exploiting based pre-testing in competition environment

PRIMA'06 Proceedings of the 9th Pacific Rim international conference on Agent Computing and Multi-Agent Systems
A momentum-based approach to learning nash equilibria

PRIMA'06 Proceedings of the 9th Pacific Rim international conference on Agent Computing and Multi-Agent Systems
Recursive adaptation of stepsize parameter for non-stationary environments

ALA'09 Proceedings of the Second international conference on Adaptive and Learning Agents
Adaption of stepsize parameter using newton's method

PRIMA'11 Proceedings of the 14th international conference on Agents in Principle, Agents in Practice
An overview of cooperative and competitive multiagent learning

LAMAS'05 Proceedings of the First international conference on Learning and Adaption in Multi-Agent Systems
Learning pareto-optimal solutions in 2x2 conflict games

LAMAS'05 Proceedings of the First international conference on Learning and Adaption in Multi-Agent Systems
Unifying convergence and no-regret in multiagent learning

LAMAS'05 Proceedings of the First international conference on Learning and Adaption in Multi-Agent Systems
A probability collectives approach with a feasibility-based rule for constrained optimization

Applied Computational Intelligence and Soft Computing
Centralized and distributed task allocation in multi-robot teams via a stochastic clustering auction

ACM Transactions on Autonomous and Adaptive Systems (TAAS)
Rewards for pairs of Q-learning agents conducive to turn-taking in medium-access games

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
An extension of a hierarchical reinforcement learning algorithm for multiagent settings

EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Comparative evaluation of MAL algorithms in a diverse set of ad hoc team problems

Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Just add Pepper: extending learning algorithms for repeated matrix games to repeated Markov games

Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
When speed matters in learning against adversarial opponents

Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 3
A common gradient in multi-agent reinforcement learning

Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 3
Learning to achieve socially optimal solutions in general-sum games

PRICAI'12 Proceedings of the 12th Pacific Rim international conference on Trends in Artificial Intelligence
Multirobot behavior synchronization through direct neural network communication

ICIRA'12 Proceedings of the 5th international conference on Intelligent Robotics and Applications - Volume Part II
Promoting cooperation in service-oriented MAS through social plasticity and incentives

Journal of Systems and Software
Continuous strategy replicator dynamics for multi-agent Q-learning

Autonomous Agents and Multi-Agent Systems
Multi-agent learning and the reinforcement gradient

EUMAS'11 Proceedings of the 9th European conference on Multi-Agent Systems
Norm Emergence with Biased Agents

International Journal of Agent Technologies and Systems
A Tensor Factorization Approach to Generalization in Multi-agent Reinforcement Learning

WI-IAT '12 Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 02
Performance of distributed multi-agent multi-state reinforcement spectrum management using different exploration schemes

Expert Systems with Applications: An International Journal
Emergence of social norms through collective learning in networked agent societies

Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Modeling non-stationary opponents

Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Machine learning for interactive systems and robots: a brief introduction

Proceedings of the 2nd Workshop on Machine Learning for Interactive Systems: Bridging the Gap Between Perception, Action and Communication
Achieving Socially Optimal Outcomes in Multiagent Systems with Reinforcement Social Learning

ACM Transactions on Autonomous and Adaptive Systems (TAAS)
A reinforcement learning-based routing for delay tolerant networks

Engineering Applications of Artificial Intelligence
Strategic interactions among agents with bounded rationality

IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
A personalized QoE-aware handover decision based on distributed reinforcement learning

Wireless Networks
Multiagent meta-level control for radar coordination

Web Intelligence and Agent Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

Learning to act in a multiagent environment is a difficult problem since the normal definition of an optimal policy no longer applies. The optimal policy at any moment depends on the policies of the other agents. This creates a situation of learning a moving target. Previous learning algorithms have one of two shortcomings depending on their approach. They either converge to a policy that may not be optimal against the specific opponents' policies, or they may not converge at all. In this article we examine this learning problem in the framework of stochastic games. We look at a number of previous learning algorithms showing how they fail at one of the above criteria. We then contribute a new reinforcement learning technique using a variable learning rate to overcome these shortcomings. Specifically, we introduce the WoLF principle, "Win or Learn Fast", for varying the learning rate. We examine this technique theoretically, proving convergence in self-play on a restricted class of iterated matrix games. We also present empirical results on a variety of more general stochastic games, in situations of self-play and otherwise, demonstrating the wide applicability of this method.