Learning to coordinate without sharing information
AAAI '94 Proceedings of the twelfth national conference on Artificial intelligence (vol. 1)
Competitive Markov decision processes
Competitive Markov decision processes
On-line learning and the metrical task system problem
COLT '97 Proceedings of the tenth annual conference on Computational learning theory
The dynamics of reinforcement learning in cooperative multiagent systems
AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Gradient descent for general reinforcement learning
Proceedings of the 1998 conference on Advances in neural information processing systems II
Introduction to Reinforcement Learning
Introduction to Reinforcement Learning
Convergence of Gradient Dynamics with a Variable Learning Rate
ICML '01 Proceedings of the Eighteenth International Conference on Machine Learning
Multiagent Reinforcement Learning: Theoretical Framework and an Algorithm
ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
Convergence Problems of General-Sum Multiagent Reinforcement Learning
ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
Nash Convergence of Gradient Dynamics in General-Sum Games
UAI '00 Proceedings of the 16th Conference on Uncertainty in Artificial Intelligence
Reinforcement Learning in POMDP's via Direct Gradient Ascent
ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
Dynamic Programming
Learning in dynamic noncooperative multiagent systems
Learning in dynamic noncooperative multiagent systems
Reinforcement learning: a survey
Journal of Artificial Intelligence Research
Rational and convergent learning in stochastic games
IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2
Convergent Gradient Ascent in General-Sum Games
ECML '02 Proceedings of the 13th European Conference on Machine Learning
Approximation Techniques in Multiagent Learning
Proceedings of the 5th International Symposium on Abstraction, Reformulation and Approximation
Proceedings of the fifteenth annual ACM symposium on Parallel algorithms and architectures
Adaptive policy gradient in multiagent learning
AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems
Multi-agent learning in extensive games with complete information
AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems
Nash q-learning for general-sum stochastic games
The Journal of Machine Learning Research
Communication complexity as a lower bound for learning in games
ICML '04 Proceedings of the twenty-first international conference on Machine learning
The Role of Reactivity in Multiagent Learning
AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 2
Learning to Communicate and Act Using Hierarchical Reinforcement Learning
AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 3
Asymmetric multiagent reinforcement learning
Web Intelligence and Agent Systems
Efficient learning of multi-step best response
Proceedings of the fourth international joint conference on Autonomous agents and multiagent systems
Rapid on-line temporal sequence prediction by an adaptive agent
Proceedings of the fourth international joint conference on Autonomous agents and multiagent systems
Theory of moves learners: towards non-myopic equilibria
Proceedings of the fourth international joint conference on Autonomous agents and multiagent systems
Cooperative Multi-Agent Learning: The State of the Art
Autonomous Agents and Multi-Agent Systems
Learning to compete, compromise, and cooperate in repeated general-sum games
ICML '05 Proceedings of the 22nd international conference on Machine learning
Learning from induced changes in opponent (re)actions in multi-agent games
AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
Learning against multiple opponents
AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
Learning to cooperate in multi-agent social dilemmas
AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
RVσ(t): a unifying approach to performance and convergence in online multiagent learning
AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
Can good learners always compensate for poor learners?
AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
Learning the task allocation game
AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
Learning to commit in repeated games
AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
Resolution-Based Policy Search for Imperfect Information Differential Games
IAT '06 Proceedings of the IEEE/WIC/ACM international conference on Intelligent Agent Technology
Fuzzy Policy Reinforcement Learning in Cooperative Multi-robot Systems
Journal of Intelligent and Robotic Systems
Multi-agent learning model with bargaining
Proceedings of the 38th conference on Winter simulation
Dimensions of complexity of intelligent agents
PCAR '06 Proceedings of the 2006 international symposium on Practical cognitive agents and robots
Gradient descent for symmetric and asymmetric multiagent reinforcement learning
Web Intelligence and Agent Systems
Perspectives on multiagent learning
Artificial Intelligence
Reaching pareto-optimality in prisoner's dilemma using conditional joint action learning
Autonomous Agents and Multi-Agent Systems
Reactivity and Safe Learning in Multi-Agent Systems
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
A layered approach to learning coordination knowledge in multiagent environments
Applied Intelligence
Generalized multiagent learning with performance bound
Autonomous Agents and Multi-Agent Systems
Multiagent reinforcement learning and self-organization in a network of agents
Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Multiagent learning in adaptive dynamic systems
Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Advice taking in multiagent reinforcement learning
Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Emergence of Norms with Biased Interactions in Heterogeneous Agent Societies
WI-IATW '07 Proceedings of the 2007 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology - Workshops
A fuzzy constraint-based agent negotiation with opponent learning
ACOS'07 Proceedings of the 6th Conference on WSEAS International Conference on Applied Computer Science - Volume 6
Fairness in multi-agent systems
The Knowledge Engineering Review
Norm emergence under constrained interactions in diverse societies
Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 2
Artificial agents learning human fairness
Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 2
A few good agents: multi-agent social learning
Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 1
Non-linear dynamics in multiagent reinforcement learning algorithms
Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 3
Using adaptive consultation of experts to improve convergence rates in multiagent learning
Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 3
A Novel Method of Constructing ANN
ISNN '07 Proceedings of the 4th international symposium on Neural Networks: Part II--Advances in Neural Networks
Competition and Coordination in Stochastic Games
CAI '07 Proceedings of the 20th conference of the Canadian Society for Computational Studies of Intelligence on Advances in Artificial Intelligence
A Learning Automata Approach to Multi-agent Policy Gradient Learning
KES '08 Proceedings of the 12th international conference on Knowledge-Based Intelligent Information and Engineering Systems, Part II
ECCBR '08 Proceedings of the 9th European conference on Advances in Case-Based Reasoning
Optimistic-Pessimistic Q-Learning Algorithm for Multi-Agent Systems
MATES '08 Proceedings of the 6th German conference on Multiagent System Technologies
An adaptive policy gradient in learning Nash equilibria
Neurocomputing
Individual and Social Behaviour in the IPA Market with RL
SBIA '08 Proceedings of the 19th Brazilian Symposium on Artificial Intelligence: Advances in Artificial Intelligence
Meta-level Control of Multiagent Learning in Dynamic Repeated Resource Sharing Problems
PRICAI '08 Proceedings of the 10th Pacific Rim International Conference on Artificial Intelligence: Trends in Artificial Intelligence
Stability of learning dynamics in two-agent, imperfect-information games
Proceedings of the tenth ACM SIGEVO workshop on Foundations of genetic algorithms
COOPERATIVE LEARNING BY POLICY-SHARING IN MULTIPLE AGENTS
Cybernetics and Systems
Reinforcement Learning: A Tutorial Survey and Recent Advances
INFORMS Journal on Computing
Learning the IPA market with individual and social rewards
Web Intelligence and Agent Systems
Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
Multi-Agent Reinforcement Learning Algorithm with Variable Optimistic-Pessimistic Criterion
Proceedings of the 2008 conference on ECAI 2008: 18th European Conference on Artificial Intelligence
Learning-Rate Adjusting Q-Learning for Two-Person Two-Action Symmetric Games
KES-AMSTA '09 Proceedings of the Third KES International Symposium on Agent and Multi-Agent Systems: Technologies and Applications
Tentative Exploration on Reinforcement Learning Algorithms for Stochastic Rewards
HAIS '09 Proceedings of the 4th International Conference on Hybrid Artificial Intelligence Systems
Performance bounded reinforcement learning in strategic interactions
AAAI'04 Proceedings of the 19th national conference on Artifical intelligence
Utility based Q-learning to facilitate cooperation in Prisoner's Dilemma games
Web Intelligence and Agent Systems
Efficient no-regret multiagent learning
AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 1
Non-stationary policy learning in 2-player zero sum games
AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 2
RETALIATE: learning winning policies in first-person shooter games
IAAI'07 Proceedings of the 19th national conference on Innovative applications of artificial intelligence - Volume 2
Existence of multiagent equilibria with limited agents
Journal of Artificial Intelligence Research
A multiagent reinforcement learning algorithm with non-linear dynamics
Journal of Artificial Intelligence Research
Emergence of norms through social learning
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Simultaneous adversarial multi-robot learning
IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Learning against opponents with bounded memory
IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
Two-sided bandits and the dating market
IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
Effective learning in the presence of adaptive counterparts
Journal of Algorithms
Anytime Self-play Learning to Satisfy Functional Optimality Criteria
ADT '09 Proceedings of the 1st International Conference on Algorithmic Decision Theory
Adaptive Learning in Systems of Interacting Agents
WINE '09 Proceedings of the 5th International Workshop on Internet and Network Economics
Modeling opponent's beliefs via fuzzy constraint-directed approach in agent negotiation
ICIC'07 Proceedings of the intelligent computing 3rd international conference on Advanced intelligent computing theories and applications
Cooperation between multiple agents based on partially sharing policy
ICIC'07 Proceedings of the intelligent computing 3rd international conference on Advanced intelligent computing theories and applications
Approximation guarantees for fictitious play
Allerton'09 Proceedings of the 47th annual Allerton conference on Communication, control, and computing
Multi-agent reinforcement learning and chimpanzee hunting
ROBIO'09 Proceedings of the 2009 international conference on Robotics and biomimetics
Distributed, heterogeneous, multi-agent social coordination via reinforcement learning
ROBIO'09 Proceedings of the 2009 international conference on Robotics and biomimetics
Frequency adjusted multi-agent Q-learning
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Using graph analysis to study networks of adaptive agent
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Evolving policy geometry for scalable multiagent learning
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Learning hybridization strategies in evolutionary algorithms
Intelligent Data Analysis
Coordinated learning in multiagent MDPs with infinite state-space
Autonomous Agents and Multi-Agent Systems
The Dynamics of Multi-Agent Reinforcement Learning
Proceedings of the 2010 conference on ECAI 2010: 19th European Conference on Artificial Intelligence
Convergence of probability collectives with adaptive choice of temperature parameters
LION'10 Proceedings of the 4th international conference on Learning and intelligent optimization
The world of independent learners is not markovian
International Journal of Knowledge-based and Intelligent Engineering Systems
Sequential targeted optimality as a new criterion for teaching and following in repeated games
The 10th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
Evolving equilibrium policies for a multiagent reinforcement learning problem with state attractors
ICCCI'11 Proceedings of the Third international conference on Computational collective intelligence: technologies and applications - Volume Part II
Social welfare for automatic innovation
MATES'11 Proceedings of the 9th German conference on Multiagent system technologies
Market self-organization under limited information
CAEPIA'11 Proceedings of the 14th international conference on Advances in artificial intelligence: spanish association for artificial intelligence
Trust model architecture: defining prejudice by learning
TrustBus'06 Proceedings of the Third international conference on Trust, Privacy, and Security in Digital Business
Exploiting based pre-testing in competition environment
PRIMA'06 Proceedings of the 9th Pacific Rim international conference on Agent Computing and Multi-Agent Systems
A momentum-based approach to learning nash equilibria
PRIMA'06 Proceedings of the 9th Pacific Rim international conference on Agent Computing and Multi-Agent Systems
Recursive adaptation of stepsize parameter for non-stationary environments
ALA'09 Proceedings of the Second international conference on Adaptive and Learning Agents
Adaption of stepsize parameter using newton's method
PRIMA'11 Proceedings of the 14th international conference on Agents in Principle, Agents in Practice
An overview of cooperative and competitive multiagent learning
LAMAS'05 Proceedings of the First international conference on Learning and Adaption in Multi-Agent Systems
Learning pareto-optimal solutions in 2x2 conflict games
LAMAS'05 Proceedings of the First international conference on Learning and Adaption in Multi-Agent Systems
Unifying convergence and no-regret in multiagent learning
LAMAS'05 Proceedings of the First international conference on Learning and Adaption in Multi-Agent Systems
A probability collectives approach with a feasibility-based rule for constrained optimization
Applied Computational Intelligence and Soft Computing
Centralized and distributed task allocation in multi-robot teams via a stochastic clustering auction
ACM Transactions on Autonomous and Adaptive Systems (TAAS)
Rewards for pairs of Q-learning agents conducive to turn-taking in medium-access games
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
An extension of a hierarchical reinforcement learning algorithm for multiagent settings
EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Comparative evaluation of MAL algorithms in a diverse set of ad hoc team problems
Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Just add Pepper: extending learning algorithms for repeated matrix games to repeated Markov games
Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
When speed matters in learning against adversarial opponents
Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 3
A common gradient in multi-agent reinforcement learning
Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 3
Learning to achieve socially optimal solutions in general-sum games
PRICAI'12 Proceedings of the 12th Pacific Rim international conference on Trends in Artificial Intelligence
Multirobot behavior synchronization through direct neural network communication
ICIRA'12 Proceedings of the 5th international conference on Intelligent Robotics and Applications - Volume Part II
Promoting cooperation in service-oriented MAS through social plasticity and incentives
Journal of Systems and Software
Continuous strategy replicator dynamics for multi-agent Q-learning
Autonomous Agents and Multi-Agent Systems
Multi-agent learning and the reinforcement gradient
EUMAS'11 Proceedings of the 9th European conference on Multi-Agent Systems
Norm Emergence with Biased Agents
International Journal of Agent Technologies and Systems
A Tensor Factorization Approach to Generalization in Multi-agent Reinforcement Learning
WI-IAT '12 Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 02
Expert Systems with Applications: An International Journal
Emergence of social norms through collective learning in networked agent societies
Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Modeling non-stationary opponents
Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Machine learning for interactive systems and robots: a brief introduction
Proceedings of the 2nd Workshop on Machine Learning for Interactive Systems: Bridging the Gap Between Perception, Action and Communication
Achieving Socially Optimal Outcomes in Multiagent Systems with Reinforcement Social Learning
ACM Transactions on Autonomous and Adaptive Systems (TAAS)
A reinforcement learning-based routing for delay tolerant networks
Engineering Applications of Artificial Intelligence
Strategic interactions among agents with bounded rationality
IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Multiagent meta-level control for radar coordination
Web Intelligence and Agent Systems
Hi-index | 0.00 |
Learning to act in a multiagent environment is a difficult problem since the normal definition of an optimal policy no longer applies. The optimal policy at any moment depends on the policies of the other agents. This creates a situation of learning a moving target. Previous learning algorithms have one of two shortcomings depending on their approach. They either converge to a policy that may not be optimal against the specific opponents' policies, or they may not converge at all. In this article we examine this learning problem in the framework of stochastic games. We look at a number of previous learning algorithms showing how they fail at one of the above criteria. We then contribute a new reinforcement learning technique using a variable learning rate to overcome these shortcomings. Specifically, we introduce the WoLF principle, "Win or Learn Fast", for varying the learning rate. We examine this technique theoretically, proving convergence in self-play on a restricted class of iterated matrix games. We also present empirical results on a variety of more general stochastic games, in situations of self-play and otherwise, demonstrating the wide applicability of this method.