Technical Note: \cal Q-Learning
Machine Learning
Learning to coordinate without sharing information
AAAI '94 Proceedings of the twelfth national conference on Artificial intelligence (vol. 1)
The dynamics of reinforcement learning in cooperative multiagent systems
AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Planning, learning and coordination in multiagent decision processes
TARK '96 Proceedings of the 6th conference on Theoretical aspects of rationality and knowledge
Reinforcement learning: a survey
Journal of Artificial Intelligence Research
Learning to coordinate actions in multi-agent systems
IJCAI'93 Proceedings of the 13th international joint conference on Artifical intelligence - Volume 1
Learning conventions in multiagent stochastic domains using likelihood estimates
UAI'96 Proceedings of the Twelfth international conference on Uncertainty in artificial intelligence
The dynamics of reinforcement learning in cooperative multiagent systems
AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
General principles of learning-based multi-agent systems
Proceedings of the third annual conference on Autonomous Agents
Conjectural Equilibrium in Multiagent Learning
Machine Learning
Adaptivity in agent-based routing for data networks
AGENTS '00 Proceedings of the fourth international conference on Autonomous agents
Multiagent learning using a variable learning rate
Artificial Intelligence
A multiagent reinforcement learning algorithm using extended optimal response
Proceedings of the first international joint conference on Autonomous agents and multiagent systems: part 1
Learning sequences of actions in collectives of autonomous agents
Proceedings of the first international joint conference on Autonomous agents and multiagent systems: part 1
Designing agent collectives for systems with markovian dynamics
Proceedings of the first international joint conference on Autonomous agents and multiagent systems: part 3
Learning to select a coordination mechanism
Proceedings of the first international joint conference on Autonomous agents and multiagent systems: part 3
Learning and decision: making for intention reconciliation
Proceedings of the first international joint conference on Autonomous agents and multiagent systems: part 3
A Model of Partially Observable State Game and its Optimality
Applied Intelligence
DQL: A New Updating Strategy for Reinforcement Learning Based on Q-Learning
EMCL '01 Proceedings of the 12th European Conference on Machine Learning
Social Agents Playing a Periodical Policy
EMCL '01 Proceedings of the 12th European Conference on Machine Learning
Convergent Gradient Ascent in General-Sum Games
ECML '02 Proceedings of the 13th European Conference on Machine Learning
Learning to Reach the Pareto Optimal Nash Equilibrium as a Team
AI '02 Proceedings of the 15th Australian Joint Conference on Artificial Intelligence: Advances in Artificial Intelligence
Sequential Strategy for Learning Multi-stage Multi-agent Collaborative Games
ICANN '01 Proceedings of the International Conference on Artificial Neural Networks
Learning Multi-agent Strategies in Multi-stage Collaborative Games
IDEAL '02 Proceedings of the Third International Conference on Intelligent Data Engineering and Automated Learning
Rationality Assumptions and Optimality of Co-learning
PRIMA '00 Proceedings of the Third Pacific Rim International Workshop on Multi-Agents: Design and Applications of Intelligent Agents
Karlsruhe Brainstormers - A Reinforcement Learning Approach to Robotic Soccer
RoboCup 2001: Robot Soccer World Cup V
Proceedings of the workshop on Deception, Fraud, and Trust in Agent Societies held during the Autonomous Agents Conference: Trust in Cyber-societies, Integrating the Human and Artificial Perspectives
Implicit Negotiation in Repeated Games
ATAL '01 Revised Papers from the 8th International Workshop on Intelligent Agents VIII
Reinforcement learning of coordination in cooperative multi-agent systems
Eighteenth national conference on Artificial intelligence
Dispersion games: general definitions and some specific learning results
Eighteenth national conference on Artificial intelligence
Optimizing information exchange in cooperative multi-agent systems
AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems
Resource allocation games with changing resource capacities
AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems
Towards a pareto-optimal solution in general-sum games
AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems
Adaptive policy gradient in multiagent learning
AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems
Coordination in multiagent reinforcement learning: a Bayesian approach
AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems
Nash q-learning for general-sum stochastic games
The Journal of Machine Learning Research
ICML '04 Proceedings of the twenty-first international conference on Machine learning
A multi-agent system integrating reinforcement learning, bidding and genetic algorithms
Web Intelligence and Agent Systems
Learning when and how to coordinate
Web Intelligence and Agent Systems
Best-Response Multiagent Learning in Non-Stationary Environments
AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 2
The Role of Reactivity in Multiagent Learning
AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 2
Coordination through Mutual Notification in Cooperative Multiagent Reinforcement Learning
AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 3
Reinforcement Learning of Coordination in Heterogeneous Cooperative Multi-Agent Systems
AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 3
Reinforcement Learning for Stochastic Cooperative Multi-Agent Systems
AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 3
Efficient learning equilibrium
Artificial Intelligence
Asymmetric multiagent reinforcement learning
Web Intelligence and Agent Systems
A Model of Adaptation in Collaborative Multi-Agent Systems
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Theory of moves learners: towards non-myopic equilibria
Proceedings of the fourth international joint conference on Autonomous agents and multiagent systems
Cooperative Multi-Agent Learning: The State of the Art
Autonomous Agents and Multi-Agent Systems
An Evolutionary Dynamical Analysis of Multi-Agent Learning in Iterated Games
Autonomous Agents and Multi-Agent Systems
Learning against multiple opponents
AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
Selecting informative actions improves cooperative multiagent learning
AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
Lenient learners in cooperative multiagent systems
AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
Can good learners always compensate for poor learners?
AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
Learning the task allocation game
AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
Multi-agent learning model with bargaining
Proceedings of the 38th conference on Winter simulation
Gradient descent for symmetric and asymmetric multiagent reinforcement learning
Web Intelligence and Agent Systems
If multi-agent learning is the answer, what is the question?
Artificial Intelligence
Perspectives on multiagent learning
Artificial Intelligence
Collaborative Multiagent Reinforcement Learning by Payoff Propagation
The Journal of Machine Learning Research
Exploring selfish reinforcement learning in repeated games with stochastic rewards
Autonomous Agents and Multi-Agent Systems
Reactivity and Safe Learning in Multi-Agent Systems
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
A layered approach to learning coordination knowledge in multiagent environments
Applied Intelligence
Generalized multiagent learning with performance bound
Autonomous Agents and Multi-Agent Systems
Theoretical advantages of lenient Q-learners: an evolutionary game theoretic perspective
Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Multiagent learning in adaptive dynamic systems
Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Theoretical Advantages of Lenient Learners: An Evolutionary Game Theoretic Perspective
The Journal of Machine Learning Research
A few good agents: multi-agent social learning
Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 1
Using the Simulated Annealing Algorithm for Multiagent Decision Making
RoboCup 2006: Robot Soccer World Cup X
Competition and Coordination in Stochastic Games
CAI '07 Proceedings of the 20th conference of the Canadian Society for Computational Studies of Intelligence on Advances in Artificial Intelligence
Multi-agent Learning Dynamics: A Survey
CIA '07 Proceedings of the 11th international workshop on Cooperative Information Agents XI
VWM: An Improvement to Multiagent Coordination in Highly Dynamic Environments
MATES '07 Proceedings of the 5th German conference on Multiagent System Technologies
A Learning Automata Approach to Multi-agent Policy Gradient Learning
KES '08 Proceedings of the 12th international conference on Knowledge-Based Intelligent Information and Engineering Systems, Part II
Online Multiagent Learning against Memory Bounded Adversaries
ECML PKDD '08 Proceedings of the 2008 European Conference on Machine Learning and Knowledge Discovery in Databases - Part I
Optimistic-Pessimistic Q-Learning Algorithm for Multi-Agent Systems
MATES '08 Proceedings of the 6th German conference on Multiagent System Technologies
An adaptive policy gradient in learning Nash equilibria
Neurocomputing
Individual and Social Behaviour in the IPA Market with RL
SBIA '08 Proceedings of the 19th Brazilian Symposium on Artificial Intelligence: Advances in Artificial Intelligence
Opportunities for multiagent systems and multiagent reinforcement learning in traffic control
Autonomous Agents and Multi-Agent Systems
Learning the IPA market with individual and social rewards
Web Intelligence and Agent Systems
Dynamic analysis of multiagent Q-learning with ε-greedy exploration
ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
Multiagent learning in large anonymous games
Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
Learning of coordination: exploiting sparse interactions in multiagent systems
Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
Multi-Agent Reinforcement Learning Algorithm with Variable Optimistic-Pessimistic Criterion
Proceedings of the 2008 conference on ECAI 2008: 18th European Conference on Artificial Intelligence
Performance bounded reinforcement learning in strategic interactions
AAAI'04 Proceedings of the 19th national conference on Artifical intelligence
Point-based dynamic programming for DEC-POMDPs
AAAI'06 proceedings of the 21st national conference on Artificial intelligence - Volume 2
Utility based Q-learning to facilitate cooperation in Prisoner's Dilemma games
Web Intelligence and Agent Systems
Efficient no-regret multiagent learning
AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 1
Coordination and adaptation in impromptu teams
AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 1
Optimal efficient learning equilibrium: imperfect monitoring in symmetric games
AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 2
AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 2
Collective intelligence, data routing and braess' paradox
Journal of Artificial Intelligence Research
Learning to Coordinate Efficiently: a model-based approach
Journal of Artificial Intelligence Research
Decentralized control of cooperative systems: categorization and complexity analysis
Journal of Artificial Intelligence Research
Existence of multiagent equilibria with limited agents
Journal of Artificial Intelligence Research
Journal of Artificial Intelligence Research
A multiagent reinforcement learning algorithm with non-linear dynamics
Journal of Artificial Intelligence Research
Predicting and preventing coordination problems in cooperative Q-learning systems
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Improving coevolutionary search for optimal multiagent behaviors
IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Simultaneous adversarial multi-robot learning
IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Reinforcement learning in distributed domains: beyond team games
IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2
Rational and convergent learning in stochastic games
IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2
Learning against opponents with bounded memory
IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
Effective learning in the presence of adaptive counterparts
Journal of Algorithms
A multi-agent learning approach to online distributed resource allocation
IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
ICANN '09 Proceedings of the 19th International Conference on Artificial Neural Networks: Part I
Anytime Self-play Learning to Satisfy Functional Optimality Criteria
ADT '09 Proceedings of the 1st International Conference on Algorithmic Decision Theory
Hybrid Q-learning algorithm about cooperation in MAS
CCDC'09 Proceedings of the 21st annual international conference on Chinese control and decision conference
IROS'09 Proceedings of the 2009 IEEE/RSJ international conference on Intelligent robots and systems
Cooperative multi-robot reinforcement learning: a framework in hybrid state space
IROS'09 Proceedings of the 2009 IEEE/RSJ international conference on Intelligent robots and systems
Learning in groups of traffic signals
Engineering Applications of Artificial Intelligence
Optimal convergence in multi-agent MDPs
KES'07/WIRN'07 Proceedings of the 11th international conference, KES 2007 and XVII Italian workshop on neural networks conference on Knowledge-based intelligent information and engineering systems: Part III
Adaptation in games with many co-evolving agents
EPIA'07 Proceedings of the aritficial intelligence 13th Portuguese conference on Progress in artificial intelligence
Convergence of independent adaptive learners
EPIA'07 Proceedings of the aritficial intelligence 13th Portuguese conference on Progress in artificial intelligence
Towards a taxonomy of decision making problems in multi-agent systems
MATES'09 Proceedings of the 7th German conference on Multiagent system technologies
Reinforcement learning approaches to coordination in cooperative multi-agent systems
Adaptive agents and multi-agent systems
Cooperative learning using advice exchange
Adaptive agents and multi-agent systems
Coevolution of heterogeneous multi-robot teams
Proceedings of the 12th annual conference on Genetic and evolutionary computation
To teach or not to teach?: decision making under uncertainty in ad hoc teams
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Learning multi-agent state space representations
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Evolving policy geometry for scalable multiagent learning
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Action discovery for reinforcement learning
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Coordinated learning in multiagent MDPs with infinite state-space
Autonomous Agents and Multi-Agent Systems
From cognition to docition: The teaching radio paradigm for distributed & autonomous deployments
Computer Communications
Multi-policy optimization in self-organizing systems
SOAR'09 Proceedings of the First international conference on Self-organizing architectures
Social conformity and its convergence for reinforcement learning
MATES'10 Proceedings of the 8th German conference on Multiagent system technologies
Evolutionary dynamics of regret minimization
ECML PKDD'10 Proceedings of the 2010 European conference on Machine learning and knowledge discovery in databases: Part II
Generalized learning automata for multi-agent reinforcement learning
AI Communications - European Workshop on Multi-Agent Systems (EUMAS) 2009
Solving multi-stage games with hierarchical learning automata that bootstrap
ALAMAS'05/ALAMAS'06/ALAMAS'07 Proceedings of the 5th , 6th and 7th European conference on Adaptive and learning agents and multi-agent systems: adaptation and multi-agent learning
Improving space representation in multiagent learning via tile coding
SBIA'10 Proceedings of the 20th Brazilian conference on Advances in artificial intelligence
Theoretical convergence guarantees for cooperative coevolutionary algorithms
Evolutionary Computation
A novel multi-agent reinforcement learning approach for job scheduling in Grid computing
Future Generation Computer Systems
Speeding up learning automata based multi agent systems using the concepts of stigmergy and entropy
Expert Systems with Applications: An International Journal
The world of independent learners is not markovian
International Journal of Knowledge-based and Intelligent Engineering Systems
Decentralized MDPs with sparse interactions
Artificial Intelligence
Multiagent learning in large anonymous games
Journal of Artificial Intelligence Research
Theoretical considerations of potential-based reward shaping for multi-agent systems
The 10th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Learning to cooperate via policy search
UAI'00 Proceedings of the Sixteenth conference on Uncertainty in artificial intelligence
A momentum-based approach to learning nash equilibria
PRIMA'06 Proceedings of the 9th Pacific Rim international conference on Agent Computing and Multi-Agent Systems
Feature extraction for decision-theoretic planning in partially observable environments
ICANN'06 Proceedings of the 16th international conference on Artificial Neural Networks - Volume Part I
Meta-game equilibrium for multi-agent reinforcement learning
AI'04 Proceedings of the 17th Australian joint conference on Advances in Artificial Intelligence
Multi-agent case-based reasoning for cooperative reinforcement learners
ECCBR'06 Proceedings of the 8th European conference on Advances in Case-Based Reasoning
Coordinating learning agents for multiple resource job scheduling
ALA'09 Proceedings of the Second international conference on Adaptive and Learning Agents
Learning automata as a basis for multi agent reinforcement learning
LAMAS'05 Proceedings of the First international conference on Learning and Adaption in Multi-Agent Systems
Learning pareto-optimal solutions in 2x2 conflict games
LAMAS'05 Proceedings of the First international conference on Learning and Adaption in Multi-Agent Systems
The success and failure of tag-mediated evolution of cooperation
LAMAS'05 Proceedings of the First international conference on Learning and Adaption in Multi-Agent Systems
Multi-agent relational reinforcement learning
LAMAS'05 Proceedings of the First international conference on Learning and Adaption in Multi-Agent Systems
ALA'11 Proceedings of the 11th international conference on Adaptive and Learning Agents
Multi-agent reinforcement learning for simulating pedestrian navigation
ALA'11 Proceedings of the 11th international conference on Adaptive and Learning Agents
Heterogeneous populations of learning agents in the minority game
ALA'11 Proceedings of the 11th international conference on Adaptive and Learning Agents
Solving sparse delayed coordination problems in multi-agent reinforcement learning
ALA'11 Proceedings of the 11th international conference on Adaptive and Learning Agents
A brief introduction to agent mining
Autonomous Agents and Multi-Agent Systems
Transfer learning in multi-agent reinforcement learning domains
EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Coordination guided reinforcement learning
Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Comparative evaluation of MAL algorithms in a diverse set of ad hoc team problems
Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Dynamic potential-based reward shaping
Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Combining independent and joint learning: a negotiation based approach
Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 3
GRiDA: GReen Distributed Algorithm for energy-efficient IP backbone networks
Computer Networks: The International Journal of Computer and Telecommunications Networking
Distributed learning of best response behaviors in concurrent iterated many-object negotiations
MATES'12 Proceedings of the 10th German conference on Multiagent System Technologies
Continuous strategy replicator dynamics for multi-agent Q-learning
Autonomous Agents and Multi-Agent Systems
Local coordination in online distributed constraint optimization problems
EUMAS'11 Proceedings of the 9th European conference on Multi-Agent Systems
Orchestrating multiagent learning of penalty games
SBIA'12 Proceedings of the 21st Brazilian conference on Advances in Artificial Intelligence
Expert Systems with Applications: An International Journal
Addressing the policy-bias of q-learning by repeating updates
Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Distributed relational temporal difference learning
Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Coordinating multi-agent reinforcement learning with limited communication
Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Reinforcement learning for decentralized planning under uncertainty
Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Achieving Socially Optimal Outcomes in Multiagent Systems with Reinforcement Social Learning
ACM Transactions on Autonomous and Adaptive Systems (TAAS)
Teaching and leading an ad hoc teammate: Collaboration without pre-coordination
Artificial Intelligence
The dynamics of reinforcement social learning in cooperative multiagent systems
IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Multiagent learning in the presence of memory-bounded agents
Autonomous Agents and Multi-Agent Systems
Hi-index | 0.00 |
Reinforcement learning can provide a robust and natural means for agents to learn how to coordinate their action choices in multi agent systems. We examine some of the factors that can influence the dynamics of the learning process in such a setting. We first distinguish reinforcement learners that are unaware of (or ignore) the presence of other agents from those that explicitly attempt to learn the value of joint actions and the strategies of their counterparts. We study (a simple form of) Q-leaming in cooperative multi agent systems under these two perspectives, focusing on the influence of that game structure and exploration strategies on convergence to (optimal and suboptimal) Nash equilibria. We then propose alternative optimistic exploration strategies that increase the likelihood of convergence to an optimal equilibrium.