Collaborative Multiagent Reinforcement Learning by Payoff Propagation

Authors:
Jelle R. Kok;Nikos Vlassis
Venue:
The Journal of Machine Learning Research
Year:
2006

References (39):

Probabilistic reasoning in intelligent systems: networks of plausible inference
Technical Note: Q-Learning. Machine Learning
Learning to coordinate without sharing information. AAAI '94 Proceedings of the twelfth national conference on Artificial intelligence (vol. 1)
Temporal difference learning and TD-Gammon. Communications of the ACM
The dynamics of reinforcement learning in cooperative multiagent systems. AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Multiagent systems: a modern approach to distributed artificial intelligence
Markov Decision Processes: Discrete Stochastic Dynamic Programming
Introduction to Reinforcement Learning
Neuro-Dynamic Programming
Nonserial Dynamic Programming
Distributed Algorithms for Multi-Robot Observation of Multiple Moving Targets. Autonomous Robots
Scaling Up Agent Coordination Strategies. Computer
Distributed Value Functions. ICML '99 Proceedings of the Sixteenth International Conference on Machine Learning
Planning, Learning and Coordination in Multiagent Decision Processes. Proceedings of the Sixth Conference on Theoretical Aspects of Rationality and Knowledge
Learning to Cooperate via Policy Search. UAI '00 Proceedings of the 16th Conference on Uncertainty in Artificial Intelligence
The Complexity of Decentralized Control of Markov Decision Processes. UAI '00 Proceedings of the 16th Conference on Uncertainty in Artificial Intelligence
Coordinated Reinforcement Learning. ICML '02 Proceedings of the Nineteenth International Conference on Machine Learning
Context-specific multiagent coordination and planning with factored MDPs. Eighteenth national conference on Artificial intelligence
Understanding belief propagation and its generalizations. Exploring artificial intelligence in the new millennium
Transition-independent decentralized Markov decision processes. AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems
Optimizing information exchange in cooperative multi-agent systems. AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems
Coordination in multiagent reinforcement learning: a Bayesian approach. AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems
Constraint Processing
Distributed Sensor Networks: A Multiagent Perspective
Tree consistency and bounds on the performance of the max-product algorithm and its generalizations. Statistics and Computing
Planning under uncertainty in complex structured environments
Sparse cooperative Q-learning. ICML '04 Proceedings of the twenty-first international conference on Machine learning
Preprocessing techniques for accelerating the DCOP algorithm ADOPT. Proceedings of the fourth international joint conference on Autonomous agents and multiagent systems
Adopt: asynchronous distributed constraint optimization with quality guarantees. Artificial Intelligence - Special issue: Distributed constraint satisfaction
Dynamic programming for partially observable stochastic games. AAAI'04 Proceedings of the 19th national conference on Artificial intelligence
The communicative multiagent team decision problem: analyzing teamwork theories and models. Journal of Artificial Intelligence Research
Decentralized control of cooperative systems: categorization and complexity analysis. Journal of Artificial Intelligence Research
Cooperative information sharing to improve distributed learning in multi-agent systems. Journal of Artificial Intelligence Research
Exploiting causal independence in Bayesian network inference. Journal of Artificial Intelligence Research
Loopy belief propagation for approximate inference: an empirical study. UAI'99 Proceedings of the Fifteenth conference on Uncertainty in artificial intelligence
A scheme for approximating probabilistic inference. UAI'97 Proceedings of the Thirteenth conference on Uncertainty in artificial intelligence
Loopy belief propagation as a basis for communication in sensor networks. UAI'03 Proceedings of the Nineteenth conference on Uncertainty in Artificial Intelligence
Using the max-plus algorithm for multiagent decision making in coordination graphs. RoboCup 2005
Factor graphs and the sum-product algorithm. IEEE Transactions on Information Theory

Cited by (33):

Q-value functions for decentralized POMDPs. Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Automated Design of Adaptive Controllers for Modular Robots using Reinforcement Learning. International Journal of Robotics Research
A distributed protocol for safe real-time planning of communicating vehicles with second-order dynamics. Proceedings of the 1st international conference on Robot communication and coordination
Exploiting locality of interaction in factored Dec-POMDPs. Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 1
Real World Multi-agent Systems: Information Sharing, Coordination and Planning. Logic, Language, and Computation
Multiagent Reinforcement Learning for Urban Traffic Control Using Coordination Graphs. ECML PKDD '08 Proceedings of the 2008 European Conference on Machine Learning and Knowledge Discovery in Databases - Part I
Solving multiagent assignment Markov decision processes. Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
A data mining approach to solve the goal scoring problem. IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Safe and Distributed Kinodynamic Replanning for Vehicular Networks. Mobile Networks and Applications
Learning in groups of traffic signals. Engineering Applications of Artificial Intelligence
When should there be a "Me" in "Team"?: distributed multi-agent optimization under uncertainty. Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Multi-policy optimization in self-organizing systems. SOAR'09 Proceedings of the First international conference on Self-organizing architectures
Bounded approximate decentralised coordination via the max-sum algorithm. Artificial Intelligence
Improving space representation in multiagent learning via tile coding. SBIA'10 Proceedings of the 20th Brazilian conference on Advances in artificial intelligence
The world of independent learners is not Markovian. International Journal of Knowledge-based and Intelligent Engineering Systems
A Multi-agent-based voltage control in power systems using distributed reinforcement learning. Simulation
Ensemble methods for reinforcement learning with function approximation. MCS'11 Proceedings of the 10th international conference on Multiple classifier systems
Reinforcement learning for context awareness and intelligence in wireless networks: Review, new features and open issues. Journal of Network and Computer Applications
Coordination guided reinforcement learning. Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Decentralized Bayesian reinforcement learning for online agent collaboration. Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Information sharing and searching via collaborative reinforcement learning. SETN'12 Proceedings of the 7th Hellenic conference on Artificial Intelligence: theories and applications
Security aspects in the cognition cycle of distributed cognitive radio networks: a survey from a multi-agent perspective. International Journal of Ad Hoc and Ubiquitous Computing
Approximate solutions for factored Dec-POMDPs with many agents. Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Using conflict resolution to inform decentralized learning. Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Distributed relational temporal difference learning. Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Coordinating multi-agent reinforcement learning with limited communication. Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Decentralized semantic coordination via belief propagation. Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Multi-objective variable elimination for collaborative graphical games. Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Graphical models in continuous domains for multiagent reinforcement learning. Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Reinforcement learning models for scheduling in wireless networks. Frontiers of Computer Science: Selected Publications from Chinese Universities
Collaborative multi-agent reinforcement learning based on a novel coordination tree frame with dynamic partition. Engineering Applications of Artificial Intelligence
Agent-based decentralised coordination for sensor networks using the max-sum algorithm. Autonomous Agents and Multi-Agent Systems
Multiagent meta-level control for radar coordination. Web Intelligence and Agent Systems

Abstract

In this article we describe a set of scalable techniques for learning the behavior of a group of agents in a collaborative multiagent setting. As a basis we use the framework of coordination graphs of Guestrin, Koller, and Parr (2002a), which exploits the dependencies between agents to decompose the global payoff function into a sum of local terms. First, we deal with the single-state case and describe a payoff propagation algorithm that computes the individual actions that approximately maximize the global payoff function. The method can be viewed as the decision-making analogue of belief propagation in Bayesian networks. Second, we focus on learning the behavior of the agents in sequential decision-making tasks. We introduce different model-free reinforcement-learning techniques, collectively called Sparse Cooperative Q-learning, which approximate the global action-value function based on the topology of a coordination graph, and perform updates using the contribution of the individual agents to the maximal global action value. The combined use of an edge-based decomposition of the action-value function and the payoff propagation algorithm for efficient action selection results in an approach that scales only linearly in the problem size. We provide experimental evidence that our method outperforms related multiagent reinforcement-learning methods based on temporal differences.
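
The payoff propagation step for the single-state case can be illustrated with a small sketch. The abstract does not spell out the message equations, so the code below assumes the usual max-plus form, in which agent i sends neighbor j the message mu_ij(a_j) = max over a_i of [f_ij(a_i, a_j) + sum of mu_ki(a_i) over the other neighbors k of i]. All names (`max_plus`, `edge_payoffs`, `brute_force`) are hypothetical, the payoff tables are random, individual agent payoffs are omitted, and the mean normalization of messages is a choice made for this sketch rather than something taken from the article.

```python
import itertools
import random


def max_plus(n_agents, n_actions, edge_payoffs, n_iters=20):
    """Illustrative max-plus on a coordination graph with edge payoffs only.

    edge_payoffs maps an edge (i, j) to a table with table[a_i][a_j] = f_ij.
    """
    neighbors = {i: [] for i in range(n_agents)}
    for i, j in edge_payoffs:
        neighbors[i].append(j)
        neighbors[j].append(i)

    def f(i, j, a_i, a_j):
        # Look up f_ij(a_i, a_j) regardless of how the edge is stored.
        if (i, j) in edge_payoffs:
            return edge_payoffs[(i, j)][a_i][a_j]
        return edge_payoffs[(j, i)][a_j][a_i]

    # mu[(i, j)][a_j] is the message from agent i to agent j about action a_j.
    mu = {(i, j): [0.0] * n_actions for i in neighbors for j in neighbors[i]}
    for _ in range(n_iters):
        new_mu = {}
        for i, j in mu:
            # Sum of messages into i from every neighbor except j.
            incoming = [sum(mu[(k, i)][a_i] for k in neighbors[i] if k != j)
                        for a_i in range(n_actions)]
            msg = [max(f(i, j, a_i, a_j) + incoming[a_i]
                       for a_i in range(n_actions))
                   for a_j in range(n_actions)]
            mean = sum(msg) / n_actions  # keeps messages bounded on cyclic graphs
            new_mu[(i, j)] = [m - mean for m in msg]
        mu = new_mu

    # Each agent picks the action that maximizes its summed incoming messages.
    joint = []
    for i in range(n_agents):
        g = [sum(mu[(k, i)][a] for k in neighbors[i]) for a in range(n_actions)]
        joint.append(max(range(n_actions), key=g.__getitem__))
    return joint


def global_payoff(edge_payoffs, joint):
    """Global payoff as the sum of the local edge terms."""
    return sum(t[joint[i]][joint[j]] for (i, j), t in edge_payoffs.items())


def brute_force(n_agents, n_actions, edge_payoffs):
    """Exhaustive search over joint actions, for checking small examples."""
    return list(max(itertools.product(range(n_actions), repeat=n_agents),
                    key=lambda joint: global_payoff(edge_payoffs, joint)))


# Example: four agents on a chain 0-1-2-3 with random payoff tables.
rng = random.Random(0)
N_AGENTS, N_ACTIONS = 4, 3
EDGES = {(i, i + 1): [[rng.random() for _ in range(N_ACTIONS)]
                      for _ in range(N_ACTIONS)]
         for i in range(N_AGENTS - 1)}
best = max_plus(N_AGENTS, N_ACTIONS, EDGES)
print(best, global_payoff(EDGES, best))
```

On a tree-structured graph such as this chain, the messages settle after a number of rounds equal to the graph diameter and the selected joint action matches exhaustive search whenever the optimum is unique. On graphs with cycles the same procedure yields only an approximate maximizer, which is consistent with the "approximately maximize" wording of the abstract. Each round costs time proportional to the number of edges, whereas `brute_force` grows exponentially in the number of agents.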