Probabilistic reasoning in intelligent systems: networks of plausible inference
Probabilistic reasoning in intelligent systems: networks of plausible inference
Technical Note: \cal Q-Learning
Machine Learning
Learning to coordinate without sharing information
AAAI '94 Proceedings of the twelfth national conference on Artificial intelligence (vol. 1)
Temporal difference learning and TD-Gammon
Communications of the ACM
The dynamics of reinforcement learning in cooperative multiagent systems
AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Multiagent systems: a modern approach to distributed artificial intelligence
Multiagent systems: a modern approach to distributed artificial intelligence
Markov Decision Processes: Discrete Stochastic Dynamic Programming
Markov Decision Processes: Discrete Stochastic Dynamic Programming
Introduction to Reinforcement Learning
Introduction to Reinforcement Learning
Neuro-Dynamic Programming
Nonserial Dynamic Programming
ICML '99 Proceedings of the Sixteenth International Conference on Machine Learning
Planning, Learning and Coordination in Multiagent Decision Processes
Proceedings of the Sixth Conference on Theoretical Aspects of Rationality and Knowledge
Learning to Cooperate via Policy Search
UAI '00 Proceedings of the 16th Conference on Uncertainty in Artificial Intelligence
The Complexity of Decentralized Control of Markov Decision Processes
UAI '00 Proceedings of the 16th Conference on Uncertainty in Artificial Intelligence
Coordinated Reinforcement Learning
ICML '02 Proceedings of the Nineteenth International Conference on Machine Learning
Context-specific multiagent coordination and planning with factored MDPs
Eighteenth national conference on Artificial intelligence
Understanding belief propagation and its generalizations
Exploring artificial intelligence in the new millennium
Transition-independent decentralized markov decision processes
AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems
Optimizing information exchange in cooperative multi-agent systems
AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems
Coordination in multiagent reinforcement learning: a Bayesian approach
AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems
Constraint Processing
Distributed Sensor Networks: A Multiagent Perspective
Distributed Sensor Networks: A Multiagent Perspective
Tree consistency and bounds on the performance of the max-product algorithm and its generalizations
Statistics and Computing
Planning under uncertainty in complex structured environments
Planning under uncertainty in complex structured environments
ICML '04 Proceedings of the twenty-first international conference on Machine learning
Preprocessing techniques for accelerating the DCOP algorithm ADOPT
Proceedings of the fourth international joint conference on Autonomous agents and multiagent systems
Adopt: asynchronous distributed constraint optimization with quality guarantees
Artificial Intelligence - Special issue: Distributed constraint satisfaction
Dynamic programming for partially observable stochastic games
AAAI'04 Proceedings of the 19th national conference on Artifical intelligence
The communicative multiagent team decision problem: analyzing teamwork theories and models
Journal of Artificial Intelligence Research
Decentralized control of cooperative systems: categorization and complexity analysis
Journal of Artificial Intelligence Research
Cooperative information sharing to improve distributed learning in multi-agent systems
Journal of Artificial Intelligence Research
Exploiting causal independence in Bayesian network inference
Journal of Artificial Intelligence Research
Loopy belief propagation for approximate inference: an empirical study
UAI'99 Proceedings of the Fifteenth conference on Uncertainty in artificial intelligence
A scheme for approximating probabilistic inference
UAI'97 Proceedings of the Thirteenth conference on Uncertainty in artificial intelligence
Loopy belief propagation as a basis for communication in sensor networks
UAI'03 Proceedings of the Nineteenth conference on Uncertainty in Artificial Intelligence
Factor graphs and the sum-product algorithm
IEEE Transactions on Information Theory
Q-value functions for decentralized POMDPs
Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Automated Design of Adaptive Controllers for Modular Robots using Reinforcement Learning
International Journal of Robotics Research
Proceedings of the 1st international conference on Robot communication and coordination
Exploiting locality of interaction in factored Dec-POMDPs
Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 1
Real World Multi-agent Systems: Information Sharing, Coordination and Planning
Logic, Language, and Computation
Multiagent Reinforcement Learning for Urban Traffic Control Using Coordination Graphs
ECML PKDD '08 Proceedings of the 2008 European Conference on Machine Learning and Knowledge Discovery in Databases - Part I
Solving multiagent assignment Markov decision processes
Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
A data mining approach to solve the goal scoring problem
IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Safe and Distributed Kinodynamic Replanning for Vehicular Networks
Mobile Networks and Applications
Learning in groups of traffic signals
Engineering Applications of Artificial Intelligence
When should there be a "Me" in "Team"?: distributed multi-agent optimization under uncertainty
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Multi-policy optimization in self-organizing systems
SOAR'09 Proceedings of the First international conference on Self-organizing architectures
Bounded approximate decentralised coordination via the max-sum algorithm
Artificial Intelligence
Improving space representation in multiagent learning via tile coding
SBIA'10 Proceedings of the 20th Brazilian conference on Advances in artificial intelligence
The world of independent learners is not markovian
International Journal of Knowledge-based and Intelligent Engineering Systems
Ensemble methods for reinforcement learning with function approximation
MCS'11 Proceedings of the 10th international conference on Multiple classifier systems
Journal of Network and Computer Applications
Coordination guided reinforcement learning
Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Decentralized Bayesian reinforcement learning for online agent collaboration
Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Information sharing and searching via collaborative reinforcement learning
SETN'12 Proceedings of the 7th Hellenic conference on Artificial Intelligence: theories and applications
International Journal of Ad Hoc and Ubiquitous Computing
Approximate solutions for factored Dec-POMDPs with many agents
Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Using conflict resolution to inform decentralized learning
Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Distributed relational temporal difference learning
Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Coordinating multi-agent reinforcement learning with limited communication
Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Decentralized semantic coordination via belief propagation
Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Multi-objective variable elimination for collaborative graphical games
Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Graphical models in continuous domains for multiagent reinforcement learning
Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Reinforcement learning models for scheduling in wireless networks
Frontiers of Computer Science: Selected Publications from Chinese Universities
Engineering Applications of Artificial Intelligence
Agent-based decentralised coordination for sensor networks using the max-sum algorithm
Autonomous Agents and Multi-Agent Systems
Multiagent meta-level control for radar coordination
Web Intelligence and Agent Systems
Hi-index | 0.00 |
In this article we describe a set of scalable techniques for learning the behavior of a group of agents in a collaborative multiagent setting. As a basis we use the framework of coordination graphs of Guestrin, Koller, and Parr (2002a) which exploits the dependencies between agents to decompose the global payoff function into a sum of local terms. First, we deal with the single-state case and describe a payoff propagation algorithm that computes the individual actions that approximately maximize the global payoff function. The method can be viewed as the decision-making analogue of belief propagation in Bayesian networks. Second, we focus on learning the behavior of the agents in sequential decision-making tasks. We introduce different model-free reinforcement-learning techniques, unitedly called Sparse Cooperative Q-learning, which approximate the global action-value function based on the topology of a coordination graph, and perform updates using the contribution of the individual agents to the maximal global action value. The combined use of an edge-based decomposition of the action-value function and the payoff propagation algorithm for efficient action selection, result in an approach that scales only linearly in the problem size. We provide experimental evidence that our method outperforms related multiagent reinforcement-learning methods based on temporal differences.