Planning, learning and coordination in multiagent decision processes

Authors:
Craig Boutilier
Affiliations:
University of British Columbia, Vancouver, Canada
Venue:
TARK '96 Proceedings of the 6th conference on Theoretical aspects of rationality and knowledge
Year:
1996

Citing 24
Cited 44

A model for reasoning about persistence and causation

Computational Intelligence
Integrated architecture for learning, planning, and reacting based on approximating dynamic programming

Proceedings of the seventh international conference (1990) on Machine learning
Planning and control

Planning and control
Technical Note: \cal Q-Learning

Machine Learning
The Convergence of TD(λ) for General λ

Machine Learning
Learning in embedded systems

Learning in embedded systems
An adaptive communication protocol for cooperating mobile robots

Proceedings of the second international conference on From animals to animats 2 : simulation of adaptive behavior: simulation of adaptive behavior
Divide and conquer in multi-agent planning

AAAI '94 Proceedings of the twelfth national conference on Artificial intelligence (vol. 1)
Prioritized Sweeping: Reinforcement Learning with Less Data and Less Time

Machine Learning
Learning to coordinate without sharing information

AAAI '94 Proceedings of the twelfth national conference on Artificial intelligence (vol. 1)
Using abstractions for decision-theoretic planning with time constraints

AAAI'94 Proceedings of the twelfth national conference on Artificial intelligence (vol. 2)
Acting optimally in partially observable stochastic domains

AAAI'94 Proceedings of the twelfth national conference on Artificial intelligence (vol. 2)
Control strategies for a stochastic planner

AAAI'94 Proceedings of the twelfth national conference on Artificial intelligence (vol. 2)
Learning to act using real-time dynamic programming

Artificial Intelligence - Special volume on computational research on interaction and agency, part 1
The Parti-game Algorithm for Variable Resolution Reinforcement Learning in Multidimensional State-spaces

Machine Learning
Markov Decision Processes: Discrete Stochastic Dynamic Programming

Markov Decision Processes: Discrete Stochastic Dynamic Programming
Learning to Predict by the Methods of Temporal Differences

Machine Learning
Feudal Reinforcement Learning

Advances in Neural Information Processing Systems 5, [NIPS Conference]
Multiagent Coordination with Learning Classifier Systems

IJCAI '95 Proceedings of the Workshop on Adaption and Learning in Multi-Agent Systems
Dynamic Programming

Dynamic Programming
Probabilistic robot navigation in partially observable environments

IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 2
Process-oriented planning and average-reward optimality

IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 2
Exploiting structure in policy construction

IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 2
Decomposition techniques for planning in stochastic domains

IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 2

The dynamics of reinforcement learning in cooperative multiagent systems

AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Learning joint coordinated plans in multi-agent systems

IEA/AIE'2003 Proceedings of the 16th international conference on Developments in applied artificial intelligence
Optimal on-line scheduling in stochastic multiagent systems in continuous space-time

Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Q-value functions for decentralized POMDPs

Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Aligning social welfare and agent preferences to alleviate traffic congestion

Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 2
Emerging coordination in infinite team Markov games

Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 1
Interaction-driven Markov games for decentralized multiagent planning under uncertainty

Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 1
Using the Simulated Annealing Algorithm for Multiagent Decision Making

RoboCup 2006: Robot Soccer World Cup X
An Experimental Study of Anticipation in Simple Robot Navigation

Anticipatory Behavior in Adaptive Learning Systems
VWM: An Improvement to Multiagent Coordination in Highly Dynamic Environments

MATES '07 Proceedings of the 5th German conference on Multiagent System Technologies
Commitment-based service coordination

International Journal of Agent-Oriented Software Engineering
Efficient metadeliberation auctions

AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 1
Partial-order planning with concurrent interacting actions

Journal of Artificial Intelligence Research
The communicative multiagent team decision problem: analyzing teamwork theories and models

Journal of Artificial Intelligence Research
Hybrid BDI-POMDP framework for multiagent teaming

Journal of Artificial Intelligence Research
Graphical model inference in optimal control of stochastic multi-agent systems

Journal of Artificial Intelligence Research
Optimal and approximate Q-value functions for decentralized POMDPs

Journal of Artificial Intelligence Research
Taming decentralized POMDPs: towards efficient policy computation for multiagent settings

IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Multi-agent systems by incremental gradient reinforcement learning

IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2
Research on improvement of model-free average reward reinforcement learning and its simulation experiment

CCDC'09 Proceedings of the 21st annual international conference on Chinese control and decision conference
Learning coordination in RoboCupRescue

AI'03 Proceedings of the 16th Canadian society for computational studies of intelligence conference on Advances in artificial intelligence
Optimal convergence in multi-agent MDPs

KES'07/WIRN'07 Proceedings of the 11th international conference, KES 2007 and XVII Italian workshop on neural networks conference on Knowledge-based intelligent information and engineering systems: Part III
An agent reinforcement learning model based on neural networks

LSMS'07 Proceedings of the Life system modeling and simulation 2007 international conference on Bio-Inspired computational intelligence and applications
Agent-based coordination of human-multirobot teams in complex environments

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: Industry track
Learning multi-agent state space representations

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Coordinated learning in multiagent MDPs with infinite state-space

Autonomous Agents and Multi-Agent Systems
An investigation into mathematical programming for finite horizon decentralized POMDPs

Journal of Artificial Intelligence Research
Planning with concurrent interacting actions

AAAI'97/IAAI'97 Proceedings of the fourteenth national conference on artificial intelligence and ninth conference on Innovative applications of artificial intelligence
Generalized learning automata for multi-agent reinforcement learning

AI Communications - European Workshop on Multi-Agent Systems (EUMAS) 2009
Networks of learning automata and limiting games

ALAMAS'05/ALAMAS'06/ALAMAS'07 Proceedings of the 5th , 6th and 7th European conference on Adaptive and learning agents and multi-agent systems: adaptation and multi-agent learning
Task allocation learning in a multiagent environment: Application to the RoboCupRescue simulation

Multiagent and Grid Systems
Online planning for multi-agent systems with bounded communication

Artificial Intelligence
Decentralized MDPs with sparse interactions

Artificial Intelligence
Learning conventions in multiagent stochastic domains using likelihood estimates

UAI'96 Proceedings of the Twelfth international conference on Uncertainty in artificial intelligence
Coordinating teams in uncertain environments: a hybrid BDI-POMDP approach

ProMAS'04 Proceedings of the Second international conference on Programming Multi-Agent Systems
Desire-space analysis and action selection for multiple dynamic goals

CLIMA'04 Proceedings of the 5th international conference on Computational Logic in Multi-Agent Systems
Learning automata as a basis for multi agent reinforcement learning

LAMAS'05 Proceedings of the First international conference on Learning and Adaption in Multi-Agent Systems
Solving sparse delayed coordination problems in multi-agent reinforcement learning

ALA'11 Proceedings of the 11th international conference on Adaptive and Learning Agents
Online planning for ad hoc autonomous agent teams

IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume One
Value-function reinforcement learning in Markov games

Cognitive Systems Research
Exploiting independent relationships in multiagent systems for coordinated learning

PRICAI'12 Proceedings of the 12th Pacific Rim international conference on Trends in Artificial Intelligence
Simulating UAV Surveillance for Analyzing Impact of Commitments in Multi-Agent Systems

International Journal of Agent Technologies and Systems
Decentralized multi-robot cooperation with auctioned POMDPs

International Journal of Robotics Research
Sufficient plan-time statistics for decentralized POMDPs

IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence

Quantified Score

Hi-index	0.00

Visualization

Abstract

There has been a growing interest in AI in the design of multiagent systems, especially in multiagent cooperative planning. In this paper, we investigate the extent to which methods from single-agent planning and learning can be applied in multiagent settings. We survey a number of different techniques from decision-theoretic planning and reinforcement learning and describe a number of interesting issues that arise with regard to coordinating the policies of individual agents. To this end, we describe multiagent Markov decision processes as a general model in which to frame this discussion. These are special n-person cooperative games in which agents share the same utility function. We discuss coordination mechanisms based on imposed conventions (or social laws) as well as learning methods for coordination. Our focus is on the decomposition of sequential decision processes so that coordination can be learned (or imposed) locally, at the level of individual states. We also discuss the use of structured problem representations and their role in the generalization of learned conventions and in approximation.