Context-specific multiagent coordination and planning with factored MDPs

Authors:
Carlos Guestrin;Shobha Venkataraman;Daphne Koller
Affiliations:
Computer Science Dept. Stanford University;Computer Science Dept. Stanford University;Computer Science Dept. Stanford University
Venue:
Eighteenth national conference on Artificial intelligence
Year:
2002


Abstract

We present an algorithm for coordinated decision making in cooperative multiagent settings, where the agents' value function can be represented as a sum of context-specific value rules. The task of finding an optimal joint action in this setting leads to an algorithm where the coordination structure between agents depends on the current state of the system and even on the actual numerical values assigned to the value rules. We apply this framework to the task of multiagent planning in dynamic systems, showing how a joint value function of the associated Markov Decision Process can be approximated as a set of value rules using an efficient linear programming algorithm. The agents then apply the coordination graph algorithm at each iteration of the process to decide on the highest-value joint action, potentially leading to a different coordination pattern at each step of the plan.
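
As a rough illustration of the idea of a value function written as a sum of context-specific value rules, the following Python sketch may help. It is not the paper's algorithm: the paper maximizes over joint actions with a coordination graph, whereas this sketch simply enumerates all joint actions. The rules, their numerical values, the variable names (`x`, `a1`, `a2`), and the helper functions are all hypothetical and chosen only to show that which agents need to coordinate can change with the state.

```python
from itertools import product

# Hypothetical value rules (context, value). A rule contributes its value
# only when every variable in its context takes the listed assignment.
RULES = [
    ({"a1": 1}, 1.0),                      # agent 1 acting alone
    ({"a2": 1}, 0.5),                      # agent 2 acting alone
    ({"x": 1, "a1": 1, "a2": 1}, -3.0),    # penalty for both acting when x = 1
]
AGENTS = ["a1", "a2"]  # binary action variables, one per agent (assumed)


def consistent(context, assignment):
    """True if the assignment agrees with every variable in the context."""
    return all(assignment.get(var) == val for var, val in context.items())


def joint_value(state, action):
    """Sum the values of all rules consistent with the state and joint action."""
    assignment = {**state, **action}
    return sum(value for ctx, value in RULES if consistent(ctx, assignment))


def best_joint_action(state):
    """Exhaustive search over joint actions (a stand-in for the paper's method)."""
    candidates = (dict(zip(AGENTS, vals))
                  for vals in product([0, 1], repeat=len(AGENTS)))
    return max(candidates, key=lambda a: joint_value(state, a))


def coordination_edges(state):
    """Pairs of agents that share a rule whose state context holds in `state`."""
    edges = set()
    for ctx, _ in RULES:
        state_part_holds = all(state.get(v) == val
                               for v, val in ctx.items() if v not in AGENTS)
        if state_part_holds:
            agents = sorted(v for v in ctx if v in AGENTS)
            edges.update((p, q) for i, p in enumerate(agents)
                         for q in agents[i + 1:])
    return sorted(edges)


for x in (0, 1):
    s = {"x": x}
    print(s, coordination_edges(s), best_joint_action(s))
```

In this toy setting the two agents can choose independently when `x = 0`, because no active rule mentions both of them, while in state `x = 1` the joint penalty rule links their choices. Enumeration scales exponentially in the number of agents, which is the cost a structured maximization is meant to avoid.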