DCOPs meet the realworld: exploring unknown reward matrices with applications to mobile sensor networks

Authors:
Manish Jain;Matthew Taylor;Milind Tambe;Makoto Yokoo
Affiliations:
University of Southern California, Los Angeles, CA;University of Southern California, Los Angeles, CA;University of Southern California, Los Angeles, CA;Kyushu University, Fukuoka, Japan
Venue:
IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
Year:
2009

Citing 11
Cited 17

Introduction to Reinforcement Learning

Introduction to Reinforcement Learning
An analysis and application of distributed constraint satisfaction and optimization algorithms in sensor networks

AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems
Distributed Sensor Networks: A Multiagent Perspective

Distributed Sensor Networks: A Multiagent Perspective
Solving Distributed Constraint Optimization Problems Using Cooperative Mediation

AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 1
A distributed framework for solving the Multiagent Plan Coordination Problem

Proceedings of the fourth international joint conference on Autonomous agents and multiagent systems
Adopt: asynchronous distributed constraint optimization with quality guarantees

Artificial Intelligence - Special issue: Distributed constraint satisfaction
Wireless Communications

Wireless Communications
Decentralised coordination of low-power embedded devices using the max-sum algorithm

Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 2
Robust and self-repairing formation control for swarms of mobile agents

AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 1
Quality guarantees on k-optimal solutions for distributed constraint optimization problems

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
A scalable method for multiagent constraint optimization

IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence

When should there be a "Me" in "Team"?: distributed multi-agent optimization under uncertainty

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Deception in networks of mobile sensing agents

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
BnB-ADOPT: an asynchronous branch-and-bound DCOP algorithm

Journal of Artificial Intelligence Research
Two decades of multiagent teamwork research: past, present, and future

CARE@AI'09/CARE@IAT'10 Proceedings of the CARE@AI 2009 and CARE@IAT 2010 international conference on Collaborative agents - research and development
Decentralized learning in wireless sensor networks

ALA'09 Proceedings of the Second international conference on Adaptive and Learning Agents
Long-term information collection with energy harvesting wireless sensors: a multi-armed bandit based approach

Autonomous Agents and Multi-Agent Systems
Distributed constraint optimization problems related with soft arc consistency

IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Three
Multi-agent coordination: dcops and beyond

IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Three
Stochastic dominance in stochastic DCOPs for risk-sensitive applications

Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Improving BnB-ADOPT+-AC

Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
DCOPs and bandits: exploration and exploitation in decentralised coordination

Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Removing redundant messages in N-ary BnB-ADOPT

Journal of Artificial Intelligence Research
DEECO: an ensemble-based component system

Proceedings of the 16th International ACM Sigsoft symposium on Component-based software engineering
Improved max-sum algorithm for DCOP with n-ary constraints

Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Target to sensor allocation: A hierarchical dynamic Distributed Constraint Optimization approach

Computer Communications
Mitigating multi-path fading in a mobile mesh network

Ad Hoc Networks
Dynamic multiagent load balancing using distributed constraint optimization techniques

Web Intelligence and Agent Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

Buoyed by recent successes in the area of distributed constraint optimization problems (DCOPs), this paper addresses challenges faced when applying DCOPs to real-world domains. Three fundamental challenges must be addressed for a class of real-world domains, requiring novel DCOP algorithms. First, agents may not know the payoff matrix and must explore the environment to determine rewards associated with variable settings. Second, agents may need to maximize total accumulated reward rather than instantaneous final reward. Third, limited time horizons disallow exhaustive exploration of the environment. We propose and implement a set of novel algorithms that combine decision-theoretic exploration approaches with DCOP-mandated coordination. In addition to simulation results, we implement these algorithms on robots, deploying DCOPs on a distributed mobile sensor network.