Controlling deliberation in a Markov decision process-based agent

Authors:
George Alexander;Anita Raja;David J. Musliner
Affiliations:
The University of North Carolina at Charlotte, Charlotte, NC;The University of North Carolina at Charlotte, Charlotte, NC;Honeywell Laboratories, Minneapolis, MN
Venue:
Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 1
Year:
2008

Citing 11
Cited 3

Principles of metareasoning

Proceedings of the first international conference on Principles of knowledge representation and reasoning
Monitoring and control of anytime algorithms: a dynamic programming approach

Artificial Intelligence - special issue on computational tradeoffs under bounded resources
Introduction to Reinforcement Learning

Introduction to Reinforcement Learning
Neuro-Dynamic Programming

Neuro-Dynamic Programming
Metacognition in computation: a selected research review

Artificial Intelligence
A framework for meta-level control in multi-agent systems

Autonomous Agents and Multi-Agent Systems
Solving large TÆMS problems efficiently by selective exploration and decomposition

Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Using performance profile trees to improve deliberation control

AAAI'04 Proceedings of the 19th national conference on Artifical intelligence
Definition and complexity of some basic metareasoning problems

IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Managing online self-adaptation in real-time environments

IWSAS'01 Proceedings of the 2nd international conference on Self-adaptive software: applications
On the complexity of solving Markov decision problems

UAI'95 Proceedings of the Eleventh conference on Uncertainty in artificial intelligence

Determining the value of information for collaborative multi-agent planning

Autonomous Agents and Multi-Agent Systems
Using conflict resolution to inform decentralized learning

Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Multiagent meta-level control for radar coordination

Web Intelligence and Agent Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

Meta-level control manages the allocation of limited resources to deliberative actions. This paper discusses efforts in adding meta-level control capabilities to a Markov Decision Process (MDP)-based scheduling agent. The agent's reasoning process involves continuous partial unrolling of the MDP state space and periodic reprioritization of the states to be expanded. The meta-level controller makes situation-specific decisions on when the agent should stop unrolling in order to derive a partial policy while bounding the costs of state reprioritization. The described approach uses performance profiling combined with multi-level strategies in its decision making. We present results showing the performance advantage of dynamic meta-level control for this complex agent.