Solving factored MDPs with hybrid state and action variables

  • Authors:
  • Branislav Kveton;Milos Hauskrecht;Carlos Guestrin

  • Affiliations:
  • Intelligent Systems Program, University of Pittsburgh, Pittsburgh, PA;Department of Computer Science, University of Pittsburgh, Pittsburgh, PA;Machine Learning Department and Computer Science Department, Carnegie Mellon University, Pittsburgh, PA

  • Venue:
  • Journal of Artificial Intelligence Research
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Efficient representations and solutions for large decision problems with continuous and discrete variables are among the most important challenges faced by the designers of automated decision support systems. In this paper, we describe a novel hybrid factored Markov decision process (MDP) model that allows for a compact representation of these problems, and a new hybrid approximate linear programming (HALP) framework that permits their efficient solutions. The central idea of HALP is to approximate the optimal value function by a linear combination of basis functions and optimize its weights by linear programming. We analyze both theoretical and computational aspects of this approach, and demonstrate its scale-up potential on several hybrid optimization problems.