TTree: Tree-Based State Generalization with Temporally Abstract Actions

Authors:
William T. B. Uther;Manuela M. Veloso
Affiliations:
-;-
Venue:
Proceedings of the 5th International Symposium on Abstraction, Reformulation and Approximation
Year:
2002

Citing 12
Cited 3

Prioritized Sweeping: Reinforcement Learning with Less Data and Less Time

Machine Learning
Tree based discretization for continuous state space reinforcement learning

AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Reinforcement learning with hierarchies of machines

NIPS '97 Proceedings of the 1997 conference on Advances in neural information processing systems 10
Markov Decision Processes: Discrete Stochastic Dynamic Programming

Markov Decision Processes: Discrete Stochastic Dynamic Programming
Automatic Discovery of Subgoals in Reinforcement Learning using Diverse Density

ICML '01 Proceedings of the Eighteenth International Conference on Machine Learning
The MAXQ Method for Hierarchical Reinforcement Learning

ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
Intra-Option Learning about Temporally Abstract Actions

ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
PEGASUS: A policy search method for large MDPs and POMDPs

UAI '00 Proceedings of the 16th Conference on Uncertainty in Artificial Intelligence
Variable resolution discretization for high-accuracy solutions of optimal control problems

IJCAI'99 Proceedings of the 16th international joint conference on Artificial intelligence - Volume 2
Input generalization in delayed reinforcement learning: an algorithm and performance comparisons

IJCAI'91 Proceedings of the 12th international joint conference on Artificial intelligence - Volume 2
The lumberjack algorithm for learning linked decision forests

PRICAI'00 Proceedings of the 6th Pacific Rim international conference on Artificial intelligence
Generating hierarchical structure in reinforcement learning from state variables

PRICAI'00 Proceedings of the 6th Pacific Rim international conference on Artificial intelligence

Advice generation from observed execution: abstract Markov decision process learning

AAAI'04 Proceedings of the 19th national conference on Artifical intelligence
Task allocation learning in a multiagent environment: Application to the RoboCupRescue simulation

Multiagent and Grid Systems
Q-Tree: automatic construction of hierarchical state representation for reinforcement learning

ICIRA'12 Proceedings of the 5th international conference on Intelligent Robotics and Applications - Volume Part III

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper we describe the Trajectory Tree, or TTree, algorithm. TTree uses a small set of supplied policies to help solve a Semi-Markov Decision Problem (SMDP). The algorithm uses a learned tree based discretization of the state space as an abstract state description and both user supplied and autogenerated policies as temporally abstract actions. It uses a generative model of the world to sample the transition function for the abstract SMDP defined by those state and temporal abstractions, and then finds a policy for that abstract SMDP. This policy for the abstract SMDP can then be mapped back to a policy for the base SMDP, solving the supplied problem. In this paper we present the TTree algorithm and give empirical comparisons to other SMDP algorithms showing its effectiveness.