Switching between different state representations in reinforcement learning

Authors:
Harm van Seijen;Bram Bakker;Leon Kester
Affiliations:
Integrated Systems, TNO Defense, Safety and Security and University of Amsterdam;University of Amsterdam;Integrated Systems, TNO Defense, Safety and Security and University of Amsterdam
Venue:
AIA '08 Proceedings of the 26th IASTED International Conference on Artificial Intelligence and Applications
Year:
2008

Citing 8
Cited 0

Practical Issues in Temporal Difference Learning

Machine Learning
HQ-learning

Adaptive Behavior
Between MDPs and semi-MDPs: a framework for temporal abstraction in reinforcement learning

Artificial Intelligence
Introduction to Reinforcement Learning

Introduction to Reinforcement Learning
Multiple model-based reinforcement learning

Neural Computation
Adaptive mixtures of local experts

Neural Computation
Foundations and Applications of Sensor Management

Foundations and Applications of Sensor Management
Hierarchical reinforcement learning with the MAXQ value function decomposition

Journal of Artificial Intelligence Research

Quantified Score

Hi-index	0.00

Visualization

Abstract

The aim of this paper is to devise a new PC-algorithm (partial correlation), uPC-algorithm, for estimating a high dimensional undirected graph associated to a faithful Gaussian Graphical Model. First, we define the separability order ...