Learning and value function approximation in complex decision processes

Authors:
Benjamin Van Roy;John N. Tsitsiklis
Affiliations:
-;-
Venue:
Learning and value function approximation in complex decision processes
Year:
1998

Citing 0
Cited 16

On Average Versus Discounted Reward Temporal-Difference Learning

Machine Learning
Solving factored MDPs with continuous and discrete variables

UAI '04 Proceedings of the 20th conference on Uncertainty in artificial intelligence
A Generalized Kalman Filter for Fixed Point Approximation and Efficient Temporal-Difference Learning

Discrete Event Dynamic Systems
Autonomous shaping: knowledge transfer in reinforcement learning

ICML '06 Proceedings of the 23rd international conference on Machine learning
A New Complexity Result on Solving the Markov Decision Problem

Mathematics of Operations Research
Performance Loss Bounds for Approximate Value Iteration with State Aggregation

Mathematics of Operations Research
Valuing pilot projects in a learning by investing framework: An approximate dynamic programming approach

Computers and Operations Research
Analyzing feature generation for value-function approximation

Proceedings of the 24th international conference on Machine learning
Learning Representation and Control in Markov Decision Processes: New Frontiers

Foundations and Trends® in Machine Learning
Efficient solution algorithms for factored MDPs

Journal of Artificial Intelligence Research
Computing factored value functions for policies in structured MDPs

IJCAI'99 Proceedings of the 16th international joint conference on Artificial intelligence - Volume 2
On Regression-Based Stopping Times

Discrete Event Dynamic Systems
Cultivating desired behaviour: policy teaching via environment-dynamics tweaks

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Value function approximation in zero-sum markov games

UAI'02 Proceedings of the Eighteenth conference on Uncertainty in artificial intelligence
Policy iteration for factored MDPs

UAI'00 Proceedings of the Sixteenth conference on Uncertainty in artificial intelligence
PEGASUS: a policy search method for large MDPs and POMDPs

UAI'00 Proceedings of the Sixteenth conference on Uncertainty in artificial intelligence

Quantified Score

Hi-index	0.00

Visualization

Abstract