Least-Squares Temporal Difference Learning

Authors:
Justin A. Boyan
Affiliations:
-
Venue:
ICML '99 Proceedings of the Sixteenth International Conference on Machine Learning
Year:
1999

Citing 0
Cited 33

Relative Loss Bounds for Temporal-Difference Learning

Machine Learning
Solving factored MDPs using non-homogeneous partitions

Artificial Intelligence - special issue on planning with uncertainty and incomplete information
Extending XCSF beyond linear approximation

GECCO '05 Proceedings of the 7th annual conference on Genetic and evolutionary computation
Improving generalization in the XCSF classifier system using linear least-squares

GECCO '05 Proceedings of the 7th annual workshop on Genetic and evolutionary computation
A Generalized Kalman Filter for Fixed Point Approximation and Efficient Temporal-Difference Learning

Discrete Event Dynamic Systems
Kernel rewards regression: an information efficient batch policy iteration approach

AIA'06 Proceedings of the 24th IASTED international conference on Artificial intelligence and applications
Performance Loss Bounds for Approximate Value Iteration with State Aggregation

Mathematics of Operations Research
Generalization in the XCSF Classifier System: Analysis, Improvement, and Extension

Evolutionary Computation
Natural Actor-Critic

Neurocomputing
An analysis of linear models, linear value-function approximation, and feature selection for reinforcement learning

Proceedings of the 25th international conference on Machine learning
Preconditioned temporal difference learning

Proceedings of the 25th international conference on Machine learning
Sigma point policy iteration

Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 1
Feed-Forward Learning: Fast Reinforcement Learning of Controllers

IWINAC '07 Proceedings of the 2nd international work-conference on Nature Inspired Problem-Solving Methods in Knowledge Engineering: Interplay Between Natural and Artificial Computation, Part II
Simulation-Based Optimization Algorithms for Finite-Horizon Markov Decision Processes

Simulation
New Error Bounds for Approximations from Projected Linear Equations

Recent Advances in Reinforcement Learning
Kernelized value function approximation for reinforcement learning

ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
Learning Representation and Control in Markov Decision Processes: New Frontiers

Foundations and Trends® in Machine Learning
Incremental least-squares temporal difference learning

AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1
Hybrid least-squares algorithms for approximate policy evaluation

Machine Learning
Efficient reinforcement learning using recursive least-squares methods

Journal of Artificial Intelligence Research
Natural actor-critic algorithms

Automatica (Journal of IFAC)
Efficient skill learning using abstraction selection

IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
Model-based least-squares policy evaluation

AI'03 Proceedings of the 16th Canadian society for computational studies of intelligence conference on Advances in artificial intelligence
Q-learning with linear function approximation

COLT'07 Proceedings of the 20th annual conference on Learning theory
Impedance learning for robotic contact tasks using natural actor-critic algorithm

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Error Bounds for Approximations from Projected Linear Equations

Mathematics of Operations Research
Continuous state/action reinforcement learning: A growing self-organizing map approach

Neurocomputing
Monte Carlo matrix inversion policy evaluation

UAI'03 Proceedings of the Nineteenth conference on Uncertainty in Artificial Intelligence
Natural actor-critic

ECML'05 Proceedings of the 16th European conference on Machine Learning
Q-error as a selection mechanism in modular reinforcement-learning systems

IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Two
Two-step gradient-based reinforcement learning for underwater robotics behavior learning

Robotics and Autonomous Systems
Finite-sample analysis of least-squares policy iteration

The Journal of Machine Learning Research
Better generalization with forecasts

IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence

Quantified Score

Hi-index	0.00

Least-Squares Temporal Difference Learning

Quantified Score

Visualization

Abstract