Relative Loss Bounds for Temporal-Difference Learning
Machine Learning
Solving factored MDPs using non-homogeneous partitions
Artificial Intelligence - special issue on planning with uncertainty and incomplete information
Extending XCSF beyond linear approximation
GECCO '05 Proceedings of the 7th annual conference on Genetic and evolutionary computation
Improving generalization in the XCSF classifier system using linear least-squares
GECCO '05 Proceedings of the 7th annual workshop on Genetic and evolutionary computation
A Generalized Kalman Filter for Fixed Point Approximation and Efficient Temporal-Difference Learning
Discrete Event Dynamic Systems
Kernel rewards regression: an information efficient batch policy iteration approach
AIA'06 Proceedings of the 24th IASTED international conference on Artificial intelligence and applications
Performance Loss Bounds for Approximate Value Iteration with State Aggregation
Mathematics of Operations Research
Generalization in the XCSF Classifier System: Analysis, Improvement, and Extension
Evolutionary Computation
Neurocomputing
Proceedings of the 25th international conference on Machine learning
Preconditioned temporal difference learning
Proceedings of the 25th international conference on Machine learning
Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 1
Feed-Forward Learning: Fast Reinforcement Learning of Controllers
IWINAC '07 Proceedings of the 2nd international work-conference on Nature Inspired Problem-Solving Methods in Knowledge Engineering: Interplay Between Natural and Artificial Computation, Part II
New Error Bounds for Approximations from Projected Linear Equations
Recent Advances in Reinforcement Learning
Kernelized value function approximation for reinforcement learning
ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
Learning Representation and Control in Markov Decision Processes: New Frontiers
Foundations and Trends® in Machine Learning
Incremental least-squares temporal difference learning
AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1
Hybrid least-squares algorithms for approximate policy evaluation
Machine Learning
Efficient reinforcement learning using recursive least-squares methods
Journal of Artificial Intelligence Research
Natural actor-critic algorithms
Automatica (Journal of IFAC)
Efficient skill learning using abstraction selection
IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
Model-based least-squares policy evaluation
AI'03 Proceedings of the 16th Canadian society for computational studies of intelligence conference on Advances in artificial intelligence
Q-learning with linear function approximation
COLT'07 Proceedings of the 20th annual conference on Learning theory
Impedance learning for robotic contact tasks using natural actor-critic algorithm
IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Error Bounds for Approximations from Projected Linear Equations
Mathematics of Operations Research
Monte Carlo matrix inversion policy evaluation
UAI'03 Proceedings of the Nineteenth conference on Uncertainty in Artificial Intelligence
ECML'05 Proceedings of the 16th European conference on Machine Learning
Q-error as a selection mechanism in modular reinforcement-learning systems
IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Two
Two-step gradient-based reinforcement learning for underwater robotics behavior learning
Robotics and Autonomous Systems
Finite-sample analysis of least-squares policy iteration
The Journal of Machine Learning Research
Better generalization with forecasts
IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Hi-index | 0.00 |