Technical Note: \cal Q-Learning
Machine Learning
Introduction to Reinforcement Learning
Introduction to Reinforcement Learning
Mathematics of Information and Coding
Mathematics of Information and Coding
Average-Reward Reinforcement Learning for Variance Penalized Markov Decision Problems
ICML '01 Proceedings of the Eighteenth International Conference on Machine Learning
A Mathematical Theory of Communication
A Mathematical Theory of Communication
Hi-index | 0.00 |
In this paper, we propose a new measure within the framework of reinforcement learning, by describing a model of an information source as a representation of a learning process. We confirm in experiments that Lempel-Ziv coding for a string of episode sequences provides a quality measure to describe the degree of complexity for learning. In addition, we discuss functions comparing expected return and its variance.