An Information-Theoretic Class of Stochastic Decision Processes

  • Authors:
  • Kazunori Iwata

  • Affiliations:
  • -

  • Venue:
  • WI-IAT '08 Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 02
  • Year:
  • 2008

Quantified Score

Hi-index 0.01

Visualization

Abstract

Stochastic decision processes in reinforcement learning are usually formulated as Markov decision processes which are stationary and ergodic. However, in fact, some of the stochastic decision processes are not necessarily Markov, stationary, and/or ergodic. In this paper, using an information-theoretic property, we show a class of stochastic decision processes in reinforcement learning in which return maximization occurs with a positive probability. The class would be useful in considering reinforcement learning applications.