Compound reinforcement learning: theory and an application to finance

  • Authors:
  • Tohgoroh Matsui;Takashi Goto;Kiyoshi Izumi;Yu Chen

  • Affiliations:
  • Chubu University, Kasugai, Japan;Bank of Tokyo-Mitsubishi UFJ, Ltd., Tokyo, Japan;The University of Tokyo, Tokyo, Japan,JST PRESTO, Tokyo, Japan;The University of Tokyo, Tokyo, Japan

  • Venue:
  • EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper describes compound reinforcement learning (RL) that is an extended RL based on the compound return. Compound RL maximizes the logarithm of expected double-exponentially discounted compound return in return-based Markov decision processes (MDPs). The contributions of this paper are (1) Theoretical description of compound RL that is an extended RL framework for maximizing the compound return in a return-based MDP and (2) Experimental results in an illustrative example and an application to finance.