Reinforcement learning estimation of distribution algorithm

  • Authors:
  • Topon Kumar Paul;Hitoshi Iba

  • Affiliations:
  • Graduate School of Frontier Sciences, The University of Tokyo, Bunkyo-ku, Tokyo, Japan;Graduate School of Frontier Sciences, The University of Tokyo, Bunkyo-ku, Tokyo, Japan

  • Venue:
  • GECCO'03 Proceedings of the 2003 international conference on Genetic and evolutionary computation: Part II
  • Year:
  • 2003

Abstract

This paper proposes an algorithm for combinatorial optimization that uses reinforcement learning and an estimate of the joint probability distribution of promising solutions to generate a new population of solutions. We call it the Reinforcement Learning Estimation of Distribution Algorithm (RELEDA). To estimate the joint probability distribution, we treat each variable as univariate, then update the probability of each variable by applying a reinforcement learning method. Although we treat the variables as independent of one another, the proposed method can solve problems with highly correlated variables. To compare the efficiency of the proposed algorithm with that of other Estimation of Distribution Algorithms (EDAs), we report experimental results on two problems: the four peaks problem and the bipolar function.
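
The abstract does not give the update rule, so the following is only a minimal sketch of the general idea (a univariate probability vector that is nudged toward the selected solutions each generation), not RELEDA itself. The incremental rule in `update_probs`, the learning rate `alpha`, the probability bounds, and all function and parameter names are assumptions made for illustration. The fitness function follows the commonly used definition of the four peaks problem (leading ones, trailing zeros, and a bonus when both exceed a threshold `t`).

```python
import random


def four_peaks(bits, t):
    """Four peaks fitness: max(leading ones, trailing zeros) plus a bonus
    of len(bits) when both counts exceed the threshold t."""
    n = len(bits)
    head = 0
    for b in bits:
        if b != 1:
            break
        head += 1
    tail = 0
    for b in reversed(bits):
        if b != 0:
            break
        tail += 1
    bonus = n if head > t and tail > t else 0
    return max(head, tail) + bonus


def sample(probs, rng):
    """Draw one bit string, treating every variable as independent."""
    return [1 if rng.random() < p else 0 for p in probs]


def update_probs(probs, selected, alpha, lo=0.02, hi=0.98):
    """Assumed rule: move each marginal toward the frequency of 1s among
    the selected solutions, then clamp so no bit becomes fixed."""
    m = len(selected)
    new_probs = []
    for i, p in enumerate(probs):
        freq = sum(ind[i] for ind in selected) / m
        q = (1.0 - alpha) * p + alpha * freq
        new_probs.append(min(hi, max(lo, q)))
    return new_probs


def run(n=20, t=2, pop_size=50, n_select=10, alpha=0.1,
        generations=100, seed=0):
    """Univariate EDA loop: sample, select the best, update marginals."""
    rng = random.Random(seed)
    probs = [0.5] * n
    best, best_fit = None, -1
    for _ in range(generations):
        pop = [sample(probs, rng) for _ in range(pop_size)]
        pop.sort(key=lambda ind: four_peaks(ind, t), reverse=True)
        fit = four_peaks(pop[0], t)
        if fit > best_fit:
            best, best_fit = pop[0], fit
        probs = update_probs(probs, pop[:n_select], alpha)
    return best, best_fit, probs


if __name__ == "__main__":
    solution, fitness, _ = run()
    print(solution, fitness)
```

In this sketch the only state carried between generations is the vector of marginals, which is what treating each variable as univariate amounts to; any interaction between variables has to be captured indirectly through selection.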