Adaptive evolutionary programming based on reinforcement learning

Authors:
Huaxiang Zhang;Jing Lu
Affiliations:
Department of Computer Science, Shandong Normal University, Jinan 250014, Shandong China;Department of Computer Science, Shandong University of Finance, Jinan 250014, Shandong China
Venue:
Information Sciences: an International Journal
Year:
2008

Citing 12
Cited 5

Evolving artificial intelligence

Evolving artificial intelligence
Evolutionary computation: toward a new philosophy of machine intelligence

Evolutionary computation: toward a new philosophy of machine intelligence
Intelligence through simulated evolution: forty years of evolutionary programming

Intelligence through simulated evolution: forty years of evolutionary programming
Introduction to Reinforcement Learning

Introduction to Reinforcement Learning
Reducing the time complexity of the derandomized evolution strategy with covariance matrix adaptation (CMA-ES)

Evolutionary Computation
A single-point mutation evolutionary programming

Information Processing Letters
DE/EDA: a new evolutionary algorithm for global optimization

Information Sciences—Informatics and Computer Science: An International Journal
Human evolutionary model: A new approach to optimization

Information Sciences: an International Journal
An overview of evolutionary algorithms for parameter optimization

Evolutionary Computation
A study of particle swarm optimization particle trajectories

Information Sciences: an International Journal
Evolutionary programming made faster

IEEE Transactions on Evolutionary Computation
Evolutionary programming using mutations based on the Levy probability distribution

IEEE Transactions on Evolutionary Computation

Oppositional biogeography-based optimization

SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
Empirical analysis of an on-line adaptive system using a mixture of Bayesian networks

Information Sciences: an International Journal
Hessian matrix distribution for Bayesian policy gradient reinforcement learning

Information Sciences: an International Journal
Function optimisation by learning automata

Information Sciences: an International Journal
An evolutionary programming algorithm for survivable routing and wavelength assignment in transparent optical networks

Information Sciences: an International Journal

Quantified Score

Hi-index	0.07

Visualization

Abstract

This paper studies evolutionary programming and adopts reinforcement learning theory to learn individual mutation operators. A novel algorithm named RLEP (Evolutionary Programming based on Reinforcement Learning) is proposed. In this algorithm, each individual learns its optimal mutation operator based on the immediate and delayed performance of mutation operators. Mutation operator selection is mapped into a reinforcement learning problem. Reinforcement learning methods are used to learn optimal policies by maximizing the accumulated rewards. According to the calculated Q function value of each candidate mutation operator, an optimal mutation operator can be selected to maximize the learned Q function value. Four different mutation operators have been employed as the basic candidate operators in RLEP and one is selected for each individual in different generations. Our simulation shows the performance of RLEP is the same as or better than the best of the four basic mutation operators.