In this paper we investigate the convergence properties of a variant of the Covariance Matrix Adaptation Evolution Strategy (CMA-ES). Our study builds on the recent theoretical result that the pure rank-μ update CMA-ES performs natural gradient descent on the parameter space of Gaussian distributions. We derive a novel variant of the natural gradient method in which the parameters of the Gaussian distribution are updated along the natural gradient so as to improve a newly defined function on the parameter space. We study this algorithm on composites of a monotone function with a convex quadratic function. We prove that the algorithm adapts the covariance matrix so that it becomes proportional to the inverse of the Hessian of the original objective function, and we quantify both the speed of covariance matrix adaptation and the rate of convergence of the parameters. Finally, we introduce a stochastic algorithm that approximates the natural gradient from finite samples, and we present simulation results that evaluate how precisely the stochastic algorithm approximates the deterministic, ideal one and how similarly our algorithm and the CMA-ES perform.
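For concreteness, the following minimal Python sketch illustrates the kind of finite-sample natural-gradient update (pure rank-μ style) that the abstract describes: candidates are sampled from a Gaussian, ranked by the objective, and the mean and covariance are moved along a sample estimate of the natural gradient. The population size, learning rates `eta_m` and `eta_C`, truncation weights, and the test Hessian `H` are illustrative assumptions, not the paper's exact settings.

```python
import numpy as np

def natural_gradient_step(f, m, C, lam=20, eta_m=1.0, eta_C=0.2, rng=None):
    """One finite-sample natural-gradient (pure rank-mu style) update.

    Samples lam candidates from N(m, C), ranks them by f (minimization),
    and moves (m, C) along a sample estimate of the natural gradient.
    The equal truncation weights on the best half are an illustrative choice.
    """
    rng = np.random.default_rng() if rng is None else rng
    x = rng.multivariate_normal(m, C, size=lam)   # candidate solutions
    order = np.argsort([f(xi) for xi in x])       # indices of best candidates first
    mu = lam // 2
    w = np.zeros(lam)
    w[order[:mu]] = 1.0 / mu                      # weights sum to 1
    d = x - m
    # Natural gradient steps: m += eta_m * sum_i w_i (x_i - m),
    # C += eta_C * sum_i w_i ((x_i - m)(x_i - m)^T - C); since the
    # weights sum to 1, the C update is the convex combination below.
    m_new = m + eta_m * (w @ d)
    C_new = (1.0 - eta_C) * C + eta_C * np.einsum('i,ij,ik->jk', w, d, d)
    return m_new, C_new

# Illustrative run on a convex quadratic f(x) = x^T H x with an assumed
# Hessian H = diag(1, 10).
H = np.diag([1.0, 10.0])
f = lambda x: x @ H @ x
m, C = np.array([3.0, 3.0]), np.eye(2)
rng = np.random.default_rng(0)
for _ in range(200):
    m, C = natural_gradient_step(f, m, C, rng=rng)
print(m)            # mean approaches the optimum at the origin
print(C / C[0, 0])  # covariance shape tends toward inv(H) up to scale
```

This sketch omits the step-size control of the full CMA-ES and can therefore converge prematurely; it is meant only to show the form of the natural-gradient update analyzed in the paper, including the covariance shape approaching the inverse Hessian up to scale.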