Darwinian embodied evolution of the learning ability for survival

  • Authors:
  • Stefan Elfwing;Eiji Uchibe;Kenji Doya;Henrik I Christensen

  • Affiliations:
  • Centre for Autonomous Systems, Numerical Analysis andComputer Science, Royal Institute of Technology (KTH), Sweden, Neural Computation Unit, Initial Research Project, OkinawaInstitute of Science a ...;Neural Computation Unit, Initial Research Project, OkinawaInstitute of Science and Technology, JST, Japan;Neural Computation Unit, Initial Research Project, OkinawaInstitute of Science and Technology, JST, Japan;Centre for Autonomous Systems, Numerical Analysis andComputer Science, Royal Institute of Technology (KTH), Sweden

  • Venue:
  • Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this article we propose a framework for performing embodied evolution with a limited number of robots, by utilizing time-sharing in subpopulations of virtual agents hosted in each robot. Within this framework, we explore the combination of within-generation learning of basic survival behaviors by reinforcement learning, and evolutionary adaptations over the generations of the basic behavior selection policy, the reward functions, and metaparameters for reinforcement learning. We apply a biologically inspired selection scheme, in which there is no explicit communication of the individuals芒聙聶 fitness information. The individuals can only reproduce offspring by mating芒聙聰a pair-wise exchange of genotypes芒聙聰and the probability that an individual reproduces offspring in its own subpopulation is dependent on the individual芒聙聶s 芒聙聵芒聙聵health,芒聙聶芒聙聶 that is, energy level, at the mating occasion. We validate the proposed method by comparing it with evolution using standard centralized selection, in simulation, and by transferring the obtained solutions to hardware using two real robots.