Semantically-based crossover in genetic programming: application to real-valued symbolic regression

  • Authors:
  • Nguyen Quang Uy;Nguyen Xuan Hoai;Michael O'Neill;R. I. Mckay;Edgar Galván-López

  • Affiliations:
  • Complex & Adaptive Systems Lab, School of Computer Science & Informatics, University College Dublin, Dublin, Ireland;Department of Computer Science, Le Quy Don University, Hanoi, Vietnam;Complex & Adaptive Systems Lab, School of Computer Science & Informatics, University College Dublin, Dublin, Ireland;School of Computer Science and Engineering, Seoul National University, Seoul, Korea;Complex & Adaptive Systems Lab, School of Computer Science & Informatics, University College Dublin, Dublin, Ireland

  • Venue:
  • Genetic Programming and Evolvable Machines
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

We investigate the effects of semantically-based crossover operators in genetic programming, applied to real-valued symbolic regression problems. We propose two new relations derived from the semantic distance between subtrees, known as semantic equivalence and semantic similarity. These relations are used to guide variants of the crossover operator, resulting in two new crossover operators--semantics aware crossover (SAC) and semantic similarity-based crossover (SSC). SAC, was introduced and previously studied, is added here for the purpose of comparison and analysis. SSC extends SAC by more closely controlling the semantic distance between subtrees to which crossover may be applied. The new operators were tested on some real-valued symbolic regression problems and compared with standard crossover (SC), context aware crossover (CAC), Soft Brood Selection (SBS), and No Same Mate (NSM) selection. The experimental results show on the problems examined that, with computational effort measured by the number of function node evaluations, only SSC and SBS were significantly better than SC, and SSC was often better than SBS. Further experiments were also conducted to analyse the perfomance sensitivity to the parameter settings for SSC. This analysis leads to a conclusion that SSC is more constructive and has higher locality than SAC, NSM and SC; we believe these are the main reasons for the improved performance of SSC.