Genetic Programming with a Genetic Algorithm for Feature Construction and Selection

  • Authors:
  • Matthew G. Smith;Larry Bull

  • Affiliations:
  • Faculty of Computing, Engineering & Mathematical Sciences, University of the West of England, Bristol, UK BS16 1QY;Faculty of Computing, Engineering & Mathematical Sciences, University of the West of England, Bristol, UK BS16 1QY

  • Venue:
  • Genetic Programming and Evolvable Machines
  • Year:
  • 2005

Quantified Score

Hi-index 0.01

Visualization

Abstract

The use of machine learning techniques to automatically analyse data for information is becoming increasingly widespread. In this paper we primarily examine the use of Genetic Programming and a Genetic Algorithm to pre-process data before it is classified using the C4.5 decision tree learning algorithm. Genetic Programming is used to construct new features from those available in the data, a potentially significant process for data mining since it gives consideration to hidden relationships between features. A Genetic Algorithm is used to determine which such features are the most predictive. Using ten well-known datasets we show that our approach, in comparison to C4.5 alone, provides marked improvement in a number of cases. We then examine its use with other well-known machine learning techniques.