Fitness Function Comparison for GA-Based Feature Construction

  • Authors:
  • Leila S. Shafti;Eduardo Pérez

  • Affiliations:
  • Escuela Plitécnica Superior, Universidad Autónoma de Madrid, E-28049, Spain;Escuela Plitécnica Superior, Universidad Autónoma de Madrid, E-28049, Spain

  • Venue:
  • Current Topics in Artificial Intelligence
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

When primitive data representation yields attribute interactions, learning requires feature construction. MFE2/GA, a GA-based feature construction has been shown to learn more accurately than others when there exist several complex attribute interactions. A new fitness function, based on the principle of Minimum Description Length (MDL), is proposed and implemented as part of the MFE3/GA system. Since the individuals of the GA population are collections of new features constructed to change the representation of data, an MDL-based fitness considers not only the part of data left unexplained by the constructed features (errors), but also the complexity of the constructed features as a new representation (theory). An empirical study shows the advantage of the new fitness over other fitness not based on MDL, and both are compared to the performance baselines provided by relevant systems.