Differentiating between individual class performance in genetic programming fitness for classification with unbalanced data

  • Authors:
  • Urvesh Bhowan;Mark John'ston;Mengjie Zhang

  • Affiliations:
  • School of Engineering and Computer Science and Victoria University of Wellington, Wellington, New Zealand;School of Mathematics, Statistics and Operations Research and Victoria University of Wellington, Wellington, New Zealand;School of Engineering and Computer Science and Victoria University of Wellington, Wellington, New Zealand

  • Venue:
  • CEC'09 Proceedings of the Eleventh conference on Congress on Evolutionary Computation
  • Year:
  • 2009

Quantified Score

Hi-index 0.02

Visualization

Abstract

This paper investigates improvements to the fitness function in Genetic Programming to better solve binary classification problems with unbalanced data. Data sets are unbalanced when there is a majority of examples for one particular class over the other class(es). We show that using overall classification accuracy as the fitness function evolves classifiers with a performance bias toward the majority class at the expense of minority class performance. We develop four new fitness functions which consider the accuracy of majority and minority class separately to address this learning bias. Results using these fitness functions show that good accuracy for both the minority and majority classes can be achieved from evolved classifiers while keeping overall performance high and balanced across the two classes.