Genetic programming: on the programming of computers by means of natural selection
Genetic programming: on the programming of computers by means of natural selection
Machine Learning
From data mining to knowledge discovery: an overview
Advances in knowledge discovery and data mining
Target detection in SAR imagery by genetic programming
Advances in Engineering Software
Genetic Programming for Feature Discovery and Image Discrimination
Proceedings of the 5th International Conference on Genetic Algorithms
Learning and example selection for object and pattern detection
Learning and example selection for object and pattern detection
Genetic Programming for data classification: partitioning the search space
Proceedings of the 2004 ACM symposium on Applied computing
Editorial: special issue on learning from imbalanced data sets
ACM SIGKDD Explorations Newsletter - Special issue on learning from imbalanced datasets
Data mining in metric space: an empirical analysis of supervised learning performance criteria
Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Genetic programming classification of magnetic resonance data
GECCO '96 Proceedings of the 1st annual conference on Genetic and evolutionary computation
Learning when training data are costly: the effect of class distribution on tree induction
Journal of Artificial Intelligence Research
A linear genetic programming approach to intrusion detection
GECCO'03 Proceedings of the 2003 international conference on Genetic and evolutionary computation: PartII
Fitness functions in genetic programming for classification with unbalanced data
AI'07 Proceedings of the 20th Australian joint conference on Advances in artificial intelligence
GP classification under imbalanced data sets: active sub-sampling and AUC approximation
EuroGP'08 Proceedings of the 11th European conference on Genetic programming
Score aggregation techniques in retrieval experimentation
ADC '09 Proceedings of the Twentieth Australasian Conference on Australasian Database - Volume 92
Improving the classification accuracy of RBF and MLP neural networks trained with imbalanced samples
IDEAL'06 Proceedings of the 7th international conference on Intelligent Data Engineering and Automated Learning
Training genetic programming on half a million patterns: an example from anomaly detection
IEEE Transactions on Evolutionary Computation
Multi-Objective Genetic Programming for Classification with Unbalanced Data
AI '09 Proceedings of the 22nd Australasian Joint Conference on Advances in Artificial Intelligence
AUC analysis of the pareto-front using multi-objective GP for classification with unbalanced data
Proceedings of the 12th annual conference on Genetic and evolutionary computation
Genetic programming for classification with unbalanced data
EuroGP'10 Proceedings of the 13th European conference on Genetic Programming
ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Hi-index | 0.02 |
This paper investigates improvements to the fitness function in Genetic Programming to better solve binary classification problems with unbalanced data. Data sets are unbalanced when there is a majority of examples for one particular class over the other class(es). We show that using overall classification accuracy as the fitness function evolves classifiers with a performance bias toward the majority class at the expense of minority class performance. We develop four new fitness functions which consider the accuracy of majority and minority class separately to address this learning bias. Results using these fitness functions show that good accuracy for both the minority and majority classes can be achieved from evolved classifiers while keeping overall performance high and balanced across the two classes.