A balanced accuracy fitness function leads to robust analysis using grammatical evolution neural networks in the case of class imbalance

Authors:
Nicholas E. Hardison;Theresa J. Fanelli;Scott M. Dudek;David M. Reif;Marylyn D. Ritchie;Alison A. Motsinger-Reif
Affiliations:
North Carolina State University, Raleigh, NC, USA;Vanderbilt University, Nashville, TN, USA;Vanderbilt University, Nashville, TN, USA;US Environmental Protection Agency, Research Triangle Park, NC, USA;Vanderbilt University, Nashville, TN, USA;North Carolina State University, Raleigh, NC, USA
Venue:
Proceedings of the 10th annual conference on Genetic and evolutionary computation
Year:
2008

Citing 2
Cited 0

Application Of Genetic Algorithms To The Discovery Of Complex Models For Simulation Studies In Human Genetics

GECCO '02 Proceedings of the Genetic and Evolutionary Computation Conference
The class imbalance problem: A systematic study

Intelligent Data Analysis

Quantified Score

Hi-index	0.03

Visualization

Abstract

Grammatical Evolution Neural Networks (GENN) is a computational method designed to detect gene-gene interactions in genetic epidemiology, but has so far only been evaluated in situations with balanced numbers of cases and controls. Real data, however, rarely has such perfectly balanced classes. In the current study, we test the power of GENN to detect interactions in data with a range of class imbalance using two fitness functions (classification error and balanced error), as well as data re-sampling. We show that when using classification error, class imbalance greatly decreases the power of GENN. Re-sampling methods demonstrated improved power, but using balanced accuracy resulted in the highest power. Based on the results of this study, balanced error has replaced classification error in the GENN algorithm