The effect of class imbalance, complexity, size, and learning distribution on classifier performance

  • Authors: Sofia Visa

  • Affiliations: Department of Mathematics and Computer Science, College of Wooster, 1189 Beall Ave., Wooster, OH 44691, USA

  • Venue: International Journal of Advanced Intelligence Paradigms
  • Year: 2011

Abstract

Classes in real-world datasets have various properties (such as imbalance, size, complexity, and class distribution) that make the classification task more difficult. We investigate the robustness of six classification techniques on data exhibiting various combinations of these properties. One artificial domain and six real-world datasets are used in the experiments. The results of our analysis indicate that frequency-based classifiers (such as the fuzzy and the Bayes classifiers) are more robust across varying imbalance, size, complexity, and training distribution.
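
To make the kind of experiment described in the abstract concrete, the following is a minimal sketch, not the paper's actual protocol. It assumes a one-dimensional artificial domain with two Gaussian classes and uses a plain Gaussian Bayes classifier, which may differ from the Bayes variant studied in the paper. All function names (`make_data`, `fit_gaussian_bayes`, `minority_recall`, `run_experiment`) and all parameter values (class sizes, the `gap` between class means, the imbalance ratios) are hypothetical choices made for illustration only.

```python
import math
import random


def make_data(n_major, n_minor, gap, rng):
    """Two 1-D Gaussian classes; `gap` is the distance between class means
    (an assumed stand-in for complexity: a smaller gap means more overlap)."""
    major = [(rng.gauss(0.0, 1.0), 0) for _ in range(n_major)]
    minor = [(rng.gauss(gap, 1.0), 1) for _ in range(n_minor)]
    return major + minor


def fit_gaussian_bayes(data):
    """Estimate the mean, variance, and prior of each class from its samples."""
    model = {}
    for label in (0, 1):
        xs = [x for x, y in data if y == label]
        mean = sum(xs) / len(xs)
        var = sum((x - mean) ** 2 for x in xs) / len(xs) + 1e-9
        model[label] = (mean, var, len(xs) / len(data))
    return model


def predict(model, x):
    """Return the class with the larger log-posterior for the point x."""
    def log_posterior(label):
        mean, var, prior = model[label]
        return (math.log(prior)
                - 0.5 * math.log(2 * math.pi * var)
                - (x - mean) ** 2 / (2 * var))
    return max(model, key=log_posterior)


def minority_recall(model, test):
    """Fraction of minority-class (label 1) test points classified correctly."""
    minority = [x for x, y in test if y == 1]
    hits = sum(1 for x in minority if predict(model, x) == 1)
    return hits / len(minority)


def run_experiment(ratios=(1, 5, 20), n_major=1000, gap=2.0, seed=0):
    """Train at several majority:minority ratios; evaluate on a balanced test set."""
    rng = random.Random(seed)
    results = {}
    for ratio in ratios:
        train = make_data(n_major, n_major // ratio, gap, rng)
        test = make_data(500, 500, gap, rng)
        results[ratio] = minority_recall(fit_gaussian_bayes(train), test)
    return results


if __name__ == "__main__":
    for ratio, recall in run_experiment().items():
        print(f"imbalance {ratio}:1  minority recall = {recall:.3f}")
```

Varying `ratios`, `n_major`, and `gap` corresponds loosely to varying imbalance, size, and complexity. In this sketch the estimated class prior shifts the decision boundary toward the minority class as the training imbalance grows, so minority recall on the balanced test set would be expected to fall; the sketch says nothing about how the six classifiers in the paper compare.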