REPMAC: A New Hybrid Approach to Highly Imbalanced Classification Problems

  • Authors:
  • Hernán Ahumada;Guillermo L. Grinblat;Lucas C. Uzal;Pablo M. Granitto;Alejandro Ceccatto

  • Affiliations:
  • -;-;-;-;-

  • Venue:
  • HIS '08 Proceedings of the 2008 8th International Conference on Hybrid Intelligent Systems
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

The class imbalance problem (when one of the classes has much less samples than the others) is of great importance in machine learning, because it corresponds to many critical applications. In this work we introduce the Recursive Partitioning of the Majority Class (REPMAC) algorithm, a new hybrid method to solve imbalanced problems. Using a clustering method, REPMAC recursively splits the majority class in several subsets, creating a decision tree, until the resulting sub-problems are balanced or easy to solve. At that point, a classifier is fitted to each sub-problem. We evaluate the new method on 7 datasets from the UCI repository, finding that REPMAC is more efficient than other methods usually applied to imbalanced datasets.