Inverse random under sampling for class imbalance problem and its application to multi-label classification

  • Authors:
  • Muhammad Atif Tahir;Josef Kittler;Fei Yan

  • Affiliations:
  • Centre for Vision, Speech and Signal Processing, University of Surrey, Guildford GU2 7XH, UK and School of Computing, Engineering and Information Science, Northumbria University, Newcastle Upon Ty ...;Centre for Vision, Speech and Signal Processing, University of Surrey, Guildford GU2 7XH, UK;Centre for Vision, Speech and Signal Processing, University of Surrey, Guildford GU2 7XH, UK

  • Venue:
  • Pattern Recognition
  • Year:
  • 2012

Quantified Score

Hi-index 0.01

Visualization

Abstract

In this paper, a novel inverse random under sampling (IRUS) method is proposed for the class imbalance problem. The main idea is to severely under sample the majority class thus creating a large number of distinct training sets. For each training set we then find a decision boundary which separates the minority class from the majority class. By combining the multiple designs through fusion, we construct a composite boundary between the majority class and the minority class. The proposed methodology is applied on 22 UCI data sets and experimental results indicate a significant increase in performance when compared with many existing class-imbalance learning methods. We also present promising results for multi-label classification, a challenging research problem in many modern applications such as music, text and image categorization.