Multi-class imbalanced data-sets with linguistic fuzzy rule based classification systems based on pairwise learning

  • Authors:
  • Alberto Fernández;Mara José Del Jesus;Francisco Herrera

  • Affiliations:
  • Dept. of Computer Science, University of Jaén;Dept. of Computer Science, University of Jaén;Dept. of Computer Science and A.I., University of Granada

  • Venue:
  • IPMU'10 Proceedings of the Computational intelligence for knowledge-based systems design, and 13th international conference on Information processing and management of uncertainty
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

In a classification task, the imbalance class problem is present when the data-set has a very different distribution of examples among their classes. The main handicap of this type of problem is that standard learning algorithms consider a balanced training set and this supposes a bias towards the majority classes. In order to provide a correct identification of the different classes of the problem, we propose a methodology based on two steps: first we will use the one-vs-one binarization technique for decomposing the original data-set into binary classification problems. Then, whenever each one of these binary subproblems is imbalanced, we will apply an oversampling step, using the SMOTE algorithm, in order to rebalance the data before the pairwise learning process. For our experimental study we take as basis algorithm a linguistic Fuzzy Rule Based Classification System, and we aim to show not only the improvement in performance achieved with our methodology against the basic approach, but also to show the good synergy of the pairwise learning proposal with the selected oversampling technique.