Improving SVM training by means of NTIL when the data sets are imbalanced

  • Authors:
  • Carlos E. Vivaracho

  • Affiliations:
  • Dep. Informática, U. de Valladolid, Spain

  • Venue:
  • ISMIS'06 Proceedings of the 16th international conference on Foundations of Intelligent Systems
  • Year:
  • 2006

Quantified Score

Hi-index 0.02

Visualization

Abstract

This paper deals with the problem of training a discriminative classifier when the data sets are imbalanced. More specifically, this work is concerned with the problem of classify a sample as belonging, or not, to a Target Class (TC), when the number of examples from the “Non-Target Class” (NTC) is much higher than those of the TC. The effectiveness of the heuristic method called Non Target Incremental Learning (NTIL) in the task of extracting, from the pool of NTC representatives, the most discriminant training subset with regard to the TC, has been proved when an Artificial Neural Network is used as classifier (ISMIS 2003). In this paper the effectiveness of this method is also shown for Support Vector Machines.