The Method of Text Categorization on Imbalanced Datasets

  • Authors:
  • Li Xin-fu;Yu Yan;Yin Peng

  • Affiliations:
  • -;-;-

  • Venue:
  • ICCSN '09 Proceedings of the 2009 International Conference on Communication Software and Networks
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

In practical applications, datasets are usually imbalanced, but traditional approaches usually lead a low recognition rate. To address this problem, in this paper, over-sampling of the minority class has been proposed to increase the number of minority class, so as to achieve balance, thereby enhancing recognition rate of minority class. The experiments show that this approach achieved satisfactory results.