A Comparison of Several Ensemble Methods for Text Categorization

  • Authors:
  • Yan-Shi Dong;Ke-Song Han

  • Affiliations:
  • Shanghai Jiao Tong University;Motorola Labs, China Research Center

  • Venue:
  • SCC '04 Proceedings of the 2004 IEEE International Conference on Services Computing
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

Text categorization (TC), as an important domain of machine learning, has many unique traits, such as huge number of features, serious redundant features, dataset imbalance, etc. In this paper the various ensemble methods of naïve Bayes classifiers and SVM classifiers are experimentally compared on the TC tasks. Besides, a new type of classifiers, moderated asymmetric naïve Bayes classifiers, is proposed. Its advantages over the conventional naïve Bayes classifiers in performance and computational efficiency are demonstrated.