Classification of Text Documents

  • Authors:
  • Affiliations:
  • Venue:
  • ICPR '98 Proceedings of the 14th International Conference on Pattern Recognition-Volume 2 - Volume 2
  • Year:
  • 1998

Quantified Score

Hi-index 0.00

Visualization

Abstract

We investigate four different classification methods for document classification. Naive Bayes classifier, nearest neighbor classifier, decision tree classifier and subspace method were applied to seven-class Yahoo newsgroups individually and in combination. We studied three classifier combination approaches: simple voting, dynamic classifier selection, and adaptive classifier combination. Our experimental results indicate that naive Bayes classifier and the subspace method outperform the other two classification methods on our data sets. Combinations of multiple classifiers did not always improve classification accuracy. Among the three different combination approaches, the adaptive classifier combination method proposed here performed the best.