An experimental study of boosting model classifiers for chinese text categorization

  • Authors:
  • Yibing Geng;Guomin Zhu;Junrui Qiu;Jilian Fan;Jingchang Zhang

  • Affiliations:
  • Library, Second Military Medical University, Shanghai, P.R. China;Library, Second Military Medical University, Shanghai, P.R. China;Library, Second Military Medical University, Shanghai, P.R. China;Library, Second Military Medical University, Shanghai, P.R. China;Library, Second Military Medical University, Shanghai, P.R. China

  • Venue:
  • ICADL'04 Proceedings of the 7th international Conference on Digital Libraries: international collaboration and cross-fertilization
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

Text categorization is a crucial task of increasing importance. Our work focuses on the study of Chinese text categorization on the basis of Boosting model. We chose the People's Daily news from TREC5 as our benchmark datasets. A minor modification to AdaBoost algorithm (Freund and Schapire, 1996, 2000) was applied for this hypothesis. By way of using the F1 measure for its final evaluation, the results of the Boosting model (AdaBoost.MH) is proved to be effective and outperforms most of other algorithms reported for Chinese text categorization.