Association classification based on sample weighting

  • Authors:
  • Jin Zhang;Xiaoyun Chen;Yi Chen;Yunfa Hu

  • Affiliations:
  • Department of Computer & Information Technology, Fudan University, Shanghai, China;,Department of Computer & Information Technology, Fudan University, Shanghai, China;Department of Computer & Information Technology, Fudan University, Shanghai, China;Department of Computer & Information Technology, Fudan University, Shanghai, China

  • Venue:
  • FSKD'05 Proceedings of the Second international conference on Fuzzy Systems and Knowledge Discovery - Volume Part II
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

In the territory of text categorization, the distribution and quality of sample set is highly influential to categorization result. Associated rule categorization ARC-BC is effective under common circumstances. The accuracy of categorization obviously falls as distribution of feature words of training samples is uneven. In this paper, a Chinese text classification approach was proposed based on sample weighting associated rules (SW-ARC). The approach improved substantial classification efficiency by performing self-adapting sample weights adjustment. Experiment result shows SW-ARC can solve the quality fall caused by uneven distribution of feature words. Macro-average recall of open test increases from 50% of ARC-BC to 70% of SW-ARC, Macro-average precision increases from 28% of ARC-BC to 70% of SW-ARC.