Large margin DragPushing strategy for centroid text categorization

  • Authors:
  • Songbo Tan

  • Affiliations:
  • Software Department, Institute of Computing Technology, Chinese Academy of Sciences, P.O. Box 2704, Beijing, 100080, PR China and Graduate School of the Chinese Academy of Sciences, PR China

  • Venue:
  • Expert Systems with Applications: An International Journal
  • Year:
  • 2007

Quantified Score

Hi-index 12.05

Visualization

Abstract

Among all conventional methods for text categorization, centroid classifier is a simple and efficient method. However it often suffers from inductive bias (or model misfit) incurred by its assumption. DragPushing is a very simple and yet efficient method to address this so-called inductive bias problem. However, DragPushing employs only one criterion, i.e., training-set error, as its objective function that cannot guarantee the generalization capability. In this paper, we propose a generalized DragPushing strategy for centroid classifier, which we called as ''Large Margin DragPushing'' (LMDP). The experiments conducted on three benchmark evaluation collections show that LMDP achieved about one percent improvement over the performance of DragPushing and delivered top performance nearly as well as state-of-the-art SVM without incurring significant computational costs.