Class-dependent projection based method for text categorization

  • Authors:
  • Lifei Chen;Gongde Guo;Kaijun Wang

  • Affiliations:
  • School of Mathematics and Computer Science, Fujian Normal University, Fujian 350108, China;School of Mathematics and Computer Science, Fujian Normal University, Fujian 350108, China;School of Mathematics and Computer Science, Fujian Normal University, Fujian 350108, China

  • Venue:
  • Pattern Recognition Letters
  • Year:
  • 2011

Quantified Score

Hi-index 0.10

Visualization

Abstract

Text categorization presents unique challenges to traditional classification methods due to the large number of features inherent in the datasets from real-world applications of text categorization, and a great deal of training samples. In high-dimensional document data, the classes are typically categorized only by subsets of features, which are typically different for the classes of different topics. This paper presents a simple but effective classifier for text categorization using class-dependent projection based method. By projecting onto a set of individual subspaces, the samples belonging to different document classes are separated such that they are easily to be classified. This is achieved by developing a new supervised feature weighting algorithm to learn the optimized subspaces for all the document classes. The experiments carried out on common benchmarking corpuses showed that the proposed method achieved both higher classification accuracy and lower computational costs than some distinguishing classifiers in text categorization, especially for datasets including document categories with overlapping topics.