A latent semantic classification model

  • Authors:
  • Ming-Wen Wang;Jian-Yun Nie;Xue-Qiang Zeng

  • Affiliations:
  • Jiangxi Normal University, Nanchang, Jiangxi, China;Université de Montréal, Montreal, Quebec, Canada;Jiangxi Medical College, Nanchang, Jiangxi, China

  • Venue:
  • Proceedings of the 14th ACM international conference on Information and knowledge management
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

Latent Semantic Indexing (LSI) has been successfully applied to information retrieval and text classification. However, when LSI is used in classification, some important features for small classes may be ignored because of their small feature values. To solve this problem, we propose the latent semantic classification (LSC) model which extends the LSI model in the following way: the classification information of the training documents is introduced into the latent semantic structure via a second set of latent variables, so that both indexing and classification information can be taken into account during the classification process. Our experiments on Reuters show that our new model performs better than the existing classification methods such as kNN and SVM.