SVM-Based feature selection of latent semantic features

  • Authors:
  • K. Shima;M. Todoriki;A. Suzuki

  • Affiliations:
  • Department of Quantum Engineering and Systems Science, University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo 1138656, Japan;Department of Quantum Engineering and Systems Science, University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo 1138656, Japan;Department of Quantum Engineering and Systems Science, University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo 1138656, Japan

  • Venue:
  • Pattern Recognition Letters
  • Year:
  • 2004

Quantified Score

Hi-index 0.10

Visualization

Abstract

Latent Semantic Indexing (LSI) is an effective method to extract features that captures underlying latent semantic structure in the word usage across documents, However, subspace selected by this method may not be the most appropriate one to classify documents, since it orders extracted features according to their variances, not the classification power. We propose to apply feature ordering method based on support vector machines in order to select LSI-features that is suited for classification. Experimental results suggest that the method improves classification performance with considerably more compact representation.