A class of multistep sparse matrix strategies for concept decomposition matrix approximation

  • Authors:
  • Chi Shen;Mike Unuakhalu

  • Affiliations:
  • Kentucky State University, Frankfort, KY;Kentucky State University, Frankfort, KY

  • Venue:
  • Proceedings of the 2009 ACM symposium on Applied Computing
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

In information retrieval, text documents are usually modeled as a term-document matrix which has high dimensional and space vectors. To reduce the high dimensions, one of the various dimensionality reduction methods, concept decomposition, has been developed by [3]. This method is based on document clustering techniques and least-square matrix approximation to approximate the matrix of vectors. Gao and Zhang [4] have indicated that the retrieval accuracy from the concept decomposition can be comparable to that from Latent Semantic Indexing. However the numerical computation is expensive. In this paper we presented a class of multistep spare matrix strategies for concept decomposition matrix approximation. In this approach, a series of simple sparse matrices are used to approximate the decompositions. Our numerical experiments show the advantage of such an approach in terms of storage costs and query time compared with other approaches while maintaining comparable retrieval quality.