Generalized vector spaces model in information retrieval

  • Authors:
  • S. K. M. Wong;Wojciech Ziarko;Patrick C. N. Wong

  • Affiliations:
  • Department of Computer Science, University of Regina, Regina, Sask., Canada S4S 0A2;-;-

  • Venue:
  • SIGIR '85 Proceedings of the 8th annual international ACM SIGIR conference on Research and development in information retrieval
  • Year:
  • 1985

Quantified Score

Hi-index 0.00

Visualization

Abstract

In information retrieval, it is common to model index terms and documents as vectors in a suitably defined vector space. The main difficulty with this approach is that the explicit representation of term vectors is not known a priori. For this reason, the vector space model adopted by Salton for the SMART system treats the terms as a set of orthogonal vectors. In such a model it is often necessary to adopt a separate, corrective procedure to take into account the correlations between terms. In this paper, we propose a systematic method (the generalized vector space model) to compute term correlations directly from automatic indexing scheme. We also demonstrate how such correlations can be included with minimal modification in the existing vector based information retrieval systems. The preliminary experimental results obtained from the new model are very encouraging.