The influence of semantics in IR using LSI and K-means clustering techniques

  • Authors:
  • D. Jiménez;E. Ferretti;V. Vidal;P. Rosso;C. F. Enguix

  • Affiliations:
  • Polythecnic University of Valencia, Spain;National University of San Luis, Argentina;Polythecnic University of Valencia, Spain;Polythecnic University of Valencia, Spain;Mediterranean University of Science and Technology, Spain

  • Venue:
  • ISICT '03 Proceedings of the 1st international symposium on Information and communication technologies
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper we study the influence of semantics in the information retrieval preprocessing. We concretely compare the reached performance with stemming and semantic lemmatization as preprocessing. Three techniques are used in the study: the direct use of a weighted matrix, the SVD technique in the LSI model and the bisecting spherical k-means clustering technique. although the results seem not to be very promising, we believe that they should be improved in the future.