BUAP-UPV TPIRS: a system for document indexing reduction at WebCLEF

  • Authors:
  • David Pinto;Héctor Jiménez-Salazar;Paolo Rosso;Emilio Sanchis

  • Affiliations:
  • Department of Information Systems and Computation, UPV, Valencia, Spain;Faculty of Computer Science, BUAP, Ciudad Universitaria, Puebla, Mexico;Department of Information Systems and Computation, UPV, Valencia, Spain;Department of Information Systems and Computation, UPV, Valencia, Spain

  • Venue:
  • CLEF'05 Proceedings of the 6th international conference on Cross-Language Evaluation Forum: accessing Multilingual Information Repositories
  • Year:
  • 2005

Abstract

In this paper we present the results of the BUAP and UPV universities at WebCLEF, a task of CLEF 2005. Specifically, we evaluate our information retrieval system on the bilingual “English to Spanish” task. Our system uses a term reduction process based on the Transition Point technique. Our results show that it is possible to reduce the number of terms to index, thereby improving the performance of our system. We evaluate different percentages of reduction over a subset of EuroGOV in order to determine the best one. We observed that after reducing the corpus by 82.55%, a Mean Reciprocal Rank of 0.0844 was obtained, compared with 0.0465 for the same evaluation with full documents.
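
For readers unfamiliar with the metric quoted above, the following is a minimal sketch of how a Mean Reciprocal Rank is commonly computed: for each query, take the reciprocal of the rank at which the first relevant document appears (0 if none is retrieved), then average over all queries. The function name `mean_reciprocal_rank` and the example ranks are hypothetical and chosen only for illustration; they are not taken from the paper or from the WebCLEF evaluation tools.

```python
from typing import Optional, Sequence


def mean_reciprocal_rank(first_relevant_ranks: Sequence[Optional[int]]) -> float:
    """Average of 1/rank over all queries.

    Each entry is the 1-based rank of the first relevant document for one
    query, or None when no relevant document was retrieved (contributes 0).
    """
    if not first_relevant_ranks:
        raise ValueError("at least one query is required")
    total = 0.0
    for rank in first_relevant_ranks:
        if rank is None:
            continue  # query with no relevant result adds 0 to the sum
        if rank < 1:
            raise ValueError("ranks are 1-based and must be >= 1")
        total += 1.0 / rank
    return total / len(first_relevant_ranks)


# Hypothetical ranks for four queries (illustrative values, not paper data):
# found at rank 1, at rank 4, not found, and at rank 10.
example_ranks = [1, 4, None, 10]
example_mrr = mean_reciprocal_rank(example_ranks)  # (1 + 0.25 + 0 + 0.1) / 4
```

Under this definition, a score such as 0.0844 means that the first relevant page appears, on average, well below the top of the ranking, so the comparison with 0.0465 is best read as a relative improvement.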