Static pruning of terms in inverted files

  • Authors:
  • Roi Blanco;Álvaro Barreiro

  • Affiliations:
  • IRLab, Computer Science Department, University of Coruña, Spain;IRLab, Computer Science Department, University of Coruña, Spain

  • Venue:
  • ECIR'07 Proceedings of the 29th European conference on IR research
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper addresses the problem of identifying collection dependent stop-words in order to reduce the size of inverted files. We present four methods to automatically recognise stop-words, analyse the tradeoff between efficiency and effectiveness, and compare them with a previous pruning approach. The experiments allow us to conclude that in some situations stop-words pruning is competitive with respect to other inverted file reduction techniques.