Assessing the impact of stemming accuracy on information retrieval

  • Authors:
  • Felipe N. Flores;Viviane P. Moreira;Carlos A. Heuser

  • Affiliations:
  • Instituto de Informática – UFRGS, Porto Alegre, Brazil;Instituto de Informática – UFRGS, Porto Alegre, Brazil;Instituto de Informática – UFRGS, Porto Alegre, Brazil

  • Venue:
  • PROPOR'10 Proceedings of the 9th international conference on Computational Processing of the Portuguese Language
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

The quality of stemming algorithms is typically measured in two different ways: (i) how accurately they map the variant forms of a word to the same stem; or (ii) how much improvement they bring to Information Retrieval. In this paper, we evaluate different Portuguese stemming algorithms in terms of accuracy and in terms of their aid to Information Retrieval. The aim is to assess whether the most accurate stemmers are also the ones that bring the biggest gain in Information Retrieval. Our results show that some kind of correlation does exist, but it is not as strong as one might have expected.