A lexicon-based stemming procedure

  • Authors: Gilberto Silva; Claudia Oliveira
  • Affiliations: Datasus - Centro de Tecnologia da Informação do Ministério da Saúde, Rio de Janeiro, Brazil; Departamento de Engenharia de Computação, Instituto Militar de Engenharia, Rio de Janeiro, Brazil
  • Venue: PROPOR'03 Proceedings of the 6th international conference on Computational processing of the Portuguese language
  • Year: 2003

Abstract

This paper describes a stemming technique that relies primarily on the target language's lexicon, organised as an automaton of word strings. The clear separation between the lexicon and the procedure itself allows the stemmer to be customised for any language with few or even no changes to the program's source code.
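
As an illustration of the general idea only, the sketch below stores a lexicon in a trie (one simple form of deterministic automaton over characters) and keeps the lookup procedure independent of the language data. It is not the authors' implementation: the names `LexiconAutomaton`, `build_stemmer` and `PT_LEXICON`, the word-to-stem entries, and the choice to return unknown words unchanged are all assumptions made for this example.

```python
# Minimal sketch, NOT the paper's procedure: a trie used as a lexicon automaton.
# All names and the sample entries below are hypothetical.

STEM_KEY = ""  # empty string cannot collide with single-character edge labels


class LexiconAutomaton:
    """Trie whose accepting states carry the stem of the accepted word."""

    def __init__(self):
        self.root = {}

    def add(self, word, stem):
        # Follow (or create) one transition per character, then mark the
        # final state as accepting by attaching the stem to it.
        node = self.root
        for ch in word:
            node = node.setdefault(ch, {})
        node[STEM_KEY] = stem

    def stem(self, word):
        # Walk the automaton; a missing transition or a non-accepting final
        # state means the word is not in the lexicon, so return it unchanged
        # (an assumed fallback, chosen for simplicity).
        node = self.root
        for ch in word:
            node = node.get(ch)
            if node is None:
                return word
        return node.get(STEM_KEY, word)


def build_stemmer(entries):
    """Build an automaton from (word, stem) pairs; the code is language-neutral."""
    automaton = LexiconAutomaton()
    for word, stem in entries:
        automaton.add(word, stem)
    return automaton


# Hypothetical Portuguese entries; swapping this list is the only change
# needed to target another language.
PT_LEXICON = [
    ("meninos", "menin"),
    ("menina", "menin"),
    ("cantando", "cant"),
]

stemmer = build_stemmer(PT_LEXICON)
print(stemmer.stem("meninos"))   # menin
print(stemmer.stem("cantando"))  # cant
print(stemmer.stem("livro"))     # livro (not in the lexicon, left unchanged)
```

In this sketch all language-specific knowledge lives in the entry list, so adapting the stemmer to another language means supplying a different lexicon and leaving the lookup code untouched.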