Stemming algorithms have traditionally been used in information retrieval systems because they produce a more concise word representation. However, the efficiency of these algorithms depends on the language to which they are applied. This paper presents STEMBR, a stemmer for Brazilian Portuguese whose suffix treatment is based on a statistical study of the frequency of word-final letters in Brazilian web pages. The proposed stemmer is compared with another algorithm developed specifically for Portuguese. The results show the efficiency of our stemmer.
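The kind of last-letter frequency study mentioned in the abstract can be illustrated with a minimal sketch. This is not the STEMBR implementation: the function name `last_letter_frequencies` and the sample word list are assumptions made for illustration, and the actual study was run over words collected from Brazilian web pages.

```python
from collections import Counter


def last_letter_frequencies(words):
    """Return the relative frequency of each word-final letter.

    Hypothetical helper for illustration only; it is not taken from STEMBR.
    """
    # Count the final character of every non-empty word, ignoring case.
    counts = Counter(w[-1].lower() for w in words if w)
    total = sum(counts.values())
    # most_common() orders the letters from most to least frequent.
    return {letter: n / total for letter, n in counts.most_common()}


# Assumed toy sample; a real study would use a large web-derived vocabulary.
sample = ["casa", "menina", "livros", "cantar", "falando", "bonito", "meninos"]
freqs = last_letter_frequencies(sample)

for letter, share in freqs.items():
    print(f"{letter}: {share:.2f}")
```

A distribution of this kind could indicate which word endings are frequent enough to deserve dedicated suffix rules, though how STEMBR turns the statistics into rules is not described in the abstract.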