Automatic extraction of keywords for the portuguese language

Authors:
Maria Abadia Lacerda Dias;Marcelo de Gomensoro Malheiros
Affiliations:
UNICAMP – State University of Campinas, Campinas, SP, Brazil;UNIVATES University Center, Lajeado, RS, Brazil
Venue:
PROPOR'06 Proceedings of the 7th international conference on Computational Processing of the Portuguese Language
Year:
2006

Citing 1
Cited 0

KEA: practical automatic keyphrase extraction

Proceedings of the fourth ACM conference on Digital libraries

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper outlines the adaptation of an algorithm for automatic extraction of keywords for the Portuguese Language. Keywords make possible to summarize the contents of documents in a compact form, and may also be used as an efficient measure of similarity between texts. This work is focused on the extraction of keywords for theses on several fields of knowledge. To identify the keywords the KEA algorithm was used, together with a stemming technique specific to Portuguese and a manually created list of stopwords. It is shown that the results obtained are good enough for practical use and similarly match what have been done for the English Language.