Likey: Unsupervised language-independent keyphrase extraction

  • Authors:
  • Mari-Sanna Paukkeri;Timo Honkela

  • Affiliations:
  • Aalto University School of Science and Technology, AALTO, Finland;Aalto University School of Science and Technology, AALTO, Finland

  • Venue:
  • SemEval '10 Proceedings of the 5th International Workshop on Semantic Evaluation
  • Year:
  • 2010

Quantified Score

Hi-index 0.01

Visualization

Abstract

Likey is an unsupervised statistical approach for keyphrase extraction. The method is language-independent and the only language-dependent component is the reference corpus with which the documents to be analyzed are compared. In this study, we have also used another language-dependent component: an English-specific Porter stemmer as a preprocessing step. In our experiments of keyphrase extraction from scientific articles, the Likey method outperforms both supervised and unsupervised baseline methods.