Key phrase extraction: a hybrid assignment and extraction approach

  • Authors:
  • Chau Q. Nguyen;Tuoi T. Phan

  • Affiliations:
  • HCM University of Industry, Go Vap Dist, HCMC, Vietnam;HCMC University of Technology, HCMC, Vietnam

  • Venue:
  • Proceedings of the 11th International Conference on Information Integration and Web-based Applications & Services
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Automatic key phrase extraction is fundamental to the success of many recent digital library applications and semantic information retrieval techniques and a difficult and essential problem in Vietnamese natural language processing (NLP). In this work, we propose a novel method for key phrase extracting of Vietnamese text that combines assignment and extraction approaches. We also explore NLP techniques that we propose for the analysis of Vietnamese texts, focusing on the advanced candidate phrases recognition phase as well as part-of-speech (POS) tagging. Then we propose a method that exploits specific characteristics of the Vietnamese language and exploits the Vietnamese Wikipedia as an ontology for key phrase ambiguity resolution. Finally, we show the results of several experiments that have examined the impacts of strategies chosen for Vietnamese key phrase extracting.