Clustering to find exemplar terms for keyphrase extraction

Authors:
Zhiyuan Liu;Peng Li;Yabin Zheng;Maosong Sun
Affiliations:
Tsinghua University, Beijing, China;Tsinghua University, Beijing, China;Tsinghua University, Beijing, China;Tsinghua University, Beijing, China
Venue:
EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 1 - Volume 1
Year:
2009

Citing 14
Cited 15

Data Mining: Concepts and Techniques

Data Mining: Concepts and Techniques
A practical system of keyphrase extraction for web pages

Proceedings of the 14th ACM international conference on Information and knowledge management
Improved automatic keyword extraction given more linguistic knowledge

EMNLP '03 Proceedings of the 2003 conference on Empirical methods in natural language processing
Keyphrase Extraction Using Semantic Networks Structure Analysis

ICDM '06 Proceedings of the Sixth International Conference on Data Mining
The Google Similarity Distance

IEEE Transactions on Knowledge and Data Engineering
Generating summary keywords for emails using topics

Proceedings of the 13th international conference on Intelligent user interfaces
KP-Miner: A keyphrase extraction system for English and Arabic documents

Information Systems
Extracting key terms from noisy and multitheme documents

Proceedings of the 18th international conference on World wide web
CollabRank: towards a collaborative approach to single-document keyphrase extraction

COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
Graph-based keyword extraction for single-document summarization

MMIES '08 Proceedings of the Workshop on Multi-source Multilingual Information Extraction and Summarization
Single document keyphrase extraction using neighborhood knowledge

AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 2
Domain-specific keyphrase extraction

IJCAI'99 Proceedings of the 16th international joint conference on Artificial intelligence - Volume 2
Computing semantic relatedness using Wikipedia-based explicit semantic analysis

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Automatic hypertext keyphrase detection

IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence

SemEval-2010 task 5: Automatic keyphrase extraction from scientific articles

SemEval '10 Proceedings of the 5th International Workshop on Semantic Evaluation
Likey: Unsupervised language-independent keyphrase extraction

SemEval '10 Proceedings of the 5th International Workshop on Semantic Evaluation
SZTERGAK: Feature engineering for keyphrase extraction

SemEval '10 Proceedings of the 5th International Workshop on Semantic Evaluation
Automatic free-text-tagging of online news archives

Proceedings of the 2010 conference on ECAI 2010: 19th European Conference on Artificial Intelligence
Automatic keyphrase extraction via topic decomposition

EMNLP '10 Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
Conundrums in unsupervised keyphrase extraction: making sense of the state-of-the-art

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
Automatic keyphrase extraction by bridging vocabulary gap

CoNLL '11 Proceedings of the Fifteenth Conference on Computational Natural Language Learning
Unsupervised topic-oriented keyphrase extraction and its application to Croatian

TSD'11 Proceedings of the 14th international conference on Text, speech and dialogue
A simple word trigger method for social tag suggestion

EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Learning to extract coherent keyphrases from online news

AIRS'11 Proceedings of the 7th Asia conference on Information Retrieval Technology
Fast algorithm for affinity propagation

IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Three
Short-text domain specific key terms/phrases extraction using an n-gram model with wikipedia

Proceedings of the 21st ACM international conference on Information and knowledge management
DIKEA: domain-independent keyphrase extraction algorithm

AI'12 Proceedings of the 25th Australasian joint conference on Advances in Artificial Intelligence
Integrating semantic relatedness and words' intrinsic features for keyword extraction

IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Automatic keyphrase extraction from scientific articles

Language Resources and Evaluation

Quantified Score

Hi-index	0.00

Visualization

Abstract

Keyphrases are widely used as a brief summary of documents. Since manual assignment is time-consuming, various unsupervised ranking methods based on importance scores are proposed for keyphrase extraction. In practice, the keyphrases of a document should not only be statistically important in the document, but also have a good coverage of the document. Based on this observation, we propose an unsupervised method for keyphrase extraction. Firstly, the method finds exemplar terms by leveraging clustering techniques, which guarantees the document to be semantically covered by these exemplar terms. Then the keyphrases are extracted from the document using the exemplar terms. Our method outperforms sate-of-the-art graph-based ranking methods (TextRank) by 9.5% in F1-measure.