Improved automatic keyword extraction given more linguistic knowledge

Authors:
Anette Hulth
Affiliations:
Stockholm University, Sweden
Venue:
EMNLP '03 Proceedings of the 2003 conference on Empirical methods in natural language processing
Year:
2003

Citing 6
Cited 61

Lexical analysis and stoplists

Information retrieval
Bagging predictors

Machine Learning
Learning Algorithms for Keyphrase Extraction

Information Retrieval
Domain-Specific Keyphrase Extraction

IJCAI '99 Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence
Applications of term identification technology: domain description and content characterisation

Natural Language Engineering
Towards automatic extraction of monolingual and bilingual terminology

COLING '94 Proceedings of the 15th conference on Computational linguistics - Volume 1

Finding advertising keywords on web pages

Proceedings of the 15th international conference on World Wide Web
A study on automatically extracted keywords in text categorization

ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Wikify!: linking documents to encyclopedic knowledge

Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Leveraging context in user-centric entity detection systems

Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Using the wisdom of the crowds for keyword generation

Proceedings of the 17th international conference on World Wide Web
Discovering key concepts in verbose queries

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Mining knowledge from natural language texts using fuzzy associated concept mapping

Information Processing and Management: an International Journal
Using tag semantic network for keyphrase extraction in blogs

Proceedings of the 17th ACM conference on Information and knowledge management
KP-Miner: A keyphrase extraction system for English and Arabic documents

Information Systems
Estimating the impressionrank of web pages

Proceedings of the 18th international conference on World wide web
Review-oriented metadata enrichment: a case study

Proceedings of the 9th ACM/IEEE-CS joint conference on Digital libraries
Detecting multiple facets of an event using graph-based unsupervised methods

COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
CollabRank: towards a collaborative approach to single-document keyphrase extraction

COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
A probabilistic framework for automatic term recognition

Intelligent Data Analysis
Enhancing linguistically oriented automatic keyword extraction

HLT-NAACL-Short '04 Proceedings of HLT-NAACL 2004: Short Papers
Combining Statistical Machine Learning Models to Extract Keywords from Chinese Documents

ADMA '09 Proceedings of the 5th International Conference on Advanced Data Mining and Applications
Single document keyphrase extraction using neighborhood knowledge

AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 2
Unsupervised approaches for automatic keyword extraction using meeting transcripts

NAACL '09 Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Clustering to find exemplar terms for keyphrase extraction

EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 1 - Volume 1
Exploiting neighborhood knowledge for single document summarization and keyphrase extraction

ACM Transactions on Information Systems (TOIS)
Evaluating verbose query processing techniques

Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
Automatic generation of personalized annotation tags for Twitter users

HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
SemEval-2010 task 5: Automatic keyphrase extraction from scientific articles

SemEval '10 Proceedings of the 5th International Workshop on Semantic Evaluation
Likey: Unsupervised language-independent keyphrase extraction

SemEval '10 Proceedings of the 5th International Workshop on Semantic Evaluation
Eddi: interactive topic-based browsing of social status streams

UIST '10 Proceedings of the 23nd annual ACM symposium on User interface software and technology
Automatic keyphrase extraction via topic decomposition

EMNLP '10 Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
Pattern based keyword extraction for contextual advertising

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
SemanticRank: ranking keywords and sentences using semantic graphs

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
Keyphrases extraction from scientific documents: improving machine learning approaches with natural language processing

ICADL'10 Proceedings of the role of digital libraries in a time of global change, and 12th international conference on Asia-Pacific digital libraries
Advertising keywords extraction from web pages

WISM'10 Proceedings of the 2010 international conference on Web information systems and mining
Conundrums in unsupervised keyphrase extraction: making sense of the state-of-the-art

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
Unsupervised extraction of keywords from news archives

LTC'09 Proceedings of the 4th conference on Human language technology: challenges for computer science and linguistics
Automatic keyphrase extraction by bridging vocabulary gap

CoNLL '11 Proceedings of the Fifteenth Conference on Computational Natural Language Learning
Autonomous and adaptive identification of topics in unstructured text

KES'11 Proceedings of the 15th international conference on Knowledge-based and intelligent information and engineering systems - Volume Part II
Usage pattern recognition in student activities

EC-TEL'11 Proceedings of the 6th European conference on Technology enhanced learning: towards ubiquitous learning
Keyphrase extraction in biomedical publications using mesh and intraphrase word co-occurrence information

Proceedings of the ACM fifth international workshop on Data and text mining in biomedical informatics
Language technology for elearning

EC-TEL'06 Proceedings of the First European conference on Technology Enhanced Learning: innovative Approaches for Learning and Knowledge Sharing
Extracting search-focused key n-grams for relevance ranking in web search

Proceedings of the fifth ACM international conference on Web search and data mining
Automatic extraction and learning of keyphrases from scientific articles

CICLing'05 Proceedings of the 6th international conference on Computational Linguistics and Intelligent Text Processing
Improving persian text classification using persian thesaurus

CIARP'11 Proceedings of the 16th Iberoamerican Congress conference on Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications
Learning to extract coherent keyphrases from online news

AIRS'11 Proceedings of the 7th Asia conference on Information Retrieval Technology
Extracting keyphrase set with high diversity and coverage using structural SVM

APWeb'12 Proceedings of the 14th Asia-Pacific international conference on Web Technologies and Applications
Using Wikipedia concepts and frequency in language to extract key terms from support documents

Expert Systems with Applications: An International Journal
“Without the clutter of unimportant words”: Descriptive keyphrases for text visualization

ACM Transactions on Computer-Human Interaction (TOCHI)
Generating queries from user-selected text

Proceedings of the 4th Information Interaction in Context Symposium
Measuring comparability of documents in non-parallel corpora for efficient extraction of (semi-)parallel translation equivalents

EACL 2012 Proceedings of the Joint Workshop on Exploiting Synergies between Information Retrieval and Machine Translation (ESIRMT) and Hybrid Approaches to Machine Translation (HyTra)
Keyphrase extraction through query performance prediction

Journal of Information Science
Automatic keyword extraction from single-sentence natural language queries

PRICAI'12 Proceedings of the 12th Pacific Rim international conference on Trends in Artificial Intelligence
Key action extraction for learning analytics

EC-TEL'12 Proceedings of the 7th European conference on Technology Enhanced Learning
DIKEA: domain-independent keyphrase extraction algorithm

AI'12 Proceedings of the 25th Australasian joint conference on Advances in Artificial Intelligence
LinkedVis: exploring social and semantic career recommendations

Proceedings of the 2013 international conference on Intelligent user interfaces
NE-Rank: A Novel Graph-Based Keyphrase Extraction in Twitter

WI-IAT '12 Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 01
Enhancing biomedical concept extraction using semantic relationship weights

International Journal of Data Mining and Bioinformatics
Topic hierarchy construction for the organization of multi-source user generated contents

Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
Identifying salient entities in web pages

Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Leveraging the citation graph to recommend keywords

Proceedings of the 7th ACM conference on Recommender systems
Detecting topic labels for tweets by matching features from pseudo-relevance feedback

AusDM '12 Proceedings of the Tenth Australasian Data Mining Conference - Volume 134
Integrating semantic relatedness and words' intrinsic features for keyword extraction

IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Automatic keyphrase extraction from scientific articles

Language Resources and Evaluation
Topic segmentation and labeling in asynchronous conversations

Journal of Artificial Intelligence Research
Contextual keyword extraction by building sentences with crowdsourcing

Multimedia Tools and Applications

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, experiments on automatic extraction of keywords from abstracts using a supervised machine learning algorithm are discussed. The main point of this paper is that by adding linguistic knowledge to the representation (such as syntactic features), rather than relying only on statistics (such as term frequency and n-grams), a better result is obtained as measured by keywords previously assigned by professional indexers. In more detail, extracting NP-chunks gives a better precision than n-grams, and by adding the PoS tag(s) assigned to the term as a feature, a dramatic improvement of the results is obtained, independent of the term selection approach applied.