Combining Supervised Learning Techniques to Key-Phrase Extraction for Biomedical Full-Text

Authors:
Min Song;Yanliang Qi;Suk-Chung Yoon;Lori deVersterre
Affiliations:
New Jersey Institute of Technology, USA;New Jersey Institute of Technology, USA;Widener University, USA;New Jersey Institute of Technology, USA
Venue:
International Journal of Intelligent Information Technologies
Year:
2011

Citing 12
Cited 3

Inductive learning algorithms and representations for text categorization

Proceedings of the seventh international conference on Information and knowledge management
Fast training of support vector machines using sequential minimal optimization

Advances in kernel methods
KEA: practical automatic keyphrase extraction

Proceedings of the fourth ACM conference on Digital libraries
Learning Algorithms for Keyphrase Extraction

Information Retrieval
The C-value/NC-value Method of Automatic Recognition for Multi-Word Terms

ECDL '98 Proceedings of the Second European Conference on Research and Advanced Technology for Digital Libraries
KPSpotter: a flexible information gain-based keyphrase extraction system

WIDM '03 Proceedings of the 5th ACM international workshop on Web information and data management
Narrative text classification for automatic key phrase extraction in web document corpora

Proceedings of the 7th annual ACM international workshop on Web information and data management
Thesaurus based automatic keyphrase indexing

Proceedings of the 6th ACM/IEEE-CS joint conference on Digital libraries
Investigating the Performance of Naive- Bayes Classifiers and K- Nearest Neighbor Classifiers

ICCIT '07 Proceedings of the 2007 International Conference on Convergence Information Technology
Automatic keyphrase extraction from scientific documents using N-gram filtration technique

Proceedings of the eighth ACM symposium on Document engineering
Iterative feature construction for improving inductive learning algorithms

Expert Systems with Applications: An International Journal
Domain-specific keyphrase extraction

IJCAI'99 Proceedings of the 16th international joint conference on Artificial intelligence - Volume 2

Wireless Sensor Node Placement Using Hybrid Genetic Programming and Genetic Algorithms

International Journal of Intelligent Information Technologies
A Modified Watershed Segmentation Method to Segment Renal Calculi in Ultrasound Kidney Images

International Journal of Intelligent Information Technologies
Low Dimensional Data Privacy Preservation Using Multi Layer Artificial Neural Network

International Journal of Intelligent Information Technologies

Quantified Score

Hi-index	0.00

Visualization

Abstract

Key-phrase extraction plays a useful a role in research areas of Information Systems IS like digital libraries. Short metadata like key phrases are beneficial for searchers to understand the concepts found in the documents. This paper evaluates the effectiveness of different supervised learning techniques on biomedical full-text: Sequential Minimal Optimization SMO and K-Nearest Neighbor, both of which could be embedded inside an information system for document search. The authors use these techniques to extract key phrases from PubMed and evaluate the performance of these systems using the holdout validation method. This paper compares different classifier techniques and performance differences between the full-text and it's abstract. Compared with the authors' previous work, which investigated the performance of Naïve Bayes, Linear Regression and SVMreg1/2, this paper finds that SVMreg-1 performs best in key-phrase extraction for full-text, whereas Naïve Bayes performs best for abstracts. These techniques should be considered for use in information system search functionality. Additional research issues also are identified.