Clinical entity recognition using structural support vector machines with rich features

Authors:
Buzhou Tang;Hongxin Cao;Yonghui Wu;Min Jiang;Hua Xu
Affiliations:
Vanderbilt University, Nashville, TN, USA;Vanderbilt University, Nashville, TN, USA & Second Military Medical University, Shanghai, China;Vanderbilt University, Nashville, TN, USA;Vanderbilt University, Nashville, TN, USA;Vanderbilt University, Nashville, TN, USA
Venue:
Proceedings of the ACM sixth international workshop on Data and text mining in biomedical informatics
Year:
2012

Citing 12
Cited 3

Class-based n-gram models of natural language

Computational Linguistics
Representing text chunks

EACL '99 Proceedings of the ninth conference on European chapter of the Association for Computational Linguistics
Chunking with support vector machines

NAACL '01 Proceedings of the second meeting of the North American Chapter of the Association for Computational Linguistics on Language technologies
Large Margin Methods for Structured and Interdependent Output Variables

The Journal of Machine Learning Research
POSBIOTM---NER: a trainable biomedical named-entity recognition system

Bioinformatics
Use of support vector learning for chunk identification

ConLL '00 Proceedings of the 2nd workshop on Learning language in logic and the 4th conference on Computational natural language learning - Volume 7
Comparisons of sequence labeling algorithms and extensions

Proceedings of the 24th international conference on Machine learning
Conditional random fields and support vector machines for disorder named entity recognition in clinical texts

BioNLP '08 Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing
Cutting-plane training of structural SVMs

Machine Learning
Word representations: a simple and general method for semi-supervised learning

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Extracting named entities using support vector machines

KDLL'06 Proceedings of the 2006 international conference on Knowledge Discovery in Life Science Literature
Using an ensemble system to improve concept extraction from clinical records

Journal of Biomedical Informatics

DTMBIO 2012: international workshop on data and text mining in biomedical informatics

Proceedings of the 21st ACM international conference on Information and knowledge management
Exploring the effectiveness of medical entity recognition for clinical information retrieval

Proceedings of the 7th international workshop on Data and text mining in biomedical informatics
Unsupervised biomedical named entity recognition: Experiments with clinical and biological texts

Journal of Biomedical Informatics

Quantified Score

Hi-index	0.00

Visualization

Abstract

Named entity recognition (NER) is an important task for natural language processing (NLP) of clinical text. Conditional Random Fields (CRFs), a sequential labeling algorithm, and Support Vector Machines (SVMs), which is based on large margin theory, are two typical machine learning algorithms that have been widely applied to NER tasks, including clinical entity recognition. However, Structural Support Vector Machines (SSVMs), an algorithm that combines the advantages of both CRFs and SVMs, has not been investigated for clinical text processing. In this study, we applied the SSVMs algorithm to the Concept Extraction task of the 2010 i2b2 clinical NLP challenge, which was to recognize entities of medical problems, treatments, and tests from hospital discharge summaries. Using the same training (N = 27,837) and test (N = 45,009) sets in the challenge, our evaluation showed that the SSVMs-based NER system required less training time, while achieved better performance than the CRFs-based system for clinical entity recognition, when same features were used. Our study also demonstrated that rich features such as unsupervised word representations improved the performance of clinical entity recognition. When rich features were integrated with SSVMs, our system achieved a highest F-measure of 85.74% on the test set of 2010 i2b2 NLP challenge, which outperformed the best system reported in the challenge by 0.5%.