Recognizing biomedical named entities using SVMs: improving recognition performance with a minimal set of features

Authors:
Nazife Dimililer;Ekrem Varoğlu
Affiliations:
Computer Engineering Department, Eastern Mediterranean University, Turkey;Computer Engineering Department, Eastern Mediterranean University, Turkey
Venue:
KDLL'06 Proceedings of the 2006 international conference on Knowledge Discovery in Life Science Literature
Year:
2006

Citing 10
Cited 4

Foundations of statistical natural language processing

Foundations of statistical natural language processing
Information Retrieval

Information Retrieval
Pattern Classification (2nd Edition)

Pattern Classification (2nd Edition)
Comparison of character-level and part of speech features for name recognition in biomedical texts

Journal of Biomedical Informatics - Special issue: Named entity recognition in biomedicine
Chunking with support vector machines

NAACL '01 Proceedings of the second meeting of the North American Chapter of the Association for Computational Linguistics on Language technologies
Recognizing names in biomedical texts: a machine learning approach

Bioinformatics
Named entity recognition using hundreds of thousands of features

CONLL '03 Proceedings of the seventh conference on Natural language learning at HLT-NAACL 2003 - Volume 4
The GENIA corpus: an annotated research abstract corpus in molecular biology domain

HLT '02 Proceedings of the second international conference on Human Language Technology Research
Exploiting context for biomedical entity recognition: from syntax to the web

JNLPBA '04 Proceedings of the International Joint Workshop on Natural Language Processing in Biomedicine and its Applications
Exploring deep knowledge resources in biomedical name recognition

JNLPBA '04 Proceedings of the International Joint Workshop on Natural Language Processing in Biomedicine and its Applications

Vote-Based Classifier Selection for Biomedical NER Using Genetic Algorithms

IbPRIA '07 Proceedings of the 3rd Iberian conference on Pattern Recognition and Image Analysis, Part II
@Note: A workbench for Biomedical Text Mining

Journal of Biomedical Informatics
Classifier subset selection for biomedical named entity recognition

Applied Intelligence
A generic classifier-ensemble approach for biomedical named entity recognition

PAKDD'12 Proceedings of the 16th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part I

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, Support Vector Machines (SVMs) are applied to the identification and automatic annotation of biomedical named entities in the domain of molecular biology, as an extension of the traditional named entity recognition task to special domains. The effect of the use of well-known features such as word formation patterns, lexical, morphological, and surface words on recognition performance is investigated. Experiments have been conducted using the train and test data made public at the Bio-Entity Recognition Task at JNLPBA 2004. An F-score of 69.87% was obtained by using a carefully selected combination of a minimal set of features, which can be easily computed from training data without any use of post-processing or external resources.