A supervised approach for gene mention detection

  • Authors: Sriparna Saha; Asif Ekbal; Sanchita Saha

  • Affiliations: Department of Computer Science and Engineering, Indian Institute of Technology Patna, India; Department of Computer Science and Engineering, Indian Institute of Technology Patna, India; Haldia Institute of Technology, India

  • Venue: SEMCCO'11 Proceedings of the Second International Conference on Swarm, Evolutionary, and Memetic Computing - Volume Part I
  • Year: 2011

Abstract

Named Entity Recognition and Classification (NERC) is one of the most fundamental tasks in biomedical information extraction. Gene mention detection is the extraction of named entities (NEs) that refer to genes and gene products in text. Several approaches have emerged, but most state-of-the-art work suggests that an individual NERC system cannot cover entity representations with an arbitrary set of features and therefore cannot achieve the best performance. In this paper, we propose a voted approach for gene mention detection. We use a support vector machine (SVM) as the underlying classifier and build different models of it, each based on a different representation of the feature set. An important property of these features is that they are identified and selected largely without using any domain knowledge. Evaluation on the GENTAG benchmark dataset yields state-of-the-art performance, with overall recall, precision, and F-measure values of 94.95%, 94.32%, and 94.63%, respectively.
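
The abstract does not say how the votes of the individual SVM models are combined, so the following is only a minimal sketch of one plausible scheme: a per-token plurality vote over the label sequences produced by several models. The function names, the `B-GENE`/`I-GENE`/`O` tag set, the tie-breaking rule, and the example predictions are all assumptions made for illustration and are not taken from the paper. The small `f_measure` helper simply checks that the reported F-measure is the harmonic mean of the reported precision and recall.

```python
from collections import Counter


def majority_vote(predictions):
    """Combine per-token label sequences from several models by plurality vote.

    predictions: list of label sequences, one per model, all the same length.
    Ties go to the label proposed by the earliest model in the list
    (an assumed tie-breaking rule, not one described in the abstract).
    """
    if not predictions:
        raise ValueError("need predictions from at least one model")
    length = len(predictions[0])
    if any(len(p) != length for p in predictions):
        raise ValueError("all models must label the same tokens")

    voted = []
    for labels in zip(*predictions):
        counts = Counter(labels)
        top = max(counts.values())
        # Pick the first label, in model order, that reaches the top count.
        voted.append(next(label for label in labels if counts[label] == top))
    return voted


def f_measure(precision, recall):
    """Balanced F-measure: harmonic mean of precision and recall."""
    return 2 * precision * recall / (precision + recall)


# Hypothetical outputs of three models trained on different feature
# representations, for the tokens: the / p53 / gene / is / mutated
model_a = ["O", "B-GENE", "I-GENE", "O", "O"]
model_b = ["O", "B-GENE", "O", "O", "O"]
model_c = ["O", "B-GENE", "I-GENE", "O", "B-GENE"]

combined = majority_vote([model_a, model_b, model_c])
reported_f = round(f_measure(94.32, 94.95), 2)
```

In this toy example the third token keeps the `I-GENE` tag because two of the three models agree on it, while the spurious `B-GENE` on the last token is outvoted. A weighted vote, for instance one weighted by each model's held-out F-measure, would be a natural variant, but whether the paper uses one is not stated in the abstract.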