Scalable biomedical Named Entity Recognition: investigation of a database-supported SVM approach

  • Authors:
  • Mona Soliman Habib;Jugal Kalita

  • Affiliations:
  • Cairo Microsoft Innovation Lab, 306 Korniche El-Nile, Maadi Cairo, Egypt.;Department of Computer Science, University of Colorado, 1420 Austin Bluffs Pkwy, Colorado Springs, CO 80918, USA

  • Venue:
  • International Journal of Bioinformatics Research and Applications
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper explores scalability issues associated with the Named Entity Recognition problem in the biomedical publications domain using Support Vector Machines. The performance results using existing binary and multi-class SVMs with increasing training data are compared to results obtained using our new implementations. Our approach eliminates prior language or domain-specific knowledge and achieves good out-of-the-box accuracy measures comparable to those obtained using more complex approaches. The training time of multi-class SVMs is reduced by several orders of magnitude, which would make support vector machines a more viable and practical solution for real-world problems with large datasets.