Incorporating lexical knowledge into biomedical NE recognition

  • Authors:
  • Kyung-Mi Park;Seon-Ho Kim;Ki-Joong Lee;Do-Gil Lee;Hae-Chang Rim

  • Affiliations:
  • Korea University, Seoul, Korea;Korea University, Seoul, Korea;Korea University, Seoul, Korea;Korea University, Seoul, Korea;Korea University, Seoul, Korea

  • Venue:
  • JNLPBA '04 Proceedings of the International Joint Workshop on Natural Language Processing in Biomedicine and its Applications
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we propose a two-phase biomedical named entity(NE) recognition method based on SVMs. We first recognize biomedical terms, and then assign appropriate semantic classes to the recognized terms. In the two-phase NE recognition method, the performance of term recognition is critical to the overall performance of the system because term recognition errors can be propagated to the semantic classification phase. In this study, we try to improve the performance of term recognition by using lexical knowledge. We utilize salient NPs and collocations as lexical knowledge extracted from raw corpus. In addition, we use morphological knowledge extracted from training data as features for learning SVM classifiers. Experimental results show that our system obtains an F-measure of 62.97% on the test data, and that the performance can be improved upto 2.82% by using lexical knowledge.