POSBIOTM-NER in the shared task of BioNLP/NLPBA 2004

  • Authors:
  • Yu Song; Eunju Kim; Gary Geunbae Lee; Byoung-kee Yi

  • Affiliations:
  • Pohang University of Science and Technology (POSTECH), Pohang, Korea (all four authors)

  • Venue:
  • JNLPBA '04 Proceedings of the International Joint Workshop on Natural Language Processing in Biomedicine and its Applications
  • Year:
  • 2004

Abstract

Two classifiers, a Support Vector Machine (SVM) and Conditional Random Fields (CRFs), are applied to the recognition of biomedical named entities. Because the two classifiers have different characteristics, their results are merged to achieve better performance. We propose an automatic corpus expansion method for both the SVM and the CRFs to overcome the shortage of annotated training data. In addition, we incorporate a keyword-based post-processing step to deal with remaining problems, such as assigning an appropriate named entity tag to a word or phrase that contains parentheses.
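
The abstract does not say how the two classifiers' outputs are merged. As a purely illustrative sketch, one simple possibility is to convert each tagger's BIO tag sequence into entity spans, keep every span from one tagger, and add spans from the other only where they do not overlap. The function names, the choice of the CRF output as the primary tagger, and the overlap rule are all assumptions made for this example, not the authors' method.

```python
from typing import List, Tuple

# (start token index, end token index exclusive, entity type)
Span = Tuple[int, int, str]


def bio_to_spans(tags: List[str]) -> List[Span]:
    """Convert a BIO tag sequence into a list of entity spans."""
    spans: List[Span] = []
    start, label = None, None
    # A trailing "O" sentinel closes an entity that ends the sentence.
    for i, tag in enumerate(tags + ["O"]):
        continues = tag.startswith("I-") and label == tag[2:]
        if start is not None and not continues:
            spans.append((start, i, label))
            start, label = None, None
        if tag.startswith("B-"):
            start, label = i, tag[2:]
    return spans


def merge_spans(primary: List[Span], secondary: List[Span]) -> List[Span]:
    """Hypothetical merge rule: keep all primary spans, then add any
    secondary span that overlaps no primary span."""
    merged = list(primary)
    for s in secondary:
        if all(s[1] <= p[0] or s[0] >= p[1] for p in primary):
            merged.append(s)
    return sorted(merged)


# Made-up tagger outputs for a six-token sentence (assumed, not real data).
crf_tags = ["O", "B-protein", "O", "B-DNA", "I-DNA", "O"]
svm_tags = ["B-protein", "I-protein", "O", "O", "O", "B-cell_type"]

merged = merge_spans(bio_to_spans(crf_tags), bio_to_spans(svm_tags))
print(merged)
```

In this toy input, the SVM span that conflicts with a CRF span is discarded, while the SVM-only `cell_type` entity is added. A real system could instead weigh the two taggers by entity type or by confidence.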