Boosting the protein name recognition performance by bootstrapping on selected text

  • Authors:
  • Yue Wang;Jin-Dong Kim

  • Affiliations:
  • Database Center for Life Science, Research Organization of Information and Systems, Yayoi, Bunkyo-ku, Tokyo, Japan;Database Center for Life Science, Research Organization of Information and Systems, Yayoi, Bunkyo-ku, Tokyo, Japan

  • Venue:
  • BioNLP '12 Proceedings of the 2012 Workshop on Biomedical Natural Language Processing
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

When only a small amount of manually annotated data is available, application of a bootstrapping method is often considered to compensate for the lack of sufficient training material for a machine-learning method. The paper reports a series of experimental results of bootstrapping for protein name recognition. The results show that the performance changes significantly according to the choice of text collection where the training samples to bootstrap, and that an improvement can be obtained only with a well chosen text collection.