MMR-based active machine learning for bio named entity recognition

  • Authors:
  • Seokhwan Kim;Yu Song;Kyungduk Kim;Jeong-Won Cha;Gary Geunbae Lee

  • Affiliations:
  • POSTECH, Pohang, Korea;AIA Information Technology Co., Ltd. Beijing, China;POSTECH, Pohang, Korea;Changwon National University, Changwon, Korea;POSTECH, Pohang, Korea

  • Venue:
  • NAACL-Short '06 Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Short Papers
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents a new active learning paradigm which considers not only the uncertainty of the classifier but also the diversity of the corpus. The two measures for uncertainty and diversity were combined using the MMR (Maximal Marginal Relevance) method to give the sampling scores in our active learning strategy. We incorporated MMR-based active machine-learning idea into the biomedical named-entity recognition system. Our experimental results indicated that our strategies for active-learning based sample selection could significantly reduce the human effort.