Semi-supervised active learning for sequence labeling

  • Authors:
  • Katrin Tomanek;Udo Hahn

  • Affiliations:
  • Jena University Language & Information Engineering (Julie) Lab, Friedrich-Schiller-Universitäät Jena, Germany;Jena University Language & Information Engineering (Julie) Lab, Friedrich-Schiller-Universitäät Jena, Germany

  • Venue:
  • ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2 - Volume 2
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

While Active Learning (AL) has already been shown to markedly reduce the annotation efforts for many sequence labeling tasks compared to random selection, AL remains unconcerned about the internal structure of the selected sequences (typically, sentences). We propose a semi-supervised AL approach for sequence labeling where only highly uncertain subsequences are presented to human annotators, while all others in the selected sequences are automatically labeled. For the task of entity recognition, our experiments reveal that this approach reduces annotation efforts in terms of manually labeled tokens by up to 60% compared to the standard, fully supervised AL scheme.