Bidirectional inference with the easiest-first strategy for tagging sequence data

  • Authors:
  • Yoshimasa Tsuruoka;Jun'ichi Tsujii

  • Affiliations:
  • CREST, JST (Japan Science and Technology Corporation), Saitama, Japan and University of Tokyo, Tokyo, Japan;University of Tokyo, Tokyo, Japan and University of Manchester, MANCHESTER, UK and CREST, JST (Japan Science and Technology Corporation), Saitama, Japan

  • Venue:
  • HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents a bidirectional inference algorithm for sequence labeling problems such as part-of-speech tagging, named entity recognition and text chunking. The algorithm can enumerate all possible decomposition structures and find the highest probability sequence together with the corresponding decomposition structure in polynomial time. We also present an efficient decoding algorithm based on the easiest-first strategy, which gives comparably good performance to full bidirectional inference with significantly lower computational cost. Experimental results of part-of-speech tagging and text chunking show that the proposed bidirectional inference methods consistently outperform unidirectional inference methods and bidirectional MEMMs give comparable performance to that achieved by state-of-the-art learning algorithms including kernel support vector machines.