Efficient inference of CRFs for large-scale natural language data

  • Authors:
  • Minwoo Jeong;Chin-Yew Lin;Gary Geunbae Lee

  • Affiliations:
  • Pohang University of Science & Technology, Pohang, Korea;Microsoft Research Asia, Beijing, China;Pohang University of Science & Technology, Pohang, Korea

  • Venue:
  • ACLShort '09 Proceedings of the ACL-IJCNLP 2009 Conference Short Papers
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents an efficient inference algorithm of conditional random fields (CRFs) for large-scale data. Our key idea is to decompose the output label state into an active set and an inactive set in which most unsupported transitions become a constant. Our method unifies two previous methods for efficient inference of CRFs, and also derives a simple but robust special case that performs faster than exact inference when the active sets are sufficiently small. We demonstrate that our method achieves dramatic speedup on six standard natural language processing problems.