Training conditional random fields with multivariate evaluation measures

  • Authors:
  • Jun Suzuki;Erik McDermott;Hideki Isozaki

  • Affiliations:
  • NTT Communication Science Laboratories, NTT Corp., Seika-cho, Soraku-gun, Kyoto, Japan;NTT Communication Science Laboratories, NTT Corp., Seika-cho, Soraku-gun, Kyoto, Japan;NTT Communication Science Laboratories, NTT Corp., Seika-cho, Soraku-gun, Kyoto, Japan

  • Venue:
  • ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper proposes a framework for training Conditional Random Fields (CRFs) to optimize multivariate evaluation measures, including non-linear measures such as F-score. Our proposed framework is derived from an error minimization approach that provides a simple solution for directly optimizing any evaluation measure. Specifically focusing on sequential segmentation tasks, i.e. text chunking and named entity recognition, we introduce a loss function that closely reflects the target evaluation measure for these tasks, namely, segmentation F-score. Our experiments show that our method performs better than standard CRF training.