Training Global Linear Models for Chinese Word Segmentation

  • Authors:
  • Dong Song;Anoop Sarkar

  • Affiliations:
  • School of Computing Science, Simon Fraser University, Burnaby, Canada V5A1S6;School of Computing Science, Simon Fraser University, Burnaby, Canada V5A1S6

  • Venue:
  • Canadian AI '09 Proceedings of the 22nd Canadian Conference on Artificial Intelligence: Advances in Artificial Intelligence
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper examines how one can obtain state of the art Chinese word segmentation using global linear models. We provide experimental comparisons that give a detailed road-map for obtaining state of the art accuracy on various datasets. In particular, we compare the use of reranking with full beam search; we compare various methods for learning weights for features that are full sentence features, such as language model features; and, we compare an Averaged Perceptron global linear model with the Exponentiated Gradient max-margin algorithm.