Punctuation as implicit annotations for Chinese word segmentation

  • Authors:
  • Zhongguo Li;Maosong Sun

  • Venue:
  • Computational Linguistics
  • Year:
  • 2009

Abstract

We present a Chinese word segmentation model learned from punctuation marks, which are perfect word delimiters. The learning is aided by a manually segmented corpus. Our method is considerably more effective than previous methods in unknown word recognition. This is a step toward addressing one of the toughest problems in Chinese word segmentation.
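
To illustrate why punctuation can serve as an implicit annotation, the sketch below collects the characters that unsegmented text guarantees to be word-initial or word-final because they sit next to a punctuation mark. It is a minimal illustration of the idea stated in the abstract, not the authors' model: the function name `implicit_boundary_chars` and the punctuation set `PUNCTUATION` are assumptions made for this example.

```python
import re

# Assumption: a small illustrative set of Chinese and ASCII punctuation marks.
PUNCTUATION = "，。！？；：、,.!?;:"


def implicit_boundary_chars(text):
    """Return (word_initial, word_final) characters implied by punctuation.

    A character directly after a punctuation mark (or at the start of the
    text) must begin a word; a character directly before a punctuation mark
    (or at the end of the text) must end a word.
    """
    # Split on runs of punctuation or whitespace and drop empty pieces.
    pattern = "[" + re.escape(PUNCTUATION) + r"\s]+"
    segments = [s for s in re.split(pattern, text) if s]
    word_initial = [s[0] for s in segments]
    word_final = [s[-1] for s in segments]
    return word_initial, word_final


if __name__ == "__main__":
    initial, final = implicit_boundary_chars("我喜欢北京，你喜欢上海。")
    print(initial)  # ['我', '你']
    print(final)    # ['京', '海']
```

Examples harvested this way require no manual segmentation, which is presumably why they can complement a manually segmented corpus; how the two sources are combined is not described in the abstract and is not modeled here.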