Introduction to CKIP Chinese word segmentation system for the first international Chinese Word Segmentation Bakeoff

  • Authors:
  • Wei-Yun Ma;Keh-Jiann Chen

  • Affiliations:
  • Institute of Information science, Academia Sinica;Institute of Information science, Academia Sinica

  • Venue:
  • SIGHAN '03 Proceedings of the second SIGHAN workshop on Chinese language processing - Volume 17
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we roughly described the procedures of our segmentation system, including the methods for resolving segmentation ambiguities and identifying unknown words. The CKIP group of Academia Sinica participated in testing on open and closed tracks of Beijing University (PK) and Hong Kong Cityu (HK). The evaluation results show our system performs very well in either HK open track or HK closed track and just acceptable in PK tracks. Some explanations and analysis are presented in this paper.