The application of kalman filter based human-computer learning model to chinese word segmentation

  • Authors:
  • Weimeng Zhu;Ni Sun;Xiaojun Zou;Junfeng Hu

  • Affiliations:
  • School of Electronics Engineering & Computer Science, Peking University, Beijing, P.R. China;Key Laboratory of Computational Linguistics, Ministry of Education, Peking University, Beijing, P.R. China;Key Laboratory of Computational Linguistics, Ministry of Education, Peking University, Beijing, P.R. China;School of Electronics Engineering & Computer Science, Peking University, Beijing, P.R. China,Key Laboratory of Computational Linguistics, Ministry of Education, Peking University, Beijing, P.R ...

  • Venue:
  • CICLing'13 Proceedings of the 14th international conference on Computational Linguistics and Intelligent Text Processing - Volume Part I
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents a human-computer interaction learning model for segmenting Chinese texts depending upon neither lexicon nor any annotated corpus. It enables users to add language knowledge to the system by directly intervening the segmentation process. Within limited times of user intervention, a segmentation result that fully matches the use (or with an accurate rate of 100% by manual judgement) is returned. A Kalman filter based model is adopted to learn and estimate the intention of users quickly and precisely from their interventions to reduce system prediction error hereafter. Experiments show that it achieves an encouraging performance in saving human effort and the segmenter with knowledge learned from users outperforms the baseline model by about 10% in segmenting homogenous texts.