Chinese prosodic phrasing with the source-channel modela

  • Authors:
  • Honghui Dong;Yong Qin;Limin Jia

  • Affiliations:
  • State Key Laboratory of Rail Traffic Control and Safety, Beijing Jiaotong University, Beijing;State Key Laboratory of Rail Traffic Control and Safety, Beijing Jiaotong University, Beijing;State Key Laboratory of Rail Traffic Control and Safety, Beijing Jiaotong University, Beijing

  • Venue:
  • CCDC'09 Proceedings of the 21st annual international conference on Chinese control and decision conference
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

The prosodic phrasing is a classic problem in nature language process, which is not only useful for text-to-speech (TTS), but for speech recognition, statistic machine learning etc.. This paper introduces and discusses the sourcechannel model for Chinese prosodic phrasing. Based on the basic idea, the Hidden Markov Model (HMM) and the improved source-channel model are both used to describe the phrasing problem. In the improved source-channel model, maximum entropy model is used, and the discriminative training is introduced. And the rhythm model is proposed to describe the property of the utterance. The phrase-length model and the foot-pattern model both are used to describe the rhythm model, respectively. The experiments show that this approach achieved a good performance for prosodic phrasing. The improved source-channel model achieve a better performance than the Hidden Markov Model. And the foot-pattern model is the better one as a rhythm model.