Prediction of prosodic phrase boundaries in Chinese TTS based on conditional random fields and transformation based learning

  • Authors:
  • Ziping Zhao;Yaoting Zhu

  • Affiliations:
  • College of Computer and Information Engineering, Tianjin Normal University, Tianjin, China;College of Information Technical Science, Nankai University, Tianjin, China

  • Venue:
  • FSKD'09 Proceedings of the 6th international conference on Fuzzy systems and knowledge discovery - Volume 2
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Hierarchical prosody structure generation is a key component for a speech synthesis system. One major feature of the prosody of Mandarin Chinese speech flow is prosodic phrase grouping. In this paper we proposed an approach for prediction of Chinese prosodic phrase boundaries in unrestricted Chinese text, which combines Conditional Random Fields (CRFs) model and TBL model. First a CRFs model is trained to predict the prosodic phrase boundaries. After that we apply a TBL based error driven learning approach to amend the initial prediction. A comparison is conducted between the new model and HMM for prosodic phrase break prediction. Experiments show that the combined approach improves overall performance. The precision and recall ratio are improved.