Prosodic boundary prediction based on maximum entropy model with error-driven modification

  • Authors:
  • Xiaonan Zhang;Jun Xu;Lianhong Cai

  • Affiliations:
  • Key Laboratory of Pervasive Computing (Tsinghua University), Ministry of Education, Beijing;Key Laboratory of Pervasive Computing (Tsinghua University), Ministry of Education, Beijing;Key Laboratory of Pervasive Computing (Tsinghua University), Ministry of Education, Beijing

  • Venue:
  • ISCSLP'06 Proceedings of the 5th international conference on Chinese Spoken Language Processing
  • Year:
  • 2006

Quantified Score

Hi-index 0.01

Visualization

Abstract

Prosodic boundary prediction is the key to improving the intelligibility and naturalness of synthetic speech for a TTS system. This paper investigated the problem of automatic segmentation of prosodic word and prosodic phrase, which are two fundamental layers in the hierarchical prosodic structure of Mandarin Chinese. Maximum Entropy (ME) Model was used at the front end for both prosodic word and prosodic phrase prediction, but with different feature selection schemes. A multi-pass prediction approach was adopted. Besides, an error-driven rule-based modification module was introduced into the back end to amend the initial prediction. Experiments showed that this combined approach outperformed many other methods like C4.5 and TBL.