Semi supervised learning for prediction of prosodic phrase boundaries in chinese TTS using conditional random fields

  • Authors:
  • Ziping Zhao;Xirong Ma;Weidong Pei

  • Affiliations:
  • College of Computer and Information Engineering, College of Computer and Information Engineering, Tianjin, China;College of Computer and Information Engineering, College of Computer and Information Engineering, Tianjin, China;College of Computer and Information Engineering, College of Computer and Information Engineering, Tianjin, China

  • Venue:
  • ISNN'11 Proceedings of the 8th international conference on Advances in neural networks - Volume Part II
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Hierarchical prosody structure generation is a key component for a speech synthesis system. One major feature of the prosody of Mandarin Chinese speech flow is prosodic phrase grouping. In this paper we proposed an approach for prediction of Chinese prosodic phrase boundaries from a limited amount of labeled training examples and some amount of unlabeled data using conditional random fields. Some useful unlabeled data are chosen based on the assigned labels and the prediction probabilities of the current learned model. The useful unlabeled data is then exploited to improve the learning. Experiments show that the approach improves overall performance. The precision and recall ratio are improved.