Pitch targets and their realization: evidence from Mandarin Chinese
Speech Communication
Prosodic boundary prediction based on maximum entropy model with error-driven modification
ISCSLP'06 Proceedings of the 5th international conference on Chinese Spoken Language Processing
Hi-index | 0.00 |
Emotion helps human to express their feelings and intentions clearly. And the emphasis labels of speeches are the key of speech emotion analysis and synthesis. In order to label the emotion emphasis of speech samples from a corpus with only phonetic and prosodic information, this paper introduces an automatic labeling algorithm by measuring the prosody generation error (PGE) of the result from a statistical synthesizer. Classification and Regression Tree (CART) and Maximum Entropy (ME) modeling are adopted for automatically labeling. Experiment shows that both models are helpful for labeling.