Prosody modeling for Mandarin exclamatory speech

Authors:
Huibin Jia;Jianhua Tao
Affiliations:
National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences;National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences
Venue:
ICME'09: Proceedings of the 2009 IEEE International Conference on Multimedia and Expo
Year:
2009

Abstract

To improve the expressiveness of speech synthesis, this paper proposes a model that simulates the prosodic features of exclamatory speech with modal tags. Comparing this kind of exclamatory speech with read speech, we found that the major difference between the two is caused by strong stress on certain modal words and its heavy impact on adjacent speech units. A CART-based prosody transformation model is then introduced to automatically generate the prosodic features of exclamatory speech, using read speech as the baseline. Final perception and comparison experiments show that the model simulates this kind of exclamatory speech with high quality.
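
As a rough illustration of what a CART-based prosody transformation could look like, the sketch below grows a small regression tree in pure Python and uses it to scale a baseline F0 value. It is not the authors' implementation: the feature set (a modal-word flag and a syllable's distance to the nearest modal word), the F0-ratio target, the toy numbers, and all function names are assumptions made only for this example.

```python
# Illustrative sketch only: a tiny CART-style regression tree in pure Python.
# Features, targets, and numbers below are hypothetical, not from the paper.

def sse(values):
    """Sum of squared errors around the mean (0.0 for an empty list)."""
    if not values:
        return 0.0
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values)


def best_split(rows, targets):
    """Return the (feature_index, threshold) that most reduces SSE, or None."""
    best, best_cost = None, sse(targets)
    for f in range(len(rows[0])):
        for threshold in sorted({r[f] for r in rows}):
            left = [t for r, t in zip(rows, targets) if r[f] <= threshold]
            right = [t for r, t in zip(rows, targets) if r[f] > threshold]
            if not left or not right:
                continue
            cost = sse(left) + sse(right)
            if cost < best_cost - 1e-12:  # require a strict improvement
                best, best_cost = (f, threshold), cost
    return best


def build_tree(rows, targets, depth=0, max_depth=3):
    """Recursively grow the tree; each leaf stores the mean target value."""
    split = best_split(rows, targets) if depth < max_depth else None
    if split is None:
        return sum(targets) / len(targets)
    f, threshold = split
    left = [(r, t) for r, t in zip(rows, targets) if r[f] <= threshold]
    right = [(r, t) for r, t in zip(rows, targets) if r[f] > threshold]
    return {
        "feature": f,
        "threshold": threshold,
        "left": build_tree([r for r, _ in left], [t for _, t in left],
                           depth + 1, max_depth),
        "right": build_tree([r for r, _ in right], [t for _, t in right],
                            depth + 1, max_depth),
    }


def predict(tree, row):
    """Walk from the root to a leaf and return the leaf's mean value."""
    while isinstance(tree, dict):
        go_left = row[tree["feature"]] <= tree["threshold"]
        tree = tree["left"] if go_left else tree["right"]
    return tree


# Hypothetical per-syllable features: (is_modal_word, distance_to_modal_word).
FEATURES = [(1, 0), (1, 0), (0, 1), (0, 1), (0, 2), (0, 3), (0, 4)]
# Hypothetical targets: ratio of exclamatory F0 to read-speech F0.
F0_RATIO = [1.40, 1.36, 1.20, 1.16, 1.05, 1.00, 1.00]

TREE = build_tree(FEATURES, F0_RATIO)


def transform_f0(read_f0_hz, features):
    """Scale a read-speech F0 value by the ratio the tree predicts."""
    return read_f0_hz * predict(TREE, features)
```

With this toy data the tree first separates modal-word syllables from the rest and then splits on distance, so the predicted scaling decays as a syllable lies farther from the modal word. A real system would presumably train on measured prosodic parameters from parallel read and exclamatory recordings, with a much richer context feature set.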