Prosody modeling with soft templates

Authors:
Greg Kochanski;Chilin Shih
Affiliations:
Bell Laboratories, Lucent Technologies, 600 Mountain Avenue, Murray Hill, NJ;Bell Laboratories, Lucent Technologies, 600 Mountain Avenue, Murray Hill, NJ
Venue:
Speech Communication
Year:
2003

Citing 2
Cited 6

SSML: a speech synthesis markup language

Speech Communication
Multilingual Text-to-Speech Synthesis

Multilingual Text-to-Speech Synthesis

Analysis and modeling of F0 contours for cantonese text-to-speech

ACM Transactions on Asian Language Information Processing (TALIP)
Applying data mining techniques to corpus based prosodic modeling

Speech Communication
A modular holistic approach to prosody modelling for Standard Yorùbá speech synthesis

Computer Speech and Language
Mechanisms of question intonation in mandarin

ISCSLP'06 Proceedings of the 5th international conference on Chinese Spoken Language Processing
Predicting utterance pitch targets in Yorùbá for tone realisation in speech synthesis

Speech Communication
Toward invariant functional representations of variable surface fundamental frequency contours: Synthesizing speech melody via model-based stochastic learning

Speech Communication

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper describes a novel prosody generation model. We intend it to broadly support many linguistic theories and multiple languages, for the model imposes no restriction on accent categories and shapes. This capability is crucial to the next generation of text-to-speech systems that will need to synthesize intonation variations for different speech acts, emotions, and styles of speech. The system supports mark-up tags that are mathematically defined and generate f0 deterministically. Underlying the tags is an articulatory model of accent interaction which balances physiological and communication constraints. We specify the model by way of an algorithm for calculating the pitch, and by way of examples. The model allows localized, linguistically reasonable tags, and is suitable for a data-driven fitting process.