From text to speech: the MITalk system
From text to speech: the MITalk system
Contextual effects on vowel duration
Speech Communication
Exploring N-way tables with sums-of-products models
Journal of Mathematical Psychology
Hi-index | 0.00 |
In natural speech, durations of phonetic segments are strongly dependent on contextual factors. Quantitative descriptions of these contextual effects have applications in text-to-speech synthesis and in automatic speech recognition. In this paper, we describe a speaker-dependent system for predicting segmental duration from text, with emphasis on the statistical methods used for its construction. We also report results of a subjective listening experiment evaluating an implementation of this system for text-to-speech synthesis purposes.