Quantitative modeling of segmental duration

Authors:
Jan P. H. van Santen
Affiliations:
AT&T Bell Laboratories, Murray Hill, NJ
Venue:
HLT '93 Proceedings of the workshop on Human Language Technology
Year:
1993

Citing 4
Cited 1

From text to speech: the MITalk system

From text to speech: the MITalk system
Original Contribution: Optical character recognition by a neural network

Neural Networks
Contextual effects on vowel duration

Speech Communication
Exploring N-way tables with sums-of-products models

Journal of Mathematical Psychology

Modeling Phone Duration of Lithuanian by Classification and Regression Trees, using Very Large Speech Corpus

Informatica

Quantified Score

Hi-index	0.00

Visualization

Abstract

In natural speech, durations of phonetic segments are strongly dependent on contextual factors. Quantitative descriptions of these contextual effects have applications in text-to-speech synthesis and in automatic speech recognition. In this paper, we describe a speaker-dependent system for predicting segmental duration from text, with emphasis on the statistical methods used for its construction. We also report results of a subjective listening experiment evaluating an implementation of this system for text-to-speech synthesis purposes.