Speech concatenation and synthesis using an overlap-add sinusoidal model

Authors:
M. W. Macon; M. A. Clements
Affiliations:
Sch. of Electr. & Comput. Eng., Georgia Inst. of Technol., Atlanta, GA, USA; Sch. of Electr. & Comput. Eng., Georgia Inst. of Technol., Atlanta, GA, USA
Venue:
ICASSP '96: Proceedings of the 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing - Volume 01
Year:
1996

Citing 0
Cited 1

Rhythm Speech Lyrics Input for MIDI-Based Singing Voice Synthesis

PCM '09 Proceedings of the 10th Pacific Rim Conference on Multimedia: Advances in Multimedia Information Processing

Abstract

This paper presents an algorithm for concatenating speech signal segments taken from disjoint utterances. The algorithm is based on the analysis-by-synthesis/overlap-add (ABS/OLA) sinusoidal model, which can perform high-quality pitch-scale and time-scale modification of both speech and music signals. With concatenation and smoothing techniques incorporated, the model smooths the transitions between separately analyzed speech segments by matching the time-domain and frequency-domain characteristics of the signals at their boundaries. The paper also presents an application of these techniques in a text-to-speech system based on the concatenation of diphone sinusoidal models.
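
To make the overlap-add idea in the abstract more concrete, the following is a minimal sketch of generic overlap-add synthesis from sinusoidal frames. It is not the authors' ABS/OLA algorithm: it omits the analysis-by-synthesis parameter estimation and the boundary-matching steps the abstract mentions, and simply cross-fades two frames. The function names (`synth_frame`, `overlap_add`), the triangular window, and all parameter values are assumptions chosen for illustration.

```python
import math


def synth_frame(partials, length, sample_rate):
    """Synthesize one frame as a sum of sinusoids.

    partials: list of (amplitude, frequency_hz, phase_rad) tuples (illustrative format).
    """
    return [
        sum(a * math.cos(2.0 * math.pi * f * n / sample_rate + p) for a, f, p in partials)
        for n in range(length)
    ]


def overlap_add(frames, hop):
    """Overlap-add frames of length 2 * hop with a triangular window."""
    length = 2 * hop
    # Triangular window chosen so that w[n] + w[n + hop] == 1:
    # where two frames overlap, their weights sum to unity gain.
    window = [1.0 - abs(n - hop) / hop for n in range(length)]
    out = [0.0] * (hop * (len(frames) + 1))
    for k, frame in enumerate(frames):
        for n in range(length):
            out[k * hop + n] += window[n] * frame[n]
    return out


# Hypothetical parameters: the last frame of one segment and the
# first frame of another, taken from different utterances.
sample_rate = 8000
hop = 80
left = [(1.0, 100.0, 0.0)]
right = [(0.5, 120.0, 0.0)]

frames = [
    synth_frame(left, 2 * hop, sample_rate),
    synth_frame(right, 2 * hop, sample_rate),
]
signal = overlap_add(frames, hop)
```

In the region where the two frames overlap, the output is a weighted cross-fade from the `left` parameters to the `right` ones. A system of the kind the abstract describes would additionally adjust the segments so that their time-domain and frequency-domain characteristics agree at the boundary; that step is not attempted here.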