Speech concatenation and synthesis using an overlap-add sinusoidal model

  • Authors:
  • M. W. Macon; M. A. Clements

  • Affiliations:
  • Sch. of Electr. & Comput. Eng., Georgia Inst. of Technol., Atlanta, GA, USA;Sch. of Electr. & Comput. Eng., Georgia Inst. of Technol., Atlanta, GA, USA

  • Venue:
  • ICASSP '96: Proceedings of the 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing - Volume 01
  • Year:
  • 1996

Abstract

In this paper, an algorithm for the concatenation of speech signal segments taken from disjoint utterances is presented. The algorithm is based on the analysis-by-synthesis/overlap-add (ABS/OLA) sinusoidal model, which is capable of performing high quality pitch- and time-scale modification of both speech and music signals. With the incorporation of concatenation and smoothing techniques, the model is capable of smoothing the transitions between separately-analyzed speech segments by matching the time- and frequency-domain characteristics of the signals at their boundaries. The application of these techniques in a text-to-speech system based on concatenation of diphone sinusoidal models is also presented.
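The boundary-smoothing idea can be illustrated with a generic overlap-add crossfade between two separately synthesized sinusoidal segments. This is a minimal sketch under stated assumptions, not the ABS/OLA algorithm from the paper: the function names (synth_frame, overlap_add_join), the linear crossfade window, the single-partial segments, and the simple phase-alignment step are all illustrative choices that the abstract does not specify.

```python
import math


def synth_frame(partials, n_samples, sample_rate):
    """Synthesize a sum of sinusoids.

    partials: list of (amplitude, frequency_hz, phase_rad) tuples
    (a hypothetical parameter layout chosen for this sketch).
    """
    return [
        sum(a * math.cos(2.0 * math.pi * f * n / sample_rate + p)
            for a, f, p in partials)
        for n in range(n_samples)
    ]


def overlap_add_join(left, right, overlap):
    """Join two sample lists, crossfading linearly over `overlap` samples.

    The two weights always sum to 1, so a signal that is identical on
    both sides of the boundary passes through unchanged.
    """
    if not 0 < overlap <= min(len(left), len(right)):
        raise ValueError("overlap must be in 1..min(len(left), len(right))")
    start = len(left) - overlap          # where the overlap region begins
    out = list(left[:start])
    for i in range(overlap):
        w = (i + 0.5) / overlap          # fade-in weight for the right segment
        out.append((1.0 - w) * left[start + i] + w * right[i])
    out.extend(right[overlap:])
    return out


# Illustrative values only: two segments with different pitch.
SAMPLE_RATE = 8000
FRAME_LEN = 400
OVERLAP = 80

left_freq, right_freq = 200.0, 220.0
left = synth_frame([(1.0, left_freq, 0.0)], FRAME_LEN, SAMPLE_RATE)

# Assumed alignment step: start the right segment at the phase the left
# segment has reached at the beginning of the overlap region, so the two
# waveforms agree at the boundary before the crossfade is applied.
boundary = FRAME_LEN - OVERLAP
right_phase = (2.0 * math.pi * left_freq * boundary / SAMPLE_RATE) % (2.0 * math.pi)
right = synth_frame([(1.0, right_freq, right_phase)], FRAME_LEN, SAMPLE_RATE)

joined = overlap_add_join(left, right, OVERLAP)
```

The joined signal has len(left) + len(right) - overlap samples. A real concatenative system would also need to match spectral envelope and pitch across the boundary, which this sketch does not attempt.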