An overlap-add technique based on waveform similarity (WSOLA) for high quality time-scale modification of speech

  • Authors:
  • Werner Verhelst; Marc Roelands

  • Affiliations:
  • Vrije Universiteit Brussel, Faculty of Applied Science, Dept. ETRO/DSSP, Brussels, Belgium; Vrije Universiteit Brussel, Faculty of Applied Science, Dept. ETRO/DSSP, Brussels, Belgium

  • Venue:
  • ICASSP'93 Proceedings of the 1993 IEEE international conference on Acoustics, speech, and signal processing: speech processing - Volume II
  • Year:
  • 1993

Abstract

A concept of waveform similarity is proposed for tackling the problem of time-scale modification of speech, and is worked out in the context of short-time Fourier transform representations. The resulting WSOLA algorithm produces high-quality speech output, is algorithmically and computationally efficient and robust, and allows for on-line processing with arbitrary time-scaling factors that may be specified in a time-varying fashion and that can be chosen over a wide continuous range of values.
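
To make the idea in the abstract concrete, the following is a minimal sketch of a waveform-similarity overlap-add loop. It is not taken from the paper. The function names (`hann`, `wsola`), the parameters (`alpha`, `win_len`, `tol`), the Hann window with 50% overlap, and the squared-difference similarity measure are all assumptions chosen for illustration. It handles only a constant scaling factor, whereas the abstract also covers time-varying factors.

```python
import math

def hann(n):
    # Periodic Hann window; copies spaced n/2 apart sum to exactly 1.
    return [0.5 - 0.5 * math.cos(2.0 * math.pi * i / n) for i in range(n)]

def wsola(x, alpha, win_len=256, tol=64):
    """Time-scale x by a constant factor alpha > 0 (>1 lengthens, <1 shortens).

    Illustrative sketch only: win_len is assumed even, and the similarity
    measure (sum of squared differences) is an assumption of this example.
    """
    hop = win_len // 2                      # synthesis hop (50% overlap)
    w = hann(win_len)
    out_len = int(len(x) * alpha)
    y = [0.0] * (out_len + win_len)
    pad = tol + win_len                     # zero padding so slices stay in range
    xp = [0.0] * pad + list(x) + [0.0] * (pad + win_len)

    def seg(start):
        # Segment of win_len samples starting at input position `start`.
        return xp[pad + start: pad + start + win_len]

    prev = 0                                # start of the last segment used
    k = 0
    while k * hop < out_len:
        target = int(round(k * hop / alpha))    # ideal input position
        best = target
        if k > 0:
            # Natural continuation of the previously chosen segment.
            ref = seg(prev + hop)
            best_score = float("inf")
            # Search offsets within +/- tol, smallest |d| first so ties
            # favour the ideal position.
            for d in sorted(range(-tol, tol + 1), key=abs):
                cand = seg(target + d)
                score = sum((a - b) ** 2 for a, b in zip(ref, cand))
                if score < best_score:
                    best, best_score = target + d, score
        # Overlap-add the windowed segment into the output.
        for i, s in enumerate(seg(best)):
            y[k * hop + i] += w[i] * s
        prev = best
        k += 1
    return y[:out_len]
```

For example, `wsola(samples, 1.5)` would return a list about 1.5 times as long as `samples`. With `alpha = 1.0` the search always selects the ideal position, so the output reproduces the input after the first half-window fade-in. A practical implementation would use vectorized arithmetic rather than pure-Python loops.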