Phonetically-driven CELP coding using self-organizing maps

  • Authors:
  • Luis A. Hernández-Gómez;Eduardo López-Gonzalo

  • Affiliations:
  • Dpto. SSR, ETSI Telecomunicación, UPM, Madrid, Spain;Dpto. SSR, ETSI Telecomunicación, UPM, Madrid, Spain

  • Venue:
  • ICASSP'93 Proceedings of the 1993 IEEE international conference on Acoustics, speech, and signal processing: speech processing - Volume II
  • Year:
  • 1993

Quantified Score

Hi-index 0.00

Visualization

Abstract

The aim of the paper is to discuss how to combine efficient representations of both LPC parameters and excitation in CELP coders. For this purpose, we have tried a technique used both for VQ and speech recognition. LPC representations will be based on the quantization properties of Self-organizing Maps (SOM) that show as well good properties to detect spectral transitions, and can be used to select a phonetically driven excitation form. The trajectories in SOM's are exploited looking for the improvement of LPC-based speech coders in three different directions: a) to obtain phonetic classifications that assist speech coders to improve the representation of each specific class; b) to save bits by efficient encoding of the LPC envelope; and c) to provide fast search procedures in VQ. These three topics are used to improve the present quality of CELP coders at 4800 bps. and reduce the bit rate while keeping the quality of the synthetic speech. An improved 2400 bps CELP coder is proposed.