High-quality speech coding at 2.4 to 4.0 kbps based on time-frequency interpolation

Authors:
Yair Shoham
Affiliations:
Speech Coding Research Department, AT&T Bell Laboratories, Murray Hill, NJ
Venue:
ICASSP'93 Proceedings of the 1993 IEEE international conference on Acoustics, speech, and signal processing: speech processing - Volume II
Year:
1993

Abstract

This paper presents a new algorithm for high-quality speech coding and demonstrates its advantage over the conventional CELP algorithm for low-rate coding. The paper proposes an empirical but perceptually advantageous framework for processing voiced speech, called Time-Frequency Interpolation (TFI). The general formulation of the TFI technique is given first, followed by a description of a TFI speech coder. The performance of this coder at 4.05 and 2.50 kbps is demonstrated in terms of formal MOS scores. It is shown that the 4.05 kbps TFI coder is comparable in performance to the 8 kbps North American cellular standard IS-54 coder and to the 13 kbps European standard GSM coder. It is further shown that decreasing the bit rate to 2.50 kbps degrades performance only gracefully, and that the coder still delivers good-quality speech at this rate.