An instantaneous amplitude model based speech coder

Authors:
Cong Yu;Gang Li;Chaogeng Huang
Affiliations:
College of Information Engineering, Zhejiang University of Technology, Hangzhou, P.R. China;College of Information Engineering, Zhejiang University of Technology, Hangzhou, P.R. China;College of Information Engineering, Zhejiang University of Technology, Hangzhou, P.R. China
Venue:
ICICS'09 Proceedings of the 7th international conference on Information, communications and signal processing
Year:
2009

Abstract

In this paper, a new algorithm for speech coding is proposed. The algorithm is based on a revised sinusoidal model in which each component is represented by two instantaneous amplitudes and a frequency. This model avoids the difficulty of estimating the highly nonlinear phases and allows the amplitudes to be optimized once the frequencies have been estimated. Simulations indicate that the proposed model can represent speech signals very well. Furthermore, based on this model, a speech coder is developed in which all the estimated magnitudes are approximated with a four-parameter model. With the four parameters, the obtained phases, and the frequencies encoded using the simplest linear (scalar) quantization, a 16.64 kb/s speech coder is obtained that yields high-quality synthetic speech signals.
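
The following is a minimal sketch, not the authors' implementation, of why fixing the frequencies makes amplitude estimation easy. It assumes each component has the form a_k cos(w_k n) + b_k sin(w_k n), with the two amplitudes held constant over a short frame; this is a simplification of the instantaneous amplitudes described in the abstract. Under that assumption the signal is linear in a_k and b_k, so they follow from ordinary least squares. The helper names (fit_amplitudes, uniform_quantize), the frame length, the test frequencies, and the bit allocation are all hypothetical choices for illustration; the abstract does not specify them.

```python
import numpy as np

def fit_amplitudes(frame, freqs):
    """Least-squares fit of a_k, b_k in sum_k a_k*cos(w_k*n) + b_k*sin(w_k*n).

    frame: 1-D array of samples; freqs: frequencies in radians per sample.
    Assumes amplitudes are constant over the frame (illustrative simplification).
    """
    n = np.arange(len(frame))
    phase = np.outer(n, freqs)                         # shape (N, K)
    basis = np.hstack([np.cos(phase), np.sin(phase)])  # shape (N, 2K)
    coeffs, *_ = np.linalg.lstsq(basis, frame, rcond=None)
    k = len(freqs)
    return coeffs[:k], coeffs[k:], basis @ coeffs

def uniform_quantize(x, lo, hi, bits):
    """Uniform (linear) scalar quantizer on [lo, hi] with 2**bits levels."""
    levels = 2 ** bits
    step = (hi - lo) / (levels - 1)
    idx = np.clip(np.round((np.asarray(x) - lo) / step), 0, levels - 1).astype(int)
    return idx, lo + idx * step

# Synthetic frame with known parameters (hypothetical values, no noise).
freqs = np.array([0.2, 0.5])          # radians per sample, assumed already estimated
a_true = np.array([1.0, 0.5])
b_true = np.array([-0.3, 0.8])
n = np.arange(160)
frame = (np.cos(np.outer(n, freqs)) @ a_true
         + np.sin(np.outer(n, freqs)) @ b_true)

a_hat, b_hat, recon = fit_amplitudes(frame, freqs)

# Quantize the fitted amplitudes with a hypothetical 6-bit budget on [-2, 2].
idx, a_q = uniform_quantize(a_hat, -2.0, 2.0, bits=6)
```

Because the frequencies enter through the fixed basis matrix, no nonlinear phase search is needed in this sketch; the only nonlinear step left is the frequency estimation itself, which is assumed to have been done beforehand.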