Hammerstein model for speech coding

Authors:
Jari Turunen;Juha T. Tanttu;Pekka Loula
Affiliations:
Department of Information Technology, Tampere University of Technology, Pori, Pohjoisranta, Pori, Finland;Department of Information Technology, Tampere University of Technology, Pori, Pohjoisranta, Pori, Finland;Department of Information Technology, Tampere University of Technology, Pori, Pohjoisranta, Pori, Finland
Venue:
EURASIP Journal on Applied Signal Processing
Year:
2003

Citing 7
Cited 4

System identification: theory for the user

System identification: theory for the user
Modelling biphonation—the role of the vocal tract

Speech Communication - Special issue on speech production: models and data
An optimal two-stage identification algorithm for Hammerstein-Wiener nonlinear systems

Automatica (Journal of IFAC)
Nonlinear Long-Term Prediction of Speech Signals1

ICASSP '97 Proceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97)-Volume 2 - Volume 2
Time-Frequency Analysis of the Glottal Opening

ICASSP '97 Proceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97)-Volume 2 - Volume 2
Speech nonlinearities, modulations, and energy operators

ICASSP '91 Proceedings of the Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference
Nonlinear prediction of speech

ICASSP '91 Proceedings of the Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference

Phoneme analysis based on quantitative and qualitative entropy measurement

Computer Speech and Language
Bi-Spectral Acoustic Features for Robust Speech Recognition

IEICE - Transactions on Information and Systems
Exploiting nonlinearity in adaptive signal processing

NOLISP'07 Proceedings of the 2007 international conference on Advances in nonlinear speech processing
The system identification and control of Hammerstein system using non-uniform rational B-spline neural network and particle swarm optimization

Neurocomputing

Quantified Score

Hi-index	0.00

Visualization

Abstract

A nonlinear Hammerstein model is proposed for coding speech signals. Using Tsay's nonlinearity test, we first show that the great majority of speech frames contain nonlinearities (over 80% in our test data) when using 20-millisecond speech frames. Frame length correlates with the level of nonlinearity: the longer the frames the higher the percentage of nonlinear frames. Motivated by this result, we present a nonlinear structure using a frame-by-frame adaptive identification of the Hammerstein model parameters for speech coding. Finally, the proposed structure is compared with the LPC coding scheme for three phonemes /a/, /s/, and /k/ by calculating the Akaike information criterion of the corresponding residual signals. The tests show clearly that the residual of the nonlinear model presented in this paper contains significantly less information compared to that of the LPC scheme. The presented method is a potential tool to shape the residual signal in an encode-efficient form in speech coding.