New sub-band processing framework using non-linear predictive models for speech feature extraction

Authors:
Mohamed Chetouani;Amir Hussain;Bruno Gas;Jean-Luc Zarader
Affiliations:
Laboratoire des Instruments et Systèmes d'Ile-De-France, Université Paris VI, Paris, France;Dept. of Computing Science and Mathematics, University of Stirling, Scotland, UK;Laboratoire des Instruments et Systèmes d'Ile-De-France, Université Paris VI, Paris, France;Laboratoire des Instruments et Systèmes d'Ile-De-France, Université Paris VI, Paris, France
Venue:
NOLISP'05 Proceedings of the 3rd international conference on Non-Linear Analyses and Algorithms for Speech Processing
Year:
2005

Citing 5
Cited 0

Binaural sub-band adaptive speech enhancement using artifical neural networks

Speech Communication - Special issue on robust speech recognition
Speech and Audio Signal Processing: Processing and Perception of Speech and Music

Speech and Audio Signal Processing: Processing and Perception of Speech and Music
Subband Approach for Automatic Speaker Recognition: Optimal Division of the Frequency Domain

AVBPA '97 Proceedings of the First International Conference on Audio- and Video-Based Biometric Person Authentication
Sub-Band Based Recognition of Noisy Speech

ICASSP '97 Proceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97)-Volume 2 - Volume 2
Pattern Classification (2nd Edition)

Pattern Classification (2nd Edition)

Quantified Score

Hi-index	0.00

Visualization

Abstract

Speech feature extraction methods are commonly based on time and frequency processing approaches. In this paper, we propose a new framework based on sub-band processing and non-linear prediction. The key idea is to pre-process the speech signal by a filter bank. From the resulting signals, non-linear predictors are computed. The feature extraction method involves the association of different Neural Predictive Coding (NPC) models. We apply this new framework to phoneme classification and experiments carried out with the NTIMIT database show an improvement of the classification rates in comparison with the full-band approach. The new method is also shown to give better performance than the traditional Linear Predictive Coding (LPC), Mel Frequency Cepstral Coding (MFCC) and Perceptual Linear Prediction (PLP) methods.