Data Driven Design of Filter Bank for Speech Recognition

Authors:
Lukas Burget;Hynek Hermansky
Affiliations:
-;-
Venue:
TSD '01 Proceedings of the 4th International Conference on Text, Speech and Dialogue
Year:
2001

Citing 3
Cited 2

Fundamentals of speech recognition

Fundamentals of speech recognition
Speech and Audio Signal Processing: Processing and Perception of Speech and Music

Speech and Audio Signal Processing: Processing and Perception of Speech and Music
Data-driven methods for extracting features from speech

Data-driven methods for extracting features from speech

Evolutionary splines for cepstral filterbank optimization in phoneme classification

EURASIP Journal on Advances in Signal Processing - Special issue on biologically inspired signal processing: analyses, algorithms and applications
Filterbank optimization for robust ASR using GA and PSO

International Journal of Speech Technology

Quantified Score

Hi-index	0.00

Visualization

Abstract

Filter bank approach is commonly used in feature extraction phase of speech recognition (e.g. Mel frequency cepstral coefficients). Filter bank is applied for modification of magnitude spectrum according to physiological and psychological findings. However, since mechanism of human auditory system is not fully understood, the optimal filter bank parameters are not known. This work presents a method where the filter bank, optimized for discriminability between phonemes, is derived directly from phonetically labeled speech data using Linear Discriminant Analysis. This work can be seen as another proof of the fact that incorporation of psychoacoustic findings into feature extraction can lead to better recognition performance.