Filterbank optimization for robust ASR using GA and PSO

Authors:
R. K. Aggarwal;M. Dave
Affiliations:
Department of Computer Engineering, N.I.T., Kurukshetra, India;Department of Computer Engineering, N.I.T., Kurukshetra, India
Venue:
International Journal of Speech Technology
Year:
2012

Citing 15
Cited 0

Assessment for automatic speech recognition II: NOISEX-92: a database and an experiment to study the effect of additive noise on speech recognition systems

Speech Communication - Special issue on speech processing in adverse conditions
Swarm intelligence

Swarm intelligence
Comparison of different implementations of MFCC

Journal of Computer Science and Technology
A genetic classification error method for speech recognition

Signal Processing
Data Driven Design of Filter Bank for Speech Recognition

TSD '01 Proceedings of the 4th International Conference on Text, Speech and Dialogue
Parameter Selection in Particle Swarm Optimization

EP '98 Proceedings of the 7th International Conference on Evolutionary Programming VII
Optimization of HMM by a Genetic Algorithm

ICASSP '97 Proceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97)-Volume 3 - Volume 3
Modeling durations of syllables using neural networks

Computer Speech and Language
Springer Handbook of Speech Processing

Springer Handbook of Speech Processing
Acoustic modeling problem for automatic speech recognition system: conventional methods (Part I)

International Journal of Speech Technology
Acoustic modeling problem for automatic speech recognition system: advances and refinements (Part II)

International Journal of Speech Technology
New insights into the noise reduction Wiener filter

IEEE Transactions on Audio, Speech, and Language Processing
Ant colony system: a cooperative learning approach to the traveling salesman problem

IEEE Transactions on Evolutionary Computation
Particle Swarm Optimization: Basic Concepts, Variants and Applications in Power Systems

IEEE Transactions on Evolutionary Computation
A novel approach to HMM-based speech recognition systems using particle swarm optimization

Mathematical and Computer Modelling: An International Journal

Quantified Score

Hi-index	0.00

Visualization

Abstract

Automatic speech recognition (ASR) systems follow a well established approach of pattern recognition, that is signal processing based feature extraction at front-end and likelihood evaluation of feature vectors at back-end. Mel-frequency cepstral coefficients (MFCCs) are the features widely used in state-of-the-art ASR systems, which are derived by logarithmic spectral energies of the speech signal using Mel-scale filterbank. In filterbank analysis of MFCC there is no consensus for the spacing and number of filters used in various noise conditions and applications. In this paper, we propose a novel approach to use particle swarm optimization (PSO) and genetic algorithm (GA) to optimize the parameters of MFCC filterbank such as the central and side frequencies. The experimental results show that the new front-end outperforms the conventional MFCC technique. All the investigations are conducted using two separate classifiers, HMM and MLP, for Hindi vowels recognition in typical field condition as well as in noisy environment.