On the use of evolutionary algorithms to improve the robustness of continuous speech recognition systems in adverse conditions

Authors:
Sid-Ahmed Selouani;Douglas O'Shaughnessy
Affiliations:
Secteur Gestion de l'Information, Université de Moncton, Shippagan, Nouveau-Brunswick, Canada;INRS-Energie-Matériaux-Télécommunications, Université du Québec, Gauchetière Ouest, place Bonaventure, Montréal, Canada
Venue:
EURASIP Journal on Applied Signal Processing
Year:
2003

Citing 7
Cited 1

Classifier systems and genetic algorithms

Artificial Intelligence
Adaptation in natural and artificial systems

Adaptation in natural and artificial systems
Genetic algorithms + data structures = evolution programs (2nd, extended ed.)

Genetic algorithms + data structures = evolution programs (2nd, extended ed.)
Speech recognition in noisy environments: a survey

Speech Communication
Genetic Algorithms in Search, Optimization and Machine Learning

Genetic Algorithms in Search, Optimization and Machine Learning
Selection of Training Data for Neural Networks by a Genetic Algorithm

PPSN V Proceedings of the 5th International Conference on Parallel Problem Solving from Nature
Decision trees for phonological rules in continuous speech

ICASSP '91 Proceedings of the Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference

Incorporating phonetic knowledge into an evolutionary subspace approach for robust speech recognition

International Journal of Computers and Applications

Quantified Score

Hi-index	0.00

Visualization

Abstract

Limiting the decrease in performance due to acoustic environment changes remains a major challenge for continuous speech recognition (CSR) systems. We propose a novel approach which combines the Karhunen-Loève transform (KLT) in the mel-frequency domain with a genetic algorithm (GA) to enhance the data representing corrupted speech. The idea consists of projecting noisy speech parameters onto the space generated by the genetically optimized principal axis issued from the KLT. The enhanced parameters increase the recognition rate for highly interfering noise environments. The proposed hybrid technique, when included in the front-end of an HTK-based CSR system, outperforms that of the conventional recognition process in severe interfering car noise environments for a wide range of signal-to-noise ratios (SNRs) varying from 16 dB to -4 dB. We also showed the effectiveness of the KLT-GA method in recognizing speech subject to telephone channel degradations.