Feature Fusion Applied to Missing Data ASR with the Combination of Recognizers

Authors:
Neil Joshi;Ling Guan
Affiliations:
Department of Electrical and Computer Engineering, Ryerson University , Toronto, Toronto, Canada;Department of Electrical and Computer Engineering, Ryerson University, Toronto, Canada
Venue:
Journal of Signal Processing Systems
Year:
2010

Citing 8
Cited 0

Mixed Memory Markov Models: Decomposing Complex Stochastic Processes as Mixtures of Simpler Ones

Machine Learning
Robust automatic speech recognition with missing and unreliable acoustic data

Speech Communication
Vision: A Computational Investigation into the Human Representation and Processing of Visual Information

Vision: A Computational Investigation into the Human Representation and Processing of Visual Information
Pattern Recognition and Machine Learning (Information Science and Statistics)

Pattern Recognition and Machine Learning (Information Science and Statistics)
On noise masking for automatic missing data speech recognition: A survey and discussion

Computer Speech and Language
A fused hidden Markov model with application to bimodal speech processing

IEEE Transactions on Signal Processing
A review of speech-based bimodal recognition

IEEE Transactions on Multimedia
Input-output HMMs for sequence processing

IEEE Transactions on Neural Networks

Quantified Score

Hi-index	0.00

Visualization

Abstract

Automatic speech recognition under adverse noise conditions has been a challenging problem. Under noise conditions when the stationarity assumption is valid, effective techniques have been established to provide excellent recognition accuracies. Under the conditions when this assumption cannot hold, recognition performance de- clines rapidly. Missing data, MD, theory is a promising method for robust automatic speech recognition, ASR, under an y noise condition. Unfortunately, the choice of feature used in the recognizer process is commonly limited to spectral based representations. The combination of recognizers approach to MD ASR allows the use of cepstral based features within the MD framework through a fusion of features mechanism in the pat- tern recognition stage. It was found that under two types of non-stationary noise conditions the combined fused effect, experienced by the fusion process, increased recognition accuracies substantially over traditional MD and cepstral based recognizers.