Using multiple acoustic feature sets for speech recognition
Speech Communication
Automatic speech recognition and speech variability: A review
Speech Communication
Acoustic variability and automatic recognition of children's speech
Speech Communication
The application of hidden Markov models in speech recognition
Foundations and Trends in Signal Processing
Advances in Acoustic Modeling for the Recognition of Czech
TSD '08 Proceedings of the 11th international conference on Text, Speech and Dialogue
EURASIP Journal on Audio, Speech, and Music Processing - Intelligent Audio, Speech, and Music Processing Applications
Towards age-independent acoustic modeling
Speech Communication
Improved automatic speech recognition through speaker normalization
Computer Speech and Language
An automatic retraining method for speaker independent hidden Markov models
TSD'07 Proceedings of the 10th international conference on Text, speech and dialogue
Error approximation and minimum phone error acoustic model estimation
IEEE Transactions on Audio, Speech, and Language Processing
Unsupervised equalization of Lombard effect for speech recognition in noisy adverse environments
IEEE Transactions on Audio, Speech, and Language Processing
Statistical transformation of language and pronunciation models for spontaneous speech recognition
IEEE Transactions on Audio, Speech, and Language Processing
IEEE Transactions on Audio, Speech, and Language Processing - Special issue on processing reverberant speech: methodologies and applications
EURASIP Journal on Audio, Speech, and Music Processing - Special issue on atypical speech
Pitch mean based frequency warping
ISCSLP'06 Proceedings of the 5th international conference on Chinese Spoken Language Processing
Hi-index | 0.00 |
In an effort to reduce the degradation in speech recognition performance caused by variation in vocal tract shape among speakers, a frequency warping approach to speaker normalization is investigated. A set of low complexity, maximum likelihood based frequency warping procedures have been applied to speaker normalization for a telephone based connected digit recognition task. This paper presents an efficient means for estimating a linear frequency warping factor and a simple mechanism for implementing frequency warping by modifying the filter-bank in mel-frequency cepstrum feature analysis. An experimental study comparing these techniques to other well-known techniques for reducing variability is described. The results showed that frequency warping was consistently able to reduce word error rate by 20% even for very short utterances.