AMT '01 Proceedings of the 6th International Computer Science Conference on Active Media Technology
Automatic speech recognition and speech variability: A review
Speech Communication
Acoustic variability and automatic recognition of children's speech
Speech Communication
EURASIP Journal on Audio, Speech, and Music Processing - Intelligent Audio, Speech, and Music Processing Applications
Simultaneous translation of lectures and speeches
Machine Translation
Accuracy improvement for a voice recognition using field association knowledge
International Journal of Computer Applications in Technology
Towards age-independent acoustic modeling
Speech Communication
Improved automatic speech recognition through speaker normalization
Computer Speech and Language
Speaker normalization via springy discriminant analysis and pitch estimation
TSD'07 Proceedings of the 10th international conference on Text, speech and dialogue
Unsupervised equalization of Lombard effect for speech recognition in noisy adverse environments
IEEE Transactions on Audio, Speech, and Language Processing
Statistical transformation of language and pronunciation models for spontaneous speech recognition
IEEE Transactions on Audio, Speech, and Language Processing
Hi-index | 0.00 |
Differences in vocal tract size among individual speakers contribute to the variability of speech waveforms. The first-order effect of a difference in vocal tract length is a scaling of the frequency axis; a female speaker, for example, exhibits formants roughly 20% higher than the formants of from a male speaker, with the differences most severe in open vocal tract configurations. We describe a parametric method of normalisation which counteracts the effect of varied vocal tract length. The method is shown to be effective across a wide range of recognition systems and paradigms, but is particularly helpful in the case of a small amount of training data.