A parametric approach to vocal tract length normalization

Authors:
E. Eide;H. Gish
Affiliations:
BBN Syst. & Technol. Corp., Cambridge, MA, USA;BBN Syst. & Technol. Corp., Cambridge, MA, USA
Venue:
ICASSP '96 Proceedings of the Acoustics, Speech, and Signal Processing, 1996. on Conference Proceedings., 1996 IEEE International Conference - Volume 01
Year:
1996

Citing 0
Cited 15

Feature vs. Model Based Vocal Tract Length Normalization for a Speech Recognition-Based Interactive Toy

AMT '01 Proceedings of the 6th International Computer Science Conference on Active Media Technology
Multi-speaker articulatory trajectory formation based on speaker-independent articulatory HMMs

Speech Communication
Automatic speech recognition and speech variability: A review

Speech Communication
Acoustic variability and automatic recognition of children's speech

Speech Communication
A shift-based approach to speaker normalization using non-linear frequency-scaling model

Speech Communication
Towards an intelligent acoustic front end for automatic speech recognition: built-in speaker normalization

EURASIP Journal on Audio, Speech, and Music Processing - Intelligent Audio, Speech, and Music Processing Applications
Simultaneous translation of lectures and speeches

Machine Translation
Accuracy improvement for a voice recognition using field association knowledge

International Journal of Computer Applications in Technology
Towards age-independent acoustic modeling

Speech Communication
Improved automatic speech recognition through speaker normalization

Computer Speech and Language
Speaker normalization via springy discriminant analysis and pitch estimation

TSD'07 Proceedings of the 10th international conference on Text, speech and dialogue
Unsupervised equalization of Lombard effect for speech recognition in noisy adverse environments

IEEE Transactions on Audio, Speech, and Language Processing
Statistical transformation of language and pronunciation models for spontaneous speech recognition

IEEE Transactions on Audio, Speech, and Language Processing
Aging speech recognition with speaker adaptation techniques: Study on medium vocabulary continuous Bengali speech

Pattern Recognition Letters
Prior-shared feature and model space speaker adaptation by consistently employing map estimation

Speech Communication

Quantified Score

Hi-index	0.00

Visualization

Abstract

Differences in vocal tract size among individual speakers contribute to the variability of speech waveforms. The first-order effect of a difference in vocal tract length is a scaling of the frequency axis; a female speaker, for example, exhibits formants roughly 20% higher than the formants of from a male speaker, with the differences most severe in open vocal tract configurations. We describe a parametric method of normalisation which counteracts the effect of varied vocal tract length. The method is shown to be effective across a wide range of recognition systems and paradigms, but is particularly helpful in the case of a small amount of training data.