Fundamentals of speech recognition
Fundamentals of speech recognition
Human Speech Perception: Some Lessons from Automatic Speech Recognition
TSD '01 Proceedings of the 4th International Conference on Text, Speech and Dialogue
Hi-index | 0.00 |
An analysis based on wavelet modulation scales feature extraction is proposed. Considering human auditory perception and varieties of disturbances, instead of the frequency differences, wavelet modulation scales are adopted to reflect the dynamic features of speech in ASR. Experiments for the Chinese digit-string recognition show extracting the wavelet modulation scales as the dynamic features have good performance both in additional noises and convolutional noises environment.