Utterance normalization using vowel features in a spoken word recognition system for multiple speakers

Authors:
Sumio Ohno;Keikichi Hirose;Hiroya Fujisaki
Affiliations:
Dept. of Electronic Engineering, University of Tokyo, Tokyo, Japan;Dept. of Electronic Engineering, University of Tokyo, Tokyo, Japan;Dept. of Applied Electronics, Science University of Tokyo, Noda, Japan
Venue:
ICASSP'93 Proceedings of the 1993 IEEE international conference on Acoustics, speech, and signal processing: speech processing - Volume II
Year:
1993

Citing 1
Cited 0

Automatic Speech Recognition: The Development of the Sphinx Recognition System

Automatic Speech Recognition: The Development of the Sphinx Recognition System

Quantified Score

Hi-index	0.00

Visualization

Abstract

Utterance normalization and multiple-template mathcing are two techniques that complement each other to cope with speaker variablity. This paper proposes a new method of normalization based on linear transformation of acoustic features of input speech using only one isolated utterance each of the five vowels of Japanese by each individual speaker. Experiments on isolated word recognition combining the proposed normalization method and multiple-template DP matching showed a marked improvement in the recognition rate especially for smaller numbers of templates per word. Together with the fact that this method reduces the dimension of the feature vector by a factor of 4, the results demonstrate the validity of the proposed method.