Improvements in connected digit recognition using higher order spectral and energy features

Authors:
J. G. Wilpon;C.-H. Lee;L. R. Rabiner
Affiliations:
AT& T Bell Labs., Murray Hill, NJ, USA;AT& T Bell Labs., Murray Hill, NJ, USA;AT& T Bell Labs., Murray Hill, NJ, USA
Venue:
ICASSP '91 Proceedings of the Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference
Year:
1991

Citing 0
Cited 8

A robust connected-words recognizer

ICASSP'92 Proceedings of the 1992 IEEE international conference on Acoustics, speech and signal processing - Volume 1
HMM modeling for speaker independent voice dialing in car environment

ICASSP'92 Proceedings of the 1992 IEEE international conference on Acoustics, speech and signal processing - Volume 1
Segmental GPD training of HMM based speech recognizer

ICASSP'92 Proceedings of the 1992 IEEE international conference on Acoustics, speech and signal processing - Volume 1
Discriminative analysis for feature reduction in automatic speech recognition

ICASSP'92 Proceedings of the 1992 IEEE international conference on Acoustics, speech and signal processing - Volume 1
High performance connected digit recognition using codebook exponents

ICASSP'92 Proceedings of the 1992 IEEE international conference on Acoustics, speech and signal processing - Volume 1
A two pass classifier for utterance rejection in keyword spotting

ICASSP'93 Proceedings of the 1993 IEEE international conference on Acoustics, speech, and signal processing: speech processing - Volume II
Minimum error rate training based on N-best string models

ICASSP'93 Proceedings of the 1993 IEEE international conference on Acoustics, speech, and signal processing: speech processing - Volume II
Vector quantization for the efficient computation of continuous density likelihoods

ICASSP'93 Proceedings of the 1993 IEEE international conference on Acoustics, speech, and signal processing: speech processing - Volume II

Quantified Score

Hi-index	0.00

Visualization

Abstract

It is shown how one can apply the improved acoustic modeling techniques (using a continuous density hidden Markov model framework) developed for large vocabulary speech recognition applications to the problem of connected digit recognition with no changes made to the basic modeling techniques and with no vocabulary specific information used. The improved modeling techniques adopted in this study include an improved feature analysis procedure, which incorporates higher order cepstral and log energy time derivatives, and an improved acoustic resolution procedure, which uses more Gaussian mixture components per state to characterize the acoustic variability in each state of the model. Using these techniques, string accuracies of 98.6% for unknown length strings and 99.2% for known length strings were achieved on the standard Texas Instruments connected digits database.