Improved acoustic modeling with the SPHINX speech recognition system

Authors:
X. D. Huang;K. F. Lee;H. W. Hon;M. Y. Hwang
Affiliations:
Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA;Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA;Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA;Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
Venue:
ICASSP '91 Proceedings of the Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference
Year:
1991

Citing 0
Cited 6

Session 2: DARPA resource management and ATIS benchmark test poster session

HLT '91 Proceedings of the workshop on Speech and Natural Language
Automatic gender recognition

ICECS'03 Proceedings of the 2nd WSEAS International Conference on Electronics, Control and Signal Processing
Automatic speech-based classification of gender, age and accent

PKAW'10 Proceedings of the 11th international conference on Knowledge management and acquisition for smart systems and services
Subphonetic modeling with Markov states: senone

ICASSP'92 Proceedings of the 1992 IEEE international conference on Acoustics, speech and signal processing - Volume 1
Discriminative analysis for feature reduction in automatic speech recognition

ICASSP'92 Proceedings of the 1992 IEEE international conference on Acoustics, speech and signal processing - Volume 1
A successive state splitting algorithm for efficient allophone modeling

ICASSP'92 Proceedings of the 1992 IEEE international conference on Acoustics, speech and signal processing - Volume 1

Quantified Score

Hi-index	0.00

Visualization

Abstract

The authors report recent efforts to further improve the performance of the SPHINX system for speaker-independent continuous speech recognition. They adhere to the basic architecture of the SPHINX system and use the DARPA resource management task and training corpus. The improvements are evaluated on the 600 sentences that comprise the DARPA February and October 1989 test sets. Several techniques that substantially reduced SPHINX's error rate are presented. These techniques include dynamic features, semicontinuous hidden Markov models, speaker clustering, and the shared distribution modeling. The error rate of the baseline system was reduced by 45%.