The Lincoln large-vocabulary stack-decoder based HMM CSR

Authors:
Douglas B. Paul
Affiliations:
Lincoln Laboratory, MIT, Lexington, MA
Venue:
HLT '94 Proceedings of the workshop on Human Language Technology
Year:
1994

Citing 2
Cited 1

Experience with a stack decoder-based HMM CSR and back-OFF N-gram language models

HLT '91 Proceedings of the workshop on Speech and Natural Language
A rapid match algorithm for continuous speech recognition

HLT '90 Proceedings of the workshop on Speech and Natural Language

1993 benchmark tests for the ARPA spoken language program

HLT '94 Proceedings of the workshop on Human Language Technology

Quantified Score

Hi-index	0.00

Visualization

Abstract

The system described here is a large-vocabulary continuous-speech recognition (CSR) system with results obtained using the Wall Street Journal-based database [15]. The recognizer uses a stack decoder-based search strategy[1, 7, 14] with a left-to-right stochastic language model. This decoder has been shown to function effectively on 20K and 64K-word recognition of continuous speech. It operates left-to-right and can produce final textual output while continuing to accept additional input speech. Thus it need not wait for the end of the sentence and can be structured so that it can accept an unbounded length stream of input speech. The recognizer also features recognition-time adaptation to the user's voice. This system showed improvements of 42% for a 5K vocabulary and 35% for a 20K vocabulary compared to the November 92 evaluation test system.