An improved search algorithm using incremental knowledge for continuous speech recognition

Authors:
Fil Alleva;Xuedong Huang;Mei-Yuh Hwang
Affiliations:
School of Computer Science, Carnegie Mellon University, Pittsburgh, Pennsylvania;School of Computer Science, Carnegie Mellon University, Pittsburgh, Pennsylvania;School of Computer Science, Carnegie Mellon University, Pittsburgh, Pennsylvania
Venue:
ICASSP'93 Proceedings of the 1993 IEEE international conference on Acoustics, speech, and signal processing: speech processing - Volume II
Year:
1993

Citing 2
Cited 1

Problem-Solving Methods in Artificial Intelligence

Problem-Solving Methods in Artificial Intelligence
Recent progress on the VOYAGER system

HLT '90 Proceedings of the workshop on Speech and Natural Language

Benchmark tests for the DARPA Spoken Language Program

HLT '93 Proceedings of the workshop on Human Language Technology

Quantified Score

Hi-index	0.01

Visualization

Abstract

In this paper, we propose a search algorithm that incrementally makes effective use of detailed sources of knowledge. The proposed algorithm incrementally applies all available acoustic and linguistic infomation in three search phases. Phase one is a left to right Viterbi beam search that produces word end times and scores using right context between-word models with a bigram language model. Phase two, guided by results from phase one, is a right to left Viterbi beam search that produces word begin times and scores based on left context between-word models. Phase three is an A* search that combines the results of phases one and two with a long distance language model. Our objective is to maximize the recognition accuracy with a minimal increase in computational cost. With our decomposed, incremental, search algorithm, we show that early use of detailed acoustic models can significantly reduce the recognition error rate with a negligible increase in computational cost.