An improved search algorithm using incremental knowledge for continuous speech recognition

  • Authors:
  • Fil Alleva;Xuedong Huang;Mei-Yuh Hwang

  • Affiliations:
  • School of Computer Science, Carnegie Mellon University, Pittsburgh, Pennsylvania;School of Computer Science, Carnegie Mellon University, Pittsburgh, Pennsylvania;School of Computer Science, Carnegie Mellon University, Pittsburgh, Pennsylvania

  • Venue:
  • ICASSP'93 Proceedings of the 1993 IEEE international conference on Acoustics, speech, and signal processing: speech processing - Volume II
  • Year:
  • 1993

Quantified Score

Hi-index 0.01

Visualization

Abstract

In this paper, we propose a search algorithm that incrementally makes effective use of detailed sources of knowledge. The proposed algorithm incrementally applies all available acoustic and linguistic infomation in three search phases. Phase one is a left to right Viterbi beam search that produces word end times and scores using right context between-word models with a bigram language model. Phase two, guided by results from phase one, is a right to left Viterbi beam search that produces word begin times and scores based on left context between-word models. Phase three is an A* search that combines the results of phases one and two with a long distance language model. Our objective is to maximize the recognition accuracy with a minimal increase in computational cost. With our decomposed, incremental, search algorithm, we show that early use of detailed acoustic models can significantly reduce the recognition error rate with a negligible increase in computational cost.