Improving multi-lattice alignment based spoken keyword spotting

  • Authors:
  • Hui Lin;Alex Stupakov;Jeff Bilmes

  • Affiliations:
  • Department of Electrical Engineering, University of Washington, Seattle, USA;Department of Electrical Engineering, University of Washington, Seattle, USA;Department of Electrical Engineering, University of Washington, Seattle, USA

  • Venue:
  • ICASSP '09 Proceedings of the 2009 IEEE International Conference on Acoustics, Speech and Signal Processing
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

In previous work, we showed that using a lattice instead of the 1-best path to represent both the query and the utterance being searched is beneficial for spoken keyword spotting. In this paper, we introduce several techniques that further improve our multi-lattice alignment approach, including edit operation modeling and supervised training of the conditional probability table, something which cannot be directly trained by traditional maximum likelihood estimation. Experiments on TIMIT show that the proposed methods significantly improve the performance of spoken keyword spotting.