Posterior-based confidence measures for spoken term detection

  • Authors:
  • Dong Wang;Javier Tejedor;Joe Frankel;Simon King;Jose Colas

  • Affiliations:
  • The Centre for Speech Technology Research, University of Edinburgh, UK;The Centre for Speech Technology Research, University of Edinburgh, UK;The Centre for Speech Technology Research, University of Edinburgh, UK;The Centre for Speech Technology Research, University of Edinburgh, UK;Human Computer Technology Laboratory, Escuela Politecnica Superior, Universidad Autonoma de Madrid, Spain

  • Venue:
  • ICASSP '09 Proceedings of the 2009 IEEE International Conference on Acoustics, Speech and Signal Processing
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Confidence measures play a key role in spoken term detection (STD) tasks. The confidence measure expresses the posterior probability of the search term appearing in the detection period, given the speech. Traditional approaches are based on the acoustic and language model scores for candidate detections found using automatic speech recognition, with Bayes' rule being used to compute the desired posterior probability. In this paper, we present a novel direct posterior-based confidence measure which, instead of resorting to the Bayesian formula, calculates posterior probabilities from a multi-layer perceptron (MLP) directly. Compared with traditional Bayesian-based methods, the direct-posterior approach is conceptually and mathematically simpler. Moreover, the MLP-based model does not require assumptions to be made about the acoustic features such as their statistical distribution and the independence of static and dynamic co-efficients. Our experimental results in both English and Spanish demonstrate that the proposed direct posterior-based confidence improves STD performance.