Improvements in recognition of conversational telephone speech

  • Authors:
  • B. Peskin;M. Newman;D. McAllaster;V. Nagesha;H. Richards;S. Wegmann;M. Hunt;L. Gillick

  • Affiliations:
  • Dragon Syst. Inc., Newton, MA, USA;-;-;-;-;-;-;-

  • Venue:
  • ICASSP '99 Proceedings of the Acoustics, Speech, and Signal Processing, 1999. on 1999 IEEE International Conference - Volume 01
  • Year:
  • 1999

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper describes recent changes in Dragon's speech recognition system which have markedly improved performance on conversational telephone speech. Key changes include: the conversion to modified perceptual linear prediction (PLP)-based cepstra from mel-cepstra; the replacement of our usual IMELDA transformation by a new transform using "semi-tied covariance"; a new multi-pass adaptation protocol; probabilities on alternate pronunciations in the lexicon; the addition of word-boundary tags in our acoustic models and the redistribution of model parameters to build fewer output distributions but with more mixture components per model.