Training set issues in SRI's DECIPHER speech recognition system

  • Authors:
  • Hy Murveit;Mitch Weintraub;Mike Cohen

  • Affiliations:
  • -;-;-

  • Venue:
  • HLT '90 Proceedings of the workshop on Speech and Natural Language
  • Year:
  • 1990

Quantified Score

Hi-index 0.00

Visualization

Abstract

SRI has developed the DECIPHER system, a hidden Markov model (HMM) based continuous speech recognition system typically used in a speaker-independent manner. Initially we review the DECIPHER system, then we show that DECIPHER's speaker-independent performance improved by 20% when the standard 3990-sentence speaker-independent test set was augmented with training data from the 7200-sentence resource management speaker-dependent training sentences. We show a further improvement of over 20% when a version of corrective training was implemented. Finally we show improvement using parallel male- and female-trained models in DECIPHER. The word-error rate when all three improvements were combined was 3.7% on DARPA's February 1989 speaker-independent test set using the standard perplexity 60 wordpair grammar.