Vocabulary learning and environment normalization in vocabulary-independent speech recognition

  • Authors:
  • Hsiao-Wuen Hon;Kai-Fu Lee

  • Affiliations:
  • School of Computer Science, Carnegie Mellon University, Pittsburgh, Pennsylvania;Apple Computer, Inc., Cupertino, CA

  • Venue:
  • ICASSP'92 Proceedings of the 1992 IEEE international conference on Acoustics, speech and signal processing - Volume 1
  • Year:
  • 1992

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we are looking into the adaptation issues of vocabulary independent (VI) systems. Just as with speaker-adaptation in speaker-independent system, two vocabulary learning algorithms are implemented in order to tailor the VI subword models to the target vocabulary. The first algorithm is to generate vocabularyadapted clustering decision trees by focusing on relevant allophones during tree generation and reduces the VI error rate by 9%. The second algorithm, vocabulary-bias training, is to give the relevant allophones more prominence by assign more weight to them during Baum-Welch training of the generalized allophonic models and reduces the VI error rate by 15%. Finally, in order to overcome the degradation causing by the different acoustic environments used for VI training and testing, CDCN and ISDCN originally designed for microphone adaptation are incorporated into our VI system and both reduce the degradation of VI cross-environment recognition by 50%.