Task adaptation in stochastic language models for continuous speech recognition

  • Authors:
  • Shoichi Matsunaga;Tomokazu Yamada;Kiyohiro Shikano

  • Affiliations:
  • NTT Human Interface Laboratories, Musashino-shi, Tokyo, Japan;NTT Human Interface Laboratories, Musashino-shi, Tokyo, Japan;NTT Human Interface Laboratories, Musashino-shi, Tokyo, Japan

  • Venue:
  • ICASSP'92 Proceedings of the 1992 IEEE international conference on Acoustics, speech and signal processing - Volume 1
  • Year:
  • 1992

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper describes two approaches for adapting a specific syllable trigram model to a new task. One uses a small amount of text data similar to the target task, and the other uses supervised learning using the most recent input phrases. The effect of each adaptation is verified with syllable perplexity and phrase recognition. Where the syntactic knowledge was only the syllable trigram model, the perplexity was reduced from 54.5 to 18.1 for the adaptation using 100 phrases of similar text, and was reduced to 14.6 by the supervised learning. The recognition rates were also improved from 42.3% to 46.6% and 50.9%, respectively. Text similarity for speech recognition is also studied in this paper.