Task adaptation in stochastic language models for continuous speech recognition

Authors:
Shoichi Matsunaga; Tomokazu Yamada; Kiyohiro Shikano
Affiliations:
NTT Human Interface Laboratories, Musashino-shi, Tokyo, Japan (all three authors)
Venue:
ICASSP '92: Proceedings of the 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing, Volume 1
Year:
1992



Abstract

This paper describes two approaches for adapting a specific syllable trigram model to a new task. The first uses a small amount of text similar to the target task; the second uses supervised learning on the most recent input phrases. The effect of each adaptation is evaluated by syllable perplexity and by phrase recognition rate. When the syllable trigram model was the only source of syntactic knowledge, adaptation with 100 phrases of similar text reduced the perplexity from 54.5 to 18.1, and supervised learning reduced it to 14.6. The recognition rate improved from 42.3% to 46.6% and 50.9%, respectively. The paper also studies text similarity for speech recognition.
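
The abstract does not state how the trigram model is adapted, so the following is only a minimal sketch of one common way to do it, not the authors' method. It assumes linear interpolation between a general model and a model estimated from a small amount of similar text, with add-alpha smoothing, and it measures per-syllable perplexity. All function names, the interpolation weight `lam`, the smoothing constant `alpha`, and the toy romanized syllable sequences are hypothetical and chosen for illustration.

```python
import math
from collections import Counter

BOS, EOS = "<s>", "</s>"  # phrase boundary symbols (illustrative choice)

def trigram_counts(phrases):
    """Count syllable trigrams and their two-syllable contexts."""
    tri, ctx = Counter(), Counter()
    for syllables in phrases:
        padded = [BOS, BOS] + list(syllables) + [EOS]
        for i in range(2, len(padded)):
            tri[tuple(padded[i - 2:i + 1])] += 1
            ctx[tuple(padded[i - 2:i])] += 1
    return tri, ctx

def make_model(phrases, vocab, alpha=0.5):
    """Return p(w | u, v) with add-alpha smoothing (assumed, not from the paper)."""
    tri, ctx = trigram_counts(phrases)
    size = len(vocab) + 1  # +1 for the end-of-phrase symbol
    def prob(u, v, w):
        return (tri[(u, v, w)] + alpha) / (ctx[(u, v)] + alpha * size)
    return prob

def interpolate(base, adapt, lam):
    """Adapted model: lam * similar-text model + (1 - lam) * general model."""
    def prob(u, v, w):
        return lam * adapt(u, v, w) + (1.0 - lam) * base(u, v, w)
    return prob

def perplexity(prob, phrases):
    """Per-syllable perplexity: exp of the mean negative log-probability."""
    log_sum, n = 0.0, 0
    for syllables in phrases:
        padded = [BOS, BOS] + list(syllables) + [EOS]
        for i in range(2, len(padded)):
            log_sum += math.log(prob(padded[i - 2], padded[i - 1], padded[i]))
            n += 1
    return math.exp(-log_sum / n)

# Toy data (invented): a "general" corpus, a small "similar" corpus, a target phrase.
general = [["to", "u", "ro", "ku"], ["ka", "i", "gi"]]
similar = [["yo", "ya", "ku"], ["yo", "ya", "ku", "no"]]
target = [["yo", "ya", "ku"]]
vocab = {s for phrase in general + similar + target for s in phrase}

base_model = make_model(general, vocab)
adapted_model = interpolate(base_model, make_model(similar, vocab), lam=0.5)

print("base perplexity:   ", perplexity(base_model, target))
print("adapted perplexity:", perplexity(adapted_model, target))
```

In this toy setting the adapted model assigns higher probability to the target phrase than the general model does, which is the kind of perplexity reduction the abstract reports. The supervised variant could be approximated in the same sketch by re-estimating the adaptation model from the most recent correctly transcribed phrases, though the abstract gives no details of that procedure.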