Construction of corpus-based syntactic rules for accurate speech recognition

Authors:
Junko Hosaka;Toshiyuki Takezawa
Affiliations:
ATR Interpreting Telephony Research Laboratories, Kyoto, Japan;ATR Interpreting Telephony Research Laboratories, Kyoto, Japan
Venue:
COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 2
Year:
1992

Citing 2
Cited 2

Measuring information provided by language model and acoustic model in probabilistic speech recognition: theory and experimental results

Speech Communication
A stochastic approach to sentence parsing

ACL '84 Proceedings of the 10th International Conference on Computational Linguistics and 22nd annual meeting on Association for Computational Linguistics

Pause as a phrase demarcator for speech and language processing

COLING '94 Proceedings of the 15th conference on Computational linguistics - Volume 2
ATREUS: a comparative study of continuous speech recognition systems at ATR

ICASSP'93 Proceedings of the 1993 IEEE international conference on Acoustics, speech, and signal processing: speech processing - Volume II

Quantified Score

Hi-index	0.01

Visualization

Abstract

This paper describes the syntactic rules which are applied in the Japanese speech recognition module of a speech-to-speech translation system. Japanese is considered to be a free word/phrase order language. Since syntactic rules are applied as constraints to reduce the search space in speech recognition, applying rules which take into account all possible phrase orders can have almost the same effect as using no constraints. Instead, we take into consideration the recognition weaknesses of certain syntactic categories and treat them precisely, so that a minimal number of rules can work most effectively. In this paper we first examine which syntactic categories are easily misrecognized. Second, we consult our dialogue corpus, in order to provide the rules with great generality. Based on both studies, we refine the rules. Finally, we verify the validity of the refinement through speech recognition experiments.