On the dynamic adaptation of language models based on dialogue information

Authors:
J. M. Lucas-Cuesta;J. Ferreiros;F. FernáNdez-MartıNez;J. D. Echeverry;S. Lutfi
Affiliations:
Speech Technology Group, Universidad Politécnica de Madrid, Avenida Complutense 30, 28040 Madrid, Spain;Speech Technology Group, Universidad Politécnica de Madrid, Avenida Complutense 30, 28040 Madrid, Spain;Speech Technology Group, Universidad Politécnica de Madrid, Avenida Complutense 30, 28040 Madrid, Spain;Speech Technology Group, Universidad Politécnica de Madrid, Avenida Complutense 30, 28040 Madrid, Spain;Speech Technology Group, Universidad Politécnica de Madrid, Avenida Complutense 30, 28040 Madrid, Spain
Venue:
Expert Systems with Applications: An International Journal
Year:
2013

Citing 15
Cited 0

A Cache-Based Natural Language Model for Speech Recognition

IEEE Transactions on Pattern Analysis and Machine Intelligence
A dynamic language model for speech recognition

HLT '91 Proceedings of the workshop on Speech and Natural Language
Foundations of statistical natural language processing

Foundations of statistical natural language processing
Specialized Language Models Using Dialogue Predictions

ICASSP '97 Proceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97)-Volume 2 - Volume 2
Analyzing and improving statistical language models for speech recognition

Analyzing and improving statistical language models for speech recognition
Improvements in stochastic language modeling

HLT '91 Proceedings of the workshop on Speech and Natural Language
Language modeling with sentence-level mixtures

HLT '94 Proceedings of the workshop on Human Language Technology
A novel word clustering algorithm based on latent semantic analysis

ICASSP '96 Proceedings of the Acoustics, Speech, and Signal Processing, 1996. on Conference Proceedings., 1996 IEEE International Conference - Volume 01
Robust dialogue-state dependent language modeling using leaving-one-out

ICASSP '99 Proceedings of the Acoustics, Speech, and Signal Processing, 1999. on 1999 IEEE International Conference - Volume 02
ASR post-correction for spoken dialogue systems based on semantic, syntactic, lexical and contextual information

Speech Communication
Word Segments in Category-Based Language Models for Automatic Speech Recognition

IbPRIA '07 Proceedings of the 3rd Iberian conference on Pattern Recognition and Image Analysis, Part I
MAP adaptation of stochastic grammars

Computer Speech and Language
Dynamic language modeling for European Portuguese

Computer Speech and Language
Trigger-based language models: a maximum entropy approach

ICASSP'93 Proceedings of the 1993 IEEE international conference on Acoustics, speech, and signal processing: speech processing - Volume II
On the dynamic adaptation of stochastic language models

ICASSP'93 Proceedings of the 1993 IEEE international conference on Acoustics, speech, and signal processing: speech processing - Volume II

Quantified Score

Hi-index	12.05

Visualization

Abstract

We present an approach to adapt dynamically the language models (LMs) used by a speech recognizer that is part of a spoken dialogue system. We have developed a grammar generation strategy that automatically adapts the LMs using the semantic information that the user provides (represented as dialogue concepts), together with the information regarding the intentions of the speaker (inferred by the dialogue manager, and represented as dialogue goals). We carry out the adaptation as a linear interpolation between a background LM, and one or more of the LMs associated to the dialogue elements (concepts or goals) addressed by the user. The interpolation weights between those models are automatically estimated on each dialogue turn, using measures such as the posterior probabilities of concepts and goals, estimated as part of the inference procedure to determine the actions to be carried out. We propose two approaches to handle the LMs related to concepts and goals. Whereas in the first one we estimate a LM for each one of them, in the second one we apply several clustering strategies to group together those elements that share some common properties, and estimate a LM for each cluster. Our evaluation shows how the system can estimate a dynamic model adapted to each dialogue turn, which helps to significantly improve the performance of the speech recognition, which leads to an improvement in both the language understanding and the dialogue management tasks.