An Online Algorithm for Applying Reinforcement Learning to Handle Ambiguity in Spoken Dialogues

Authors:
Fangju Wang;Kyle Swegles
Affiliations:
University of Guelph, Guelph, Ontario, Canada N1G 2W1;University of Guelph, Guelph, Ontario, Canada N1G 2W1
Venue:
TAMC '09 Proceedings of the 6th Annual Conference on Theory and Applications of Models of Computation
Year:
2009

Citing 8
Cited 0

Reinforcement Learning

Reinforcement Learning
Machine Learning

Machine Learning
Automatic optimization of dialogue management

COLING '00 Proceedings of the 18th conference on Computational linguistics - Volume 1
Adaptive dialogue systems - interaction with interact

SIGDIAL '02 Proceedings of the 3rd SIGdial workshop on Discourse and dialogue - Volume 2
From vocal to multimodal dialogue management

Proceedings of the 8th international conference on Multimodal interfaces
Learning more effective dialogue strategies using limited dialogue move features

ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Automatic learning of dialogue strategy using dialogue simulation and reinforcement learning

HLT '02 Proceedings of the second international conference on Human Language Technology Research
A statistical approach to spoken dialog systems design and evaluation

Speech Communication

Quantified Score

Hi-index	0.00

Visualization

Abstract

Spoken dialogue systems (SDSs) have been widely used in human-computer communications, including database querying, online trouble shooting advising, etc. A major challenge in building an SDS is to handle ambiguity in natural languages. User queries, questions, descriptions in a natural language may be ambiguous. To be effective in practical applications, an SDS must be able to disambiguate input from its user(s). In our research, we develop an online algorithm for applying reinforcement learning to handle ambiguity in SDSs. We introduce a new user dialogue policy into the framework of reinforcement learning to model user dialogue behavior. Also, differing from the current reinforcement learning algorithms in speech and language processing that are characterized by offline training, our algorithm conducts both offline and online detection of user dialogue behavior. In this paper, we present the online algorithm for reinforcement learning, emphasizing the detection of user dialogue behavior. We also describe the initial implementation and experiments.