An Online Algorithm for Applying Reinforcement Learning to Handle Ambiguity in Spoken Dialogues

  • Authors:
  • Fangju Wang;Kyle Swegles

  • Affiliations:
  • University of Guelph, Guelph, Ontario, Canada N1G 2W1;University of Guelph, Guelph, Ontario, Canada N1G 2W1

  • Venue:
  • TAMC '09 Proceedings of the 6th Annual Conference on Theory and Applications of Models of Computation
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Spoken dialogue systems (SDSs) have been widely used in human-computer communications, including database querying, online trouble shooting advising, etc. A major challenge in building an SDS is to handle ambiguity in natural languages. User queries, questions, descriptions in a natural language may be ambiguous. To be effective in practical applications, an SDS must be able to disambiguate input from its user(s). In our research, we develop an online algorithm for applying reinforcement learning to handle ambiguity in SDSs. We introduce a new user dialogue policy into the framework of reinforcement learning to model user dialogue behavior. Also, differing from the current reinforcement learning algorithms in speech and language processing that are characterized by offline training, our algorithm conducts both offline and online detection of user dialogue behavior. In this paper, we present the online algorithm for reinforcement learning, emphasizing the detection of user dialogue behavior. We also describe the initial implementation and experiments.