A two-tier user simulation model for reinforcement learning of adaptive referring expression generation policies

Authors:
Srinivasan Janarthanam;Oliver Lemon
Affiliations:
University of Edinburgh;University of Edinburgh
Venue:
SIGDIAL '09 Proceedings of the SIGDIAL 2009 Conference: The 10th Annual Meeting of the Special Interest Group on Discourse and Dialogue
Year:
2009

Citing 5
Cited 10

A survey of statistical user simulation techniques for reinforcement-learning of dialogue management strategies

The Knowledge Engineering Review
Evaluating user simulations with the Cramér-von Mises divergence

Speech Communication
Learning lexical alignment policies for generating referring expressions in spoken dialogue systems

ENLG '09 Proceedings of the 12th European Workshop on Natural Language Generation
A wizard-of-oz environment to study referring expression generation in a situated spoken dialogue task

ENLG '09 Proceedings of the 12th European Workshop on Natural Language Generation
Agenda-based user simulation for bootstrapping a POMDP dialogue system

NAACL-Short '07 Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Companion Volume, Short Papers

Learning to adapt to unknown users: referring expression generation in spoken dialogue systems

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Optimising information presentation for spoken dialogue systems

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Generation under uncertainty

INLG '10 Proceedings of the 6th International Natural Language Generation Conference
Learning adaptive referring expression generation policies for spoken dialogue systems

Empirical methods in natural language generation
Simulation of the grounding process in spoken dialog systems with Bayesian networks

IWSDS'10 Proceedings of the Second international conference on Spoken dialogue systems for ambient environments
Learning dialogue strategies from older and younger simulated users

SIGDIAL '10 Proceedings of the 11th Annual Meeting of the Special Interest Group on Discourse and Dialogue
Adaptive referring expression generation in spoken dialogue systems: evaluation with real users

SIGDIAL '10 Proceedings of the 11th Annual Meeting of the Special Interest Group on Discourse and Dialogue
Assessing user simulation for dialog systems using human judges and automatic evaluation measures

Natural Language Engineering
A comparative study of reinforcement learning techniques on dialogue management

EACL '12 Proceedings of the Student Research Workshop at the 13th Conference of the European Chapter of the Association for Computational Linguistics
Generative goal-driven user simulation for dialog management

EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning

Quantified Score

Hi-index	0.00

Visualization

Abstract

We present a new two-tier user simulation model for learning adaptive referring expression generation (REG) policies for spoken dialogue systems using reinforcement learning. Current user simulation models that are used for dialogue policy learning do not simulate users with different levels of domain expertise and are not responsive to referring expressions used by the system. The two-tier model displays these features, that are crucial to learning an adaptive REG policy. We also show that the two-tier model simulates real user behaviour more closely than other baseline models, using the dialogue similarity measure based on Kullback-Leibler divergence.