Hierarchical reinforcement learning and hidden Markov models for task-oriented natural language generation

Authors:
Nina Dethlefs;Heriberto Cuayáhuitl
Affiliations:
University of Bremen;German Research Centre for Artificial Intelligence (DFKI), Saarbrücken
Venue:
HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers - Volume 2
Year:
2011

Citing 16
Cited 5

Introduction to Reinforcement Learning

Introduction to Reinforcement Learning
Bootstrapping Syntax and Recursion using Alginment-Based Learning

ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
PARADISE: a framework for evaluating spoken dialogue agents

ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
Generation that exploits corpus-based statistical knowledge

COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
Exploiting a probabilistic hierarchical model for generation

COLING '00 Proceedings of the 18th conference on Computational linguistics - Volume 1
Stochastic language generation for spoken dialogue systems

ANLP/NAACL-ConvSyst '00 Proceedings of the 2000 ANLP/NAACL Workshop on Conversational systems - Volume 3
Bootstrapping lexical choice via multiple-sequence alignment

EMNLP '02 Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10
Automatic generation of weather forecast texts using comprehensive probabilistic generation-space models

Natural Language Engineering
Hierarchical reinforcement learning with the MAXQ value function decomposition

Journal of Artificial Intelligence Research
Learning to adapt to unknown users: referring expression generation in spoken dialogue systems

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Optimising information presentation for spoken dialogue systems

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Phrase-based statistical language generation using graphical models and active learning

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
A simple domain-independent probabilistic approach to generation

EMNLP '10 Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
Hierarchical reinforcement learning for adaptive text generation

INLG '10 Proceedings of the 6th International Natural Language Generation Conference
The first challenge on generating instructions in virtual environments

Empirical methods in natural language generation
Optimising natural language generation decision making for situated dialogue

SIGDIAL '11 Proceedings of the SIGDIAL 2011 Conference

Optimising natural language generation decision making for situated dialogue

SIGDIAL '11 Proceedings of the SIGDIAL 2011 Conference
Adaptive information presentation for spoken dialogue systems: evaluation with human subjects

ENLG '11 Proceedings of the 13th European Workshop on Natural Language Generation
The GRUVE challenge: generating routes under uncertainty in virtual environments

ENLG '11 Proceedings of the 13th European Workshop on Natural Language Generation
The Bremen system for the GIVE-2.5 challenge

ENLG '11 Proceedings of the 13th European Workshop on Natural Language Generation
Comparing HMMs and Bayesian networks for surface realisation

NAACL HLT '12 Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies

Quantified Score

Hi-index	0.00

Visualization

Abstract

Surface realisation decisions in language generation can be sensitive to a language model, but also to decisions of content selection. We therefore propose the joint optimisation of content selection and surface realisation using Hierarchical Reinforcement Learning (HRL). To this end, we suggest a novel reward function that is induced from human data and is especially suited for surface realisation. It is based on a generation space in the form of a Hidden Markov Model (HMM). Results in terms of task success and human-likeness suggest that our unified approach performs better than greedy or random baselines.