Sparse approximate dynamic programming for dialog management

  • Authors:
  • Senthilkumar Chandramohan;Matthieu Geist;Olivier Pietquin

  • Affiliations:
  • SUPELEC - IMS Research Group, Metz - France;SUPELEC - IMS Research Group, Metz - France;SUPELEC - IMS Research Group, Metz - France

  • Venue:
  • SIGDIAL '10 Proceedings of the 11th Annual Meeting of the Special Interest Group on Discourse and Dialogue
  • Year:
  • 2010

Abstract

Spoken dialogue management strategy optimization by means of Reinforcement Learning (RL) is now part of the state of the art. Yet there is still a clear mismatch between the complexity implied by the naturalness required of dialogue systems and the inability of standard RL algorithms to scale up. Another issue is the sparsity of the training data available in the dialogue domain, which cannot ensure the convergence of most RL algorithms. In this paper, we propose to combine a sample-efficient generalization framework for RL with a feature selection algorithm to learn an optimal spoken dialogue management strategy.