Learning automata-based approach to learn dialogue policies in large state space

  • Authors:
  • G. Kumaravelan;R. Sivakumar

  • Affiliations:
  • Department of Computer Science, Pondicherry University, Karaikal Centre, Karaikal – 609605, India.;Department of Computer Science, AVVM Sri Pushpam College, Thanjavur – 613503, Tamilnadu, India

  • Venue:
  • International Journal of Intelligent Information and Database Systems
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper addresses the problem of scalable optimisation of dialogue policies in speech-based conversational systems using reinforcement learning. More specifically, for large state spaces several difficulties like large tables, an account of prior knowledge and data sparsity are faced. Hence, we present an online policy learning algorithm based on hierarchical structure learning automata using eligibility trace method to find optimal dialogue strategies that cover large state spaces. The proposed algorithm is capable of deriving an optimal policy that prescribes what action should be taken in various states of conversation so as to maximise the expected total reward to attain the goal and incorporates good exploration and exploitation in its updates to improve the naturalness of human-computer interaction. The proposed model is tested using the most sophisticated evaluation framework PARADISE for accessing the travel information system.