Social signal and user adaptation in reinforcement learning-based dialogue management

Authors:
Emmanuel Ferreira;Fabrice Lefèvre
Affiliations:
University of Avignon, Avignon, France;University of Avignon, Avignon, France
Venue:
Proceedings of the 2nd Workshop on Machine Learning for Interactive Systems: Bridging the Gap Between Perception, Action and Communication
Year:
2013

Citing 17
Cited 1

Policy Invariance Under Reward Transformations: Theory and Application to Reward Shaping

ICML '99 Proceedings of the Sixteenth International Conference on Machine Learning
PARADISE: a framework for evaluating spoken dialogue agents

ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
Spoken dialogue management using probabilistic reasoning

ACL '00 Proceedings of the 38th Annual Meeting on Association for Computational Linguistics
A survey of statistical user simulation techniques for reinforcement-learning of dialogue management strategies

The Knowledge Engineering Review
On the role of tracking in stationary environments

Proceedings of the 24th international conference on Machine learning
The Hidden Information State model: A practical framework for POMDP-based spoken dialogue management

Computer Speech and Language
Social signal processing: Survey of an emerging domain

Image and Vision Computing
Anytime point-based approximations for large POMDPs

Journal of Artificial Intelligence Research
A Bayesian approach to imitation in reinforcement learning

IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Planning and acting in partially observable stochastic domains

Artificial Intelligence
Tracking in Reinforcement Learning

ICONIP '09 Proceedings of the 16th International Conference on Neural Information Processing: Part I
Bayesian update of dialogue state: A POMDP framework for spoken dialogue systems

Computer Speech and Language
Emotion and reinforcement: affective facial expressions facilitate robot learning

ICMI'06/IJCAI'07 Proceedings of the ICMI 2006 and IJCAI 2007 international conference on Artifical intelligence for human computing
Parameter estimation for agenda-based user simulation

SIGDIAL '10 Proceedings of the 11th Annual Meeting of the Special Interest Group on Discourse and Dialogue
Kalman temporal differences

Journal of Artificial Intelligence Research
Reinforcement Learning: An Introduction

IEEE Transactions on Neural Networks
Paralinguistics in speech and language-State-of-the-art and the challenge

Computer Speech and Language

Machine learning for interactive systems and robots: a brief introduction

Proceedings of the 2nd Workshop on Machine Learning for Interactive Systems: Bridging the Gap Between Perception, Action and Communication

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper investigates the conditions under which cues from social signals can be used for user adaptation (or user tracking) of a learning agent. In this work we consider the case of the Reinforcement Learning (RL) of a dialogue management module. Social signals (gazes, postures, emotions, etc.) have an undeniable importance in human interactions and can be used as an additional and user-dependent (subjective) reinforcement signal during learning. In this paper, the Kalman Temporal Differences (KTD) framework is employed in combination with a potential-based shaping reward method to properly integrate the social information in the optimisation procedure and adapt the policy to user profiles. In a second step the ability of the method to track a new user profile (after self learning of the user or switch to a new user) is shown. Experiments carried out using a state-of-the-art goal-oriented dialogue management framework with simulations support our claims.