Assessment of dialogue systems by means of a new simulation technique

  • Authors:
  • R. López-Cózar;A. De la Torre;J. C. Segura;A. J. Rubio

  • Affiliations:
  • Dpto. Electrónica y Tecnología de Computadores, Universidad de Granada, 18071 Granada, Spain;Dpto. Electrónica y Tecnología de Computadores, Universidad de Granada, 18071 Granada, Spain;Dpto. Electrónica y Tecnología de Computadores, Universidad de Granada, 18071 Granada, Spain;Dpto. Electrónica y Tecnología de Computadores, Universidad de Granada, 18071 Granada, Spain

  • Venue:
  • Speech Communication
  • Year:
  • 2003

Abstract

In recent years, the development of tools and techniques to facilitate the evaluation of dialogue systems has attracted great interest. Such systems can be evaluated from various points of view, such as recognition and understanding rates, dialogue naturalness, and robustness against recognition errors. Evaluation usually requires compiling a large corpus of words and sentences uttered by users, relevant to the application domain the system is designed for. This paper proposes a new technique that makes it possible to reuse such a corpus for the evaluation and to check the performance of the system when different dialogue strategies are used. The technique is based on the automatic generation of conversations between the dialogue system and an additional dialogue system, called the user simulator, that represents the user's interaction with the dialogue system. The technique has been applied to evaluate a dialogue system developed in our lab using two different recognition front-ends and two different dialogue strategies to handle user confirmations. The experiments show that the prompt-dependent recognition front-end achieves better results, but that this front-end is appropriate only if users limit their utterances to those related to the current system prompt. The prompt-independent front-end achieves inferior results, but enables users to utter any permitted utterance at any time, irrespective of the system prompt. Consequently, this front-end may allow a more natural and comfortable interaction. The experiments also show that the re-prompting confirmation strategy enhances system performance for both recognition front-ends.
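The general idea of generating conversations automatically between a dialogue system and a user simulator can be illustrated with a minimal Python sketch. It is not the authors' implementation: the miniature corpus, the slot names, the toy recognizer with a fixed `error_rate`, and all function names are hypothetical, and confirmation answers are assumed to be recognized without error. The sketch only shows how a fixed corpus can be replayed to measure task success and dialogue length under a re-prompting confirmation strategy.

```python
import random

# Hypothetical miniature corpus, indexed by the slot each utterance fills.
CORPUS = {
    "food": ["ham sandwich", "cheese sandwich", "green salad"],
    "drink": ["small cola", "large beer", "orange juice"],
}

def user_simulator(slot, goal, heard=None):
    """Stand-in for the user: answer a slot prompt, or confirm what was heard."""
    if heard is None:
        return goal[slot]                           # reply to "which <slot>?"
    return "yes" if heard == goal[slot] else "no"   # reply to "did you say <heard>?"

def recognizer(utterance, slot, error_rate, rng):
    """Toy recognizer: with probability error_rate, return a different utterance."""
    if rng.random() < error_rate:
        return rng.choice([u for u in CORPUS[slot] if u != utterance])
    return utterance

def run_dialogue(goal, error_rate, rng, max_attempts=3):
    """Simulate one conversation; return (task_success, system_turns)."""
    filled, turns = {}, 0
    for slot in goal:
        for _ in range(max_attempts):
            turns += 1                              # system prompts for the slot
            heard = recognizer(user_simulator(slot, goal), slot, error_rate, rng)
            turns += 1                              # system asks for confirmation
            if user_simulator(slot, goal, heard) == "yes":
                filled[slot] = heard
                break                               # confirmed; else re-prompt
    return filled == goal, turns

def evaluate(n_dialogues, error_rate, seed=0):
    """Run many simulated dialogues; return (success_rate, mean_system_turns)."""
    rng = random.Random(seed)
    successes, total_turns = 0, 0
    for _ in range(n_dialogues):
        # Draw a random user goal from the corpus for each conversation.
        goal = {slot: rng.choice(utts) for slot, utts in CORPUS.items()}
        ok, turns = run_dialogue(goal, error_rate, rng)
        successes += ok
        total_turns += turns
    return successes / n_dialogues, total_turns / n_dialogues

if __name__ == "__main__":
    for rate in (0.0, 0.2, 0.5):
        print(rate, evaluate(1000, rate))
```

Because the user side is simulated, the same corpus can be replayed against a different recognizer or a different confirmation strategy simply by swapping the corresponding function and calling `evaluate` again.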