Application of backend database contents and structure to the design of spoken dialog services

  • Authors:
  • Luis Fernando D'Haro;Ricardo de Córdoba;Juan Manuel Montero;Javier Ferreiros;José Manuel Pardo

  • Affiliations:
  • Grupo de Tecnología del Habla, Universidad Politécnica de Madrid, Madrid, Spain;Grupo de Tecnología del Habla, Universidad Politécnica de Madrid, Madrid, Spain;Grupo de Tecnología del Habla, Universidad Politécnica de Madrid, Madrid, Spain;Grupo de Tecnología del Habla, Universidad Politécnica de Madrid, Madrid, Spain;Grupo de Tecnología del Habla, Universidad Politécnica de Madrid, Madrid, Spain

  • Venue:
  • Expert Systems with Applications: An International Journal
  • Year:
  • 2012

Quantified Score

Hi-index 12.05

Visualization

Abstract

Current development platforms for designing spoken dialog services feature different kinds of strategies to help designers build, test, and deploy their applications. In general, these platforms are made up of several assistants that handle the different design stages (e.g. definition of the dialog flow, prompt and grammar definition, database connection, or to debug and test the running of the application). In spite of all the advances in this area, in general the process of designing spoken-based dialog services is a time consuming task that needs to be accelerated. In this paper we describe a complete development platform that reduces the design time by using different types of acceleration strategies based on using information from the data model structure and database contents, as well as cumulative information obtained throughout the successive steps in the design. Thanks to these accelerations, the interaction with the platform is simplified and the design is reduced, in most cases, to simple confirmations to the ''proposals'' that the platform automatically provides at each stage. Different kinds of proposals are available to complete the application flow such as the possibility of selecting which information slots should be requested to the user together, predefined templates for common dialogs, the most probable actions that make up each state defined in the flow, different solutions to solve specific speech-modality problems such as the presentation of the lists of retrieved results after querying the backend database. The platform also includes accelerations for creating speech grammars and prompts, and the SQL queries for accessing the database at runtime. Finally, we will describe the setup and results obtained in a simultaneous summative, subjective and objective evaluations with different designers used to test the usability of the proposed accelerations as well as their contribution to reducing the design time and interaction.