Combining incremental language generation and incremental speech synthesis for adaptive information presentation

  • Authors:
  • Hendrik Buschmeier;Timo Baumann;Benjamin Dosch;Stefan Kopp;David Schlangen

  • Affiliations:
  • Bielefeld University;University of Hamburg;Bielefeld University;Bielefeld University;Bielefeld University

  • Venue:
  • SIGDIAL '12 Proceedings of the 13th Annual Meeting of the Special Interest Group on Discourse and Dialogue
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Participants in a conversation are normally receptive to their surroundings and their interlocutors, even while they are speaking and can, if necessary, adapt their ongoing utterance. Typical dialogue systems are not receptive and cannot adapt while uttering. We present combinable components for incremental natural language generation and incremental speech synthesis and demonstrate the flexibility they can achieve with an example system that adapts to a listener's acoustic understanding problems by pausing, repeating and possibly rephrasing problematic parts of an utterance. In an evaluation, this system was rated as significantly more natural than two systems representing the current state of the art that either ignore the interrupting event or just pause; it also has a lower response time.