Bootstrapping spoken dialogue systems by exploiting reusable libraries

  • Authors:
  • Giuseppe Di fabbrizio;Gokhan Tur;Dilek Hakkani-tÜr;Mazin Gilbert;Bernard Renger;David Gibbon;Zhu Liu;Behzad Shahraray

  • Affiliations:
  • At&t labs—research, 180 park avenue, florham park, nj 07932, usa e-mail: pino@research.att.com, gtur@research.att.com, dtur@research.att.com, mazin@research.att.com, renger@research.att. ...;At&t labs—research, 180 park avenue, florham park, nj 07932, usa e-mail: pino@research.att.com, gtur@research.att.com, dtur@research.att.com, mazin@research.att.com, renger@research.att. ...;At&t labs—research, 180 park avenue, florham park, nj 07932, usa e-mail: pino@research.att.com, gtur@research.att.com, dtur@research.att.com, mazin@research.att.com, renger@research.att. ...;At&t labs—research, 180 park avenue, florham park, nj 07932, usa e-mail: pino@research.att.com, gtur@research.att.com, dtur@research.att.com, mazin@research.att.com, renger@research.att. ...;At&t labs—research, 180 park avenue, florham park, nj 07932, usa e-mail: pino@research.att.com, gtur@research.att.com, dtur@research.att.com, mazin@research.att.com, renger@research.att. ...;At&t labs—research, 200 laurel avenue south, middletown, nj 07748, usa e-mail: dcg@research.att.com, zliu@research.att.com, behzad@research.att.com;At&t labs—research, 200 laurel avenue south, middletown, nj 07748, usa e-mail: dcg@research.att.com, zliu@research.att.com, behzad@research.att.com;At&t labs—research, 200 laurel avenue south, middletown, nj 07748, usa e-mail: dcg@research.att.com, zliu@research.att.com, behzad@research.att.com

  • Venue:
  • Natural Language Engineering
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Building natural language spoken dialogue systems requires large amounts of human transcribed and labeled speech utterances to reach useful operational service performances. Furthermore, the design of such complex systems consists of several manual steps. The User Experience (UE) expert analyzes and defines by hand the system core functionalities: the system semantic scope (call-types) and the dialogue manager strategy that will drive the human–machine interaction. This approach is extensive and error-prone since it involves several nontrivial design decisions that can be evaluated only after the actual system deployment. Moreover, scalability is compromised by time, costs, and the high level of UE know-how needed to reach a consistent design. We propose a novel approach for bootstrapping spoken dialogue systems based on the reuse of existing transcribed and labeled data, common reusable dialogue templates, generic language and understanding models, and a consistent design process. We demonstrate that our approach reduces design and development time while providing an effective system without any application-specific data.