Learning about voice search for spoken dialogue systems

  • Authors:
  • Rebecca J. Passonneau;Susan L. Epstein;Tiziana Ligorio;Joshua B. Gordon;Pravin Bhutada

  • Affiliations:
  • Columbia University;Hunter College of The City University of New York and The Graduate Center of The City University of New York;Hunter College of The City University of New York;Columbia University;Columbia University

  • Venue:
  • HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

In a Wizard-of-Oz experiment with multiple wizard subjects, each wizard viewed automated speech recognition (ASR) results for utterances whose interpretation is critical to task success: requests for books by title from a library database. To avoid non-understandings, the wizard directly queried the application database with the ASR hypothesis (voice search). To learn how to avoid misunderstandings, we investigated how wizards dealt with uncertainty in voice search results. Wizards were quite successful at selecting the correct title from query results that included a match. The most successful wizard could also tell when the query results did not contain the requested title. Our learned models of the best wizard's behavior combine features available to wizards with some that are not, such as recognition confidence and acoustic model scores.