Using Graphical Models for an Intelligent Mixed-Initiative Dialog Management System

Authors:
Stefan Schwärzler;Günther Ruske;Frank Wallhoff;Gerhard Rigoll
Affiliations:
Institute for Human-Machine Communication, Technische Universität München, Munich, Germany 80290;Institute for Human-Machine Communication, Technische Universität München, Munich, Germany 80290;Institute for Human-Machine Communication, Technische Universität München, Munich, Germany 80290;Institute for Human-Machine Communication, Technische Universität München, Munich, Germany 80290
Venue:
Proceedings of the Symposium on Human Interface 2009 on Human Interface and the Management of Information. Information and Interaction. Part II: Held as part of HCI International 2009
Year:
2009

Citing 2
Cited 0

Spoken Dialogue Technology

Spoken Dialogue Technology
Using machine learning to explore human multimodal clarification strategies

COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions

Quantified Score

Hi-index	0.00

Visualization

Abstract

The main goal of dialog management is to provide all information needed to perform e. g. a SQL-query, a navigation task, etc. Two principal approaches for dialog management systems exist: system directed ones and mixed-initiative ones. In this paper, we combine both approaches mentioned above in a novel way, and address the problem of natural intuitive dialog management. The objective of our approach is to provide a natural dialog flow. The whole dialog is therefore represented in a finite state machine: the information gathered during the dialog is represented in the states of the finite state machine; the transitions within the state machine denote the dialog steps into which the dialog is separated. The information is obtained from each natural spoken sentence by hierarchical decoding into tags, e. g. the name-tag and the address-tag. These information tags are gathered during the dialog; either by human initiative or by distinct questioning by the dialog manager. The models use information from the semantic information tags, the dialog history, and the training corpus. From all these integrated parts we achieve the best path to the end of the dialog by Viterbi decoding through the transition network after each information step. From the Air Travel Information System (ATIS) database, we extract all 21650 naturally spoken questions and the SQL-queries as answers for the trainings phase. The experiments have been realized on 200 automatically generated dialog sentences. The system obtains the semantic information in all test-sentences and leads the dialogs successfully to the end. In 66.5% of the sample dialogs we achieve the minimum of the required dialog steps. Hence, 33.5% of the dialogs have over-length.