Confirmation in multimodal systems

Authors:
David R. McGee;Philip R. Cohen;Sharon Oviatt
Affiliations:
Center for Human-computer Communication, Oregon Graduate Institute, Portland, Oregon;Center for Human-computer Communication, Oregon Graduate Institute, Portland, Oregon;Center for Human-computer Communication, Oregon Graduate Institute, Portland, Oregon
Venue:
COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 2
Year:
1998

Citing 11
Cited 13

Multimodal interaction in speech systems

Multimedia interface design
Integration of understanding and synthesis functions for multimedia interfaces

Multimedia interface design
A computational theory of grounding in natural language conversation

A computational theory of grounding in natural language conversation
Integration and synchronization of input modes during multimodal human-computer interaction

Proceedings of the ACM SIGCHI Conference on Human factors in computing systems
Navigating the Information Superhighway Using Spoken Language Interfaces

IEEE Expert: Intelligent Systems and Their Applications
“Put-that-there”: Voice and gesture at the graphics interface

SIGGRAPH '80 Proceedings of the 7th annual conference on Computer graphics and interactive techniques
The Design and Implementation of the TRAINS-96 System: A Prototype Mixed-Initiative Planning Assistant

The Design and Implementation of the TRAINS-96 System: A Prototype Mixed-Initiative Planning Assistant
An evaluation of strategies for selective utterance verification for spoken natural language dialog

ANLC '97 Proceedings of the fifth conference on Applied natural language processing
Unification-based multimodal integration

ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
An empirical model of acknowledgment for spoken-language systems

ACL '94 Proceedings of the 32nd annual meeting on Association for Computational Linguistics
Mulitmodal interaction for distributed interactive simulation

AAAI'97/IAAI'97 Proceedings of the fourteenth national conference on artificial intelligence and ninth conference on Innovative applications of artificial intelligence

Providing integrated toolkit-level support for ambiguity in recognition-based interfaces

Proceedings of the SIGCHI conference on Human Factors in Computing Systems
Interaction techniques for ambiguity resolution in recognition-based interfaces

UIST '00 Proceedings of the 13th annual ACM symposium on User interface software and technology
Something from nothing: augmenting a paper-based work practice via multimodal interaction

DARE '00 Proceedings of DARE 2000 on Designing augmented reality environments
Creating tangible interfaces by augmenting physical objects with multimodal language

Proceedings of the 6th international conference on Intelligent user interfaces
A voice and ink XML multimodal architecture for mobile e-commerce systems

WMC '02 Proceedings of the 2nd international workshop on Mobile commerce
Providing integrated toolkit-level support for ambiguity in recognition-based interfaces

CHI '00 Extended Abstracts on Human Factors in Computing Systems
Mutual disambiguation of 3D multimodal interaction in augmented and virtual reality

Proceedings of the 5th international conference on Multimodal interfaces
Guidelines for multimodal user interface design

Communications of the ACM - Multimodal interfaces that flex, adapt, and persist
Unification-based multimodal parsing

COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
The CommandTalk spoken dialogue system

ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics
Interaction techniques for ambiguity resolution in recognition-based interfaces

ACM SIGGRAPH 2006 Courses
Interaction techniques for ambiguity resolution in recognition-based interfaces

ACM SIGGRAPH 2007 courses
Designing the user interface for multimodal speech and pen-based gesture applications: state-of-the-art systems and future research directions

Human-Computer Interaction

Quantified Score

Hi-index	0.00

Visualization

Abstract

Systems that attempt to understand natural human input make mistakes, even humans. However, humans avoid misunderstandings by confirming doubtful input. Multimodal systems---those that combine simultaneous input from more than one modality, for example speech and gesture-have historically been designed so that they either request confirmation of speech, their primary modality, or not at all. Instead, we experimented with delaying confirmation until after the speech and gesture were combined into a complete multimodal command. In controlled experiments, subjects achieved more commands per minute at a lower error rate when the system delayed confirmation, than compared to when subjects confirmed only speech. In addition, this style of late confirmation meets the user's expectation that confirmed commands should be executable.