Mutual disambiguation of recognition errors in a multimodal architecture

  • Authors:
  • Sharon Oviatt

  • Affiliations:
  • Center for Human-Computer Communication, Oregon Graduate Institute of Science and Technology, P.O. Box 91000, Portland, OR

  • Venue:
  • Proceedings of the SIGCHI Conference on Human Factors in Computing Systems

  • Year:
  • 1999

Abstract

As a new generation of multimodal/media systems begins to define itself, researchers are attempting to learn how to combine different modes into strategically integrated whole systems. In theory, well designed multimodal systems should be able to integrate complementary modalities in a manner that supports mutual disambiguation (MD) of errors and leads to more robust performance. In this study, over 2,000 multimodal utterances by both native and accented speakers of English were processed by a multimodal system, and then logged and analyzed. The results confirmed that multimodal systems can indeed support significant levels of MD, and also higher levels of MD for the more challenging accented users. As a result, although speech recognition as a stand-alone performed far more poorly for accented speakers, their multimodal recognition rates did not differ from those of native speakers. Implications are discussed for the development of future multimodal architectures that can perform in a more robust and stable manner than individual recognition technologies. Also discussed is the design of interfaces that support diversity in tangible ways, and that function well under challenging real-world usage conditions.
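
The mutual-disambiguation idea can be illustrated with a small late-fusion sketch: each recognizer contributes an n-best list, semantically incompatible cross-modal pairings are pruned, and the surviving joint hypotheses are re-ranked. The sketch below is illustrative only and is not the architecture evaluated in the paper; the n-best lists, the compatibility table, and the product-of-scores ranking are assumptions chosen to show how evidence from one mode can pull a lower-ranked hypothesis from the other mode to the top.

    from itertools import product

    # Hypothetical n-best lists from two recognizers, each entry an
    # (interpretation, score) pair. Scores are assumed to be normalized
    # confidences in [0, 1]; the names and values are purely illustrative.
    speech_nbest = [("pan", 0.42), ("van", 0.38), ("ban", 0.20)]
    gesture_nbest = [("vehicle-symbol", 0.55), ("area-circle", 0.45)]

    # Toy semantic-compatibility table: which spoken words can unify
    # with which pen gestures. A real system would derive this from the
    # semantics and timing of the parallel input streams.
    compatible = {
        ("van", "vehicle-symbol"),
        ("pan", "area-circle"),
    }

    def fuse(speech, gesture):
        """Rank joint hypotheses, keeping only compatible cross-modal pairs.

        Mutual disambiguation occurs when the top joint hypothesis promotes
        an interpretation that was not ranked first on one of the
        individual n-best lists.
        """
        joint = [
            (s, g, s_score * g_score)          # simple product-of-scores fusion
            for (s, s_score), (g, g_score) in product(speech, gesture)
            if (s, g) in compatible            # prune incompatible pairs
        ]
        return sorted(joint, key=lambda h: h[2], reverse=True)

    if __name__ == "__main__":
        for spoken, gestured, score in fuse(speech_nbest, gesture_nbest):
            print(f"{spoken:>4} + {gestured:<14} score={score:.3f}")
        # "van" + "vehicle-symbol" wins even though "van" was ranked second
        # by the speech recognizer alone: the gesture evidence disambiguates it.

In this toy run the accented-speech-style error (a misrecognized top speech hypothesis) is recovered because the gesture channel vetoes the incompatible alternative, which is the qualitative effect the study measures at scale.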