Multimodal interaction in speech systems
Multimedia interface design
Integration of understanding and synthesis functions for multimedia interfaces
Multimedia interface design
A computational theory of grounding in natural language conversation
A computational theory of grounding in natural language conversation
Integration and synchronization of input modes during multimodal human-computer interaction
Proceedings of the ACM SIGCHI Conference on Human factors in computing systems
Navigating the Information Superhighway Using Spoken Language Interfaces
IEEE Expert: Intelligent Systems and Their Applications
“Put-that-there”: Voice and gesture at the graphics interface
SIGGRAPH '80 Proceedings of the 7th annual conference on Computer graphics and interactive techniques
The Design and Implementation of the TRAINS-96 System: A Prototype Mixed-Initiative Planning Assistant
An evaluation of strategies for selective utterance verification for spoken natural language dialog
ANLC '97 Proceedings of the fifth conference on Applied natural language processing
Unification-based multimodal integration
ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
An empirical model of acknowledgment for spoken-language systems
ACL '94 Proceedings of the 32nd annual meeting on Association for Computational Linguistics
Mulitmodal interaction for distributed interactive simulation
AAAI'97/IAAI'97 Proceedings of the fourteenth national conference on artificial intelligence and ninth conference on Innovative applications of artificial intelligence
Providing integrated toolkit-level support for ambiguity in recognition-based interfaces
Proceedings of the SIGCHI conference on Human Factors in Computing Systems
Interaction techniques for ambiguity resolution in recognition-based interfaces
UIST '00 Proceedings of the 13th annual ACM symposium on User interface software and technology
Something from nothing: augmenting a paper-based work practice via multimodal interaction
DARE '00 Proceedings of DARE 2000 on Designing augmented reality environments
Creating tangible interfaces by augmenting physical objects with multimodal language
Proceedings of the 6th international conference on Intelligent user interfaces
A voice and ink XML multimodal architecture for mobile e-commerce systems
WMC '02 Proceedings of the 2nd international workshop on Mobile commerce
Providing integrated toolkit-level support for ambiguity in recognition-based interfaces
CHI '00 Extended Abstracts on Human Factors in Computing Systems
Mutual disambiguation of 3D multimodal interaction in augmented and virtual reality
Proceedings of the 5th international conference on Multimodal interfaces
Guidelines for multimodal user interface design
Communications of the ACM - Multimodal interfaces that flex, adapt, and persist
Unification-based multimodal parsing
COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
The CommandTalk spoken dialogue system
ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics
Interaction techniques for ambiguity resolution in recognition-based interfaces
ACM SIGGRAPH 2006 Courses
Interaction techniques for ambiguity resolution in recognition-based interfaces
ACM SIGGRAPH 2007 courses
Hi-index | 0.00 |
Systems that attempt to understand natural human input make mistakes, even humans. However, humans avoid misunderstandings by confirming doubtful input. Multimodal systems---those that combine simultaneous input from more than one modality, for example speech and gesture-have historically been designed so that they either request confirmation of speech, their primary modality, or not at all. Instead, we experimented with delaying confirmation until after the speech and gesture were combined into a complete multimodal command. In controlled experiments, subjects achieved more commands per minute at a lower error rate when the system delayed confirmation, than compared to when subjects confirmed only speech. In addition, this style of late confirmation meets the user's expectation that confirmed commands should be executable.