To support context-based multimodal interpretation in conversational systems, we have developed a semantics-based representation that captures salient information from user inputs and the overall conversation. This representation has three distinctive characteristics: fine-grained semantic models, flexible composition of feature structures, and consistent representation at multiple levels. It allows our system to use rich context to resolve ambiguities, infer unspecified information, and improve multimodal alignment. As a result, the system achieves a better understanding of multimodal inputs, including abbreviated, imprecise, or complex ones.
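The "flexible composition of feature structures" mentioned above is typically realized by unification: partial information from one modality (e.g., speech) is merged with partial information from another (e.g., gesture), and the merge fails if the two carry conflicting values. The following is a minimal sketch of that idea, not the authors' implementation; representing feature structures as nested Python dicts, and the `speech`/`gesture` example inputs, are illustrative assumptions.

```python
def unify(fs1, fs2):
    """Recursively unify two feature structures.

    Feature structures are modeled as nested dicts with atomic leaf
    values. Returns the merged structure, or None on a value clash.
    """
    if not isinstance(fs1, dict) or not isinstance(fs2, dict):
        # Atomic values unify only if they are identical.
        return fs1 if fs1 == fs2 else None
    result = dict(fs1)
    for feat, val in fs2.items():
        if feat in result:
            merged = unify(result[feat], val)
            if merged is None:
                return None  # conflicting values: unification fails
            result[feat] = merged
        else:
            result[feat] = val  # new feature: simply add it
    return result

# Hypothetical example: an underspecified spoken command ("select this
# one") combined with a pointing gesture that identifies the referent.
speech = {"act": "select", "object": {"number": "sg"}}
gesture = {"object": {"type": "hotel", "id": "h42"}}
print(unify(speech, gesture))
```

Because unification is symmetric and order-insensitive for compatible inputs, the same operation can merge contributions from any number of modalities, and a `None` result signals a misalignment that the interpreter must resolve using discourse context.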