A probabilistic approach to reference resolution in multimodal user interfaces

  • Authors:
  • Joyce Y. Chai;Pengyu Hong;Michelle X. Zhou

  • Affiliations:
  • Michigan State University, East Lansing, MI;Harvard University, Cambridge, MA;IBM T. J. Watson Research Center, Hawthorne, NY

  • Venue:
  • Proceedings of the 9th international conference on Intelligent user interfaces
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

Multimodal user interfaces allow users to interact with computers through multiple modalities, such as speech, gesture, and gaze. To be effective, multimodal user interfaces must correctly identify all objects which users refer to in their inputs. To systematically resolve different types of references, we have developed a probabilistic approach that uses a graph-matching algorithm. Our approach identifies the most probable referents by optimizing the satisfaction of semantic, temporal, and contextual constraints simultaneously. Our preliminary user study results indicate that our approach can successfully resolve a wide variety of referring expressions, ranging from simple to complex and from precise to ambiguous ones.