Dialog in the open world: platform and applications

Authors:
Dan Bohus;Eric Horvitz
Affiliations:
Microsoft Research, Redmond, WA, USA;Microsoft Research, Redmond, WA, USA
Venue:
Proceedings of the 2009 international conference on Multimodal interfaces
Year:
2009

Citing 12
Cited 15

TRIPs: an integrated intelligent problem-solving assistant

AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Embodied agents for multi-party dialogue in immersive virtual worlds

Proceedings of the first international joint conference on Autonomous agents and multiagent systems: part 2
Collagen: applying collaborative discourse theory to human-computer interaction

AI Magazine
BusyBody: creating and fielding personalized models of the cost of interruption

CSCW '04 Proceedings of the 2004 ACM conference on Computer supported cooperative work
A model of attention and interest using Gaze behavior

Lecture Notes in Computer Science
Robust Visual Tracking via Pixel Classification and Integration

ICPR '06 Proceedings of the 18th International Conference on Pattern Recognition - Volume 03
The RavenClaw dialog management framework: Architecture and systems

Computer Speech and Language
Optimizing endpointing thresholds using dialogue features in a spoken dialogue system

SIGdial '08 Proceedings of the 9th SIGdial Workshop on Discourse and Dialogue
Models for multiparty engagement in open-world dialog

SIGDIAL '09 Proceedings of the SIGDIAL 2009 Conference: The 10th Annual Meeting of the Special Interest Group on Discourse and Dialogue
Learning to predict engagement with a spoken dialog system in open-world settings

SIGDIAL '09 Proceedings of the SIGDIAL 2009 Conference: The 10th Annual Meeting of the Special Interest Group on Discourse and Dialogue
Coordinate: probabilistic forecasting of presence and availability

UAI'02 Proceedings of the Eighteenth conference on Uncertainty in artificial intelligence
Real-Time Bayesian 3-D Pose Tracking

IEEE Transactions on Circuits and Systems for Video Technology

Towards relational POMDPs for adaptive dialogue management

ACLstudent '10 Proceedings of the ACL 2010 Student Research Workshop
Facilitating multiparty dialog with gaze, gesture, and speech

International Conference on Multimodal Interfaces and the Workshop on Machine Learning for Multimodal Interaction
Data miming: inferring spatial object descriptions from human gesture

Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Multiparty turn taking in situated dialog: study, lessons, and directions

SIGDIAL '11 Proceedings of the SIGDIAL 2011 Conference
Multimodal cue detection engine for orchestrated entertainment

MMM'12 Proceedings of the 18th international conference on Advances in Multimedia Modeling
Two people walk into a bar: dynamic multi-party social interaction with a robot agent

Proceedings of the 14th ACM international conference on Multimodal interaction
Learning speaker, addressee and overlap detection models from multimodal streams

Proceedings of the 14th ACM international conference on Multimodal interaction
Using group history to identify character-directed utterances in multi-child interactions

SIGDIAL '12 Proceedings of the 13th Annual Meeting of the Special Interest Group on Discourse and Dialogue
Attention-based addressee selection for service and social robots to interact with multiple persons

Proceedings of the Workshop at SIGGRAPH Asia
Execution memory for grounding and coordination

Proceedings of the 8th ACM/IEEE international conference on Human-robot interaction
Triggering effective social support for online groups

ACM Transactions on Interactive Intelligent Systems (TiiS)
Comparing task-based and socially intelligent behaviour in a robot bartender

Proceedings of the 15th ACM on International conference on multimodal interaction
Implementation and evaluation of a multimodal addressee identification mechanism for multiparty conversation systems

Proceedings of the 15th ACM on International conference on multimodal interaction
How can i help you': comparing engagement classification strategies for a robot bartender

Proceedings of the 15th ACM on International conference on multimodal interaction
A dominance estimation mechanism using eye-gaze and turn-taking information

Proceedings of the 6th workshop on Eye gaze in intelligent human machine interaction: gaze in multimodal interaction

Quantified Score

Hi-index	0.00

Visualization

Abstract

We review key challenges of developing spoken dialog systems that can engage in interactions with one or multiple participants in relatively unconstrained environments. We outline a set of core competencies for open-world dialog, and describe three prototype systems. The systems are built on a common underlying conversational framework which integrates an array of predictive models and component technologies, including speech recognition, head and pose tracking, probabilistic models for scene analysis, multiparty engagement and turn taking, and inferences about user goals and activities. We discuss the current models and showcase their function by means of a sample recorded interaction, and we review results from an observational study of open-world, multiparty dialog in the wild.