TRIPs: an integrated intelligent problem-solving assistant
AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Embodied agents for multi-party dialogue in immersive virtual worlds
Proceedings of the first international joint conference on Autonomous agents and multiagent systems: part 2
BusyBody: creating and fielding personalized models of the cost of interruption
CSCW '04 Proceedings of the 2004 ACM conference on Computer supported cooperative work
A model of attention and interest using Gaze behavior
Lecture Notes in Computer Science
Robust Visual Tracking via Pixel Classification and Integration
ICPR '06 Proceedings of the 18th International Conference on Pattern Recognition - Volume 03
The RavenClaw dialog management framework: Architecture and systems
Computer Speech and Language
Optimizing endpointing thresholds using dialogue features in a spoken dialogue system
SIGdial '08 Proceedings of the 9th SIGdial Workshop on Discourse and Dialogue
Models for multiparty engagement in open-world dialog
SIGDIAL '09 Proceedings of the SIGDIAL 2009 Conference: The 10th Annual Meeting of the Special Interest Group on Discourse and Dialogue
Learning to predict engagement with a spoken dialog system in open-world settings
SIGDIAL '09 Proceedings of the SIGDIAL 2009 Conference: The 10th Annual Meeting of the Special Interest Group on Discourse and Dialogue
Coordinate: probabilistic forecasting of presence and availability
UAI'02 Proceedings of the Eighteenth conference on Uncertainty in artificial intelligence
Real-Time Bayesian 3-D Pose Tracking
IEEE Transactions on Circuits and Systems for Video Technology
Towards relational POMDPs for adaptive dialogue management
ACLstudent '10 Proceedings of the ACL 2010 Student Research Workshop
Facilitating multiparty dialog with gaze, gesture, and speech
International Conference on Multimodal Interfaces and the Workshop on Machine Learning for Multimodal Interaction
Data miming: inferring spatial object descriptions from human gesture
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Multiparty turn taking in situated dialog: study, lessons, and directions
SIGDIAL '11 Proceedings of the SIGDIAL 2011 Conference
Multimodal cue detection engine for orchestrated entertainment
MMM'12 Proceedings of the 18th international conference on Advances in Multimedia Modeling
Two people walk into a bar: dynamic multi-party social interaction with a robot agent
Proceedings of the 14th ACM international conference on Multimodal interaction
Learning speaker, addressee and overlap detection models from multimodal streams
Proceedings of the 14th ACM international conference on Multimodal interaction
Using group history to identify character-directed utterances in multi-child interactions
SIGDIAL '12 Proceedings of the 13th Annual Meeting of the Special Interest Group on Discourse and Dialogue
Attention-based addressee selection for service and social robots to interact with multiple persons
Proceedings of the Workshop at SIGGRAPH Asia
Execution memory for grounding and coordination
Proceedings of the 8th ACM/IEEE international conference on Human-robot interaction
Triggering effective social support for online groups
ACM Transactions on Interactive Intelligent Systems (TiiS)
Comparing task-based and socially intelligent behaviour in a robot bartender
Proceedings of the 15th ACM on International conference on multimodal interaction
Proceedings of the 15th ACM on International conference on multimodal interaction
How can i help you': comparing engagement classification strategies for a robot bartender
Proceedings of the 15th ACM on International conference on multimodal interaction
A dominance estimation mechanism using eye-gaze and turn-taking information
Proceedings of the 6th workshop on Eye gaze in intelligent human machine interaction: gaze in multimodal interaction
Hi-index | 0.00 |
We review key challenges of developing spoken dialog systems that can engage in interactions with one or multiple participants in relatively unconstrained environments. We outline a set of core competencies for open-world dialog, and describe three prototype systems. The systems are built on a common underlying conversational framework which integrates an array of predictive models and component technologies, including speech recognition, head and pose tracking, probabilistic models for scene analysis, multiparty engagement and turn taking, and inferences about user goals and activities. We discuss the current models and showcase their function by means of a sample recorded interaction, and we review results from an observational study of open-world, multiparty dialog in the wild.