Multimodal Human Machine Interactions in Virtual and Augmented Reality

  • Authors:
  • Gérard Chollet;Anna Esposito;Annie Gentes;Patrick Horain;Walid Karam;Zhenbo Li;Catherine Pelachaud;Patrick Perrot;Dijana Petrovska-Delacrétaz;Dianle Zhou;Leila Zouari

  • Affiliations:
  • CNRS-LTCI TELECOM-ParisTech, Paris, France 75634;Dept. of Psychology and IIASS, Second University of Naples, Italy;CNRS-LTCI TELECOM-ParisTech, Paris, France 75634;TELECOM & Management SudParis, Evry, France;CNRS-LTCI TELECOM-ParisTech, Paris, France 75634;TELECOM & Management SudParis, Evry, France;CNRS-LTCI TELECOM-ParisTech, Paris, France 75634 and LINC, IUT de Montreuil, Université de Paris 8, Montreuil, France 93100;CNRS-LTCI TELECOM-ParisTech, Paris, France 75634 and Institut de Recherche Criminelle de la Gendarmerie Nationale (IRCGN), Rosny sous Bois, France;TELECOM & Management SudParis, Evry, France;TELECOM & Management SudParis, Evry, France;CNRS-LTCI TELECOM-ParisTech, Paris, France 75634

  • Venue:
  • Multimodal Signals: Cognitive and Algorithmic Issues
  • Year:
  • 2009


Abstract

Virtual worlds are developing rapidly over the Internet. They are visited by avatars and staffed with Embodied Conversational Agents (ECAs). An avatar is a representation of a physical person. Each person controls one or more avatars and usually receives feedback from the virtual world on an audio-visual display. Ideally, all senses should be engaged to feel fully embedded in a virtual world; in practice, sound, vision, and sometimes touch are the available modalities. This paper reviews the technological developments that enable audio-visual interactions in virtual and augmented reality worlds. Emphasis is placed on speech and gesture interfaces, including talking face analysis and synthesis.