3D Audiovisual Rendering and Real-Time Interactive Control of Expressivity in a Talking Head

  • Authors:
  • Jean-Claude Martin, Christophe D'Alessandro, Christian Jacquemin, Brian Katz, Aurélien Max, Laurent Pointal, Albert Rilliard

  • Affiliations:
  • LIMSI-CNRS, BP 133, 91403 Orsay Cedex, France (all authors)

  • Venue:
  • IVA '07: Proceedings of the 7th International Conference on Intelligent Virtual Agents
  • Year:
  • 2007


Abstract

The integration of virtual agents into real-time interactive virtual applications raises several challenges. The rendering of the virtual character's movements in the virtual scene (locomotion of the character or rotation of its head) and the 3D binaural rendering of the synthetic speech during these movements must be spatially coordinated. Furthermore, the system must enable real-time adaptation of the agent's expressive audiovisual signals to the user's ongoing actions. In this paper, we describe a platform that we have designed to address these challenges, comprising: (1) modules for real-time synthesis and spatial rendering of synthetic speech, (2) modules for real-time 3D rendering of facial expressions using a GPU-based 3D graphics engine, and (3) the integration of these modules within an experimental platform that uses gesture as an input modality. A new model of phoneme-dependent human speech directivity patterns is included in the speech synthesis system, so that the agent can move through the virtual scene with realistic 3D visual and audio rendering. Future applications of this platform include perceptual studies of multimodal perception and interaction, expressive real-time question-answering systems, and interactive arts.
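To illustrate the idea of phoneme-dependent speech directivity, the sketch below shows one plausible way such a model could be queried at render time: a per-phoneme-class table of attenuation values over azimuth, interpolated for the current talker-to-listener angle. The gain values, phoneme classes, and 45-degree sampling grid here are illustrative assumptions, not the measured patterns from the paper.

```python
import math

# Hypothetical per-phoneme-class directivity tables (attenuation in dB),
# sampled every 45 degrees of azimuth; 0 degrees = talker facing listener.
# Values are made up for illustration only.
DIRECTIVITY_DB = {
    # azimuth:        0     45    90    135    180   (degrees)
    "open_vowel":   [0.0, -1.0, -3.0,  -6.0,  -8.0],
    "fricative":    [0.0, -2.0, -6.0, -10.0, -14.0],
}

def directivity_gain_db(phoneme_class: str, azimuth_deg: float) -> float:
    """Linearly interpolate the attenuation (dB) for a phoneme class at a
    given azimuth between the talker's facing direction and the listener.
    Patterns are assumed left/right symmetric about the median plane."""
    table = DIRECTIVITY_DB[phoneme_class]
    az = abs(azimuth_deg) % 360.0
    if az > 180.0:
        az = 360.0 - az  # fold onto [0, 180] by symmetry
    idx = az / 45.0      # position on the 45-degree sampling grid
    lo = int(idx)
    if lo >= len(table) - 1:
        return table[-1]
    frac = idx - lo
    return table[lo] * (1.0 - frac) + table[lo + 1] * frac

def apply_gain(sample: float, gain_db: float) -> float:
    """Scale one audio sample by a gain expressed in decibels."""
    return sample * 10.0 ** (gain_db / 20.0)
```

In a full pipeline, the azimuth would be recomputed every frame from the animated character's head orientation and the virtual listener position, so that the radiated speech level changes consistently as the agent turns or walks.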