A Speech Driven Talking Head System Based on a Single Face Image

Authors:
I-Chen Lin;Cheng-Sheng Hung;Tzong-Jer Yang;Ming Ouhyoung
Venue:
PG '99 Proceedings of the 7th Pacific Conference on Computer Graphics and Applications
Year:
1999

Abstract

In this paper, a lifelike talking head system is proposed. The talking head, which is driven by speaker-independent speech recognition, requires only a single face image to synthesize lifelike facial expressions. The proposed system uses speech recognition engines to obtain the utterances and their corresponding time stamps in the speech data. The associated facial expressions are then fetched from an expression pool, and the synthetic facial expressions are synchronized with the speech. When applied to the Internet, our web-enabled talking head system can serve as a vivid merchandise narrator, and requires only 50 KB/minute plus a single face image (about 40 KB in CIF format, 24-bit color, JPEG compression). The system can synthesize facial animation at more than 30 frames/sec on a Pentium II 266 MHz PC.
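
The synchronization step described in the abstract (time-stamped utterances looked up in an expression pool) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the `TimedUnit` type, the labels, the contents of `EXPRESSION_POOL`, and the `frame_schedule` function are all hypothetical names chosen for illustration, and the abstract does not say how the expression pool is keyed or how frames are interpolated. The sketch only shows one plausible way to pick an expression for each video frame from recognizer time stamps.

```python
from bisect import bisect_right
from dataclasses import dataclass


@dataclass
class TimedUnit:
    """One recognized utterance unit with its time stamps (hypothetical type)."""
    label: str    # recognized unit, e.g. a phoneme-like label (assumed label set)
    start: float  # start time in seconds
    end: float    # end time in seconds


# Hypothetical expression pool: maps a recognized label to a mouth-shape key.
EXPRESSION_POOL = {
    "a": "mouth_open",
    "m": "mouth_closed",
    "o": "mouth_round",
}
NEUTRAL = "neutral"  # assumed fallback for silence or unknown labels


def frame_schedule(units, duration, fps=30):
    """Return one expression key per video frame.

    `units` must be sorted by start time and must not overlap.
    """
    starts = [u.start for u in units]
    frames = []
    for i in range(int(round(duration * fps))):
        t = i / fps
        # Index of the last unit that starts at or before time t.
        k = bisect_right(starts, t) - 1
        if k >= 0 and t < units[k].end:
            frames.append(EXPRESSION_POOL.get(units[k].label, NEUTRAL))
        else:
            frames.append(NEUTRAL)
    return frames


if __name__ == "__main__":
    units = [TimedUnit("m", 0.0, 0.15), TimedUnit("a", 0.15, 0.35)]
    print(frame_schedule(units, duration=0.5, fps=10))
```

In a real system the per-frame keys would drive image warping of the single face image rather than being printed, and neighboring expressions would likely be blended instead of switched abruptly; both of those steps are outside what the abstract specifies.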