In this paper, a lifelike talking head system is proposed. The talking head, which is driven by speaker-independent speech recognition, requires only a single face image to synthesize lifelike facial expressions. The proposed system uses speech recognition engines to obtain the utterances and their corresponding time stamps from the speech data. The associated facial expressions are then fetched from an expression pool, and the synthetic facial expressions are synchronized with the speech. When applied to the Internet, our web-enabled talking head system can serve as a vivid merchandise narrator, requiring only 50 KB per minute plus one face image (about 40 KB in CIF format, 24-bit color, JPEG compression). The system can synthesize facial animation at more than 30 frames/s on a Pentium II 266 MHz PC.
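The synchronization step described above (time-stamped utterances looked up in an expression pool and laid out along the video timeline) can be illustrated with a minimal sketch. It is not the paper's implementation: the `TimedUtterance` type, the `frames_for` function, the dictionary-based pool, and the `"neutral"` fallback are all hypothetical names chosen for illustration, and the abstract does not say how the recognizer output or the pool is actually represented.

```python
from dataclasses import dataclass


@dataclass
class TimedUtterance:
    """One recognized unit with its time stamps (hypothetical structure)."""
    label: str    # recognized unit, e.g. a phoneme or syllable label
    start: float  # start time in seconds
    end: float    # end time in seconds


def frames_for(utterances, expression_pool, fps=30, neutral="neutral"):
    """Return one expression key per video frame.

    Assumption: the expression pool is a plain mapping from utterance
    label to an expression key; unknown labels fall back to `neutral`.
    """
    if not utterances:
        return []
    duration = max(u.end for u in utterances)
    n_frames = int(round(duration * fps))
    frames = [neutral] * n_frames  # frames between utterances stay neutral
    for u in utterances:
        key = expression_pool.get(u.label, neutral)
        first = int(round(u.start * fps))
        last = min(n_frames, int(round(u.end * fps)))
        for i in range(first, last):
            frames[i] = key
    return frames


if __name__ == "__main__":
    pool = {"a": "open", "m": "closed"}  # illustrative pool contents
    speech = [TimedUtterance("a", 0.0, 0.1), TimedUtterance("m", 0.2, 0.3)]
    print(frames_for(speech, pool, fps=10))
```

A real system would presumably blend between neighboring expressions rather than switch abruptly at frame boundaries; the sketch only shows how time stamps can be converted into frame indices at a fixed frame rate.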