Biological Motion of Speech

Authors:
Gregor A. Kalberer;Pascal Müller;Luc J. Van Gool
Affiliations:
-;-;-
Venue:
BMCV '02 Proceedings of the Second International Workshop on Biologically Motivated Computer Vision
Year:
2002

Citing 4
Cited 0

Making faces

Proceedings of the 25th annual conference on Computer graphics and interactive techniques
Synthesizing realistic facial expressions from photographs

Proceedings of the 25th annual conference on Computer graphics and interactive techniques
Expression cloning

Proceedings of the 28th annual conference on Computer graphics and interactive techniques
Principal Components of Expressive Speech Animation

CGI '01 Proceedings of the International Conference on Computer Graphics

Quantified Score

Hi-index	0.01

Visualization

Abstract

The paper discusses the detailed analysis of visual speech. As with other forms of biological motion, humans are known to be very sensitive to the realism in the ways the lips move. In order to determine the elements that come to play in the perceptual analysis of visual speech, it is important to have control over the data. The paper discusses the capture of detailed 3D deformations of faces when talking. The data are detailed in both a temporal and spatial sense. The 3D positions of thousands of points on the face are determined at the temporal resolution of video. Such data have been decomposed into their basic modes, using ICA. It is noteworthy that this yielded better results than a mere PCA analysis, which results in modes that individually represent facial changes that anatomically inconsistent. The ICs better capture the underlying, anatomical changes that the face undergoes. Different visemes are all based on the underlying, joint action of the facial muscles. The IC modes do not reflect single muscles, but nevertheless decompose the speech related deformations into anatomically convincing modes, coined 'pseudo-muscles'.