Visual Estimation and Compression of Facial Motion Parameters—Elements of a 3D Model-Based Video Coding System

  • Authors:
  • Hai Tao; Thomas S. Huang

  • Affiliations:
  • Department of Computer Engineering, University of California, Santa Cruz, CA 95064, USA. tao@soe.ucsc.edu; Image Processing and Formation Laboratory, Beckman Institute, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA. huang@ifp.uiuc.edu

  • Venue:
  • International Journal of Computer Vision
  • Year:
  • 2002

Abstract

The MPEG-4 standard supports the transmission of facial animation and its composition with natural video. It does so through a facial animation parameter (FAP) set, which is defined from the study of minimal facial actions and is closely related to muscle actions. The FAP set enables model-based representation of natural or synthetic talking-head sequences and allows intelligible visual reproduction of facial expressions, emotions, and speech pronunciations at the receiver. This paper describes two key components we have developed for building a model-based video coding system: (1) a method for estimating FAPs based on our previously proposed piecewise Bézier volume deformation (PBVD) model, and (2) several methods for encoding FAPs. PBVD is a linear deformation model suitable for both the synthesis and the analysis of facial images. Each FAP corresponds to a basis function in this model. We present experimental results on PBVD-based animation, model-based tracking, and spatial-temporal compression of FAPs.
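
The practical consequence of a linear deformation model can be illustrated with a minimal sketch. It is not the PBVD model or the tracker described in the paper: the names `synthesize` and `estimate`, the flattened vertex layout, and the toy basis vectors are all assumptions made for illustration. The sketch only shows that when each parameter scales a fixed basis displacement, synthesis is a weighted sum and analysis reduces to a linear least-squares problem.

```python
from typing import List

Vector = List[float]


def synthesize(neutral: Vector, basis: List[Vector], params: Vector) -> Vector:
    """Linear deformation: v = v0 + sum_k p_k * b_k (hypothetical toy model)."""
    out = list(neutral)
    for p, b in zip(params, basis):
        for i, b_i in enumerate(b):
            out[i] += p * b_i
    return out


def estimate(neutral: Vector, basis: List[Vector], observed: Vector) -> Vector:
    """Recover parameters by least squares.

    Solves the normal equations (B^T B) p = B^T (v - v0) with Gauss-Jordan
    elimination. Assumes the basis vectors are linearly independent.
    """
    k = len(basis)
    diff = [o - n for o, n in zip(observed, neutral)]
    # Augmented matrix [B^T B | B^T diff].
    a = [
        [sum(x * y for x, y in zip(basis[i], basis[j])) for j in range(k)]
        + [sum(x * y for x, y in zip(basis[i], diff))]
        for i in range(k)
    ]
    for col in range(k):
        # Partial pivoting for numerical stability.
        pivot = max(range(col, k), key=lambda r: abs(a[r][col]))
        a[col], a[pivot] = a[pivot], a[col]
        for r in range(k):
            if r != col:
                f = a[r][col] / a[col][col]
                a[r] = [x - f * y for x, y in zip(a[r], a[col])]
    return [a[i][k] / a[i][i] for i in range(k)]


# Two 2D vertices flattened as [x0, y0, x1, y1]; two illustrative basis motions.
neutral = [0.0, 0.0, 1.0, 0.0]
basis = [[1.0, 0.0, 0.0, 0.0], [0.0, 1.0, 1.0, 0.0]]
deformed = synthesize(neutral, basis, [0.5, -2.0])
recovered = estimate(neutral, basis, deformed)
```

In this toy setting the observed vertex positions are known exactly. A real system would have to estimate the parameters from image measurements, which this sketch does not attempt.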