Multi-View AAM Fitting and Construction

Authors:
Krishnan Ramnath;Seth Koterba;Jing Xiao;Changbo Hu;Iain Matthews;Simon Baker;Jeffrey Cohn;Takeo Kanade
Affiliations:
Objectvideo Inc., Reston, USA 20191;The Robotics Institute, Carnegie Mellon University, Pittsburgh, USA 15213;Epson Palo Alto Laboratory, Epson Research & Development, San Jose, USA 95131;The Robotics Institute, Carnegie Mellon University, Pittsburgh, USA 15213;The Robotics Institute, Carnegie Mellon University, Pittsburgh, USA 15213;Microsoft Research, Microsoft Corporation, Redmond, USA 98052;Department of Psychology, University of Pittsburgh, Pittsburgh, USA 15260;The Robotics Institute, Carnegie Mellon University, Pittsburgh, USA 15213
Venue:
International Journal of Computer Vision
Year:
2008

Citing 28
Cited 4

Binocular Image Flows: Steps Toward Stereo-Motion Fusion

IEEE Transactions on Pattern Analysis and Machine Intelligence
Estimation of Displacements from Two 3-D Frames Obtained From Stereo

IEEE Transactions on Pattern Analysis and Machine Intelligence
Shape Ambiguities in Structure From Motion

IEEE Transactions on Pattern Analysis and Machine Intelligence
Linear Object Classes and Image Synthesis From a Single Example Image

IEEE Transactions on Pattern Analysis and Machine Intelligence
Efficient Region Tracking With Parametric Models of Geometry and Illumination

IEEE Transactions on Pattern Analysis and Machine Intelligence
A morphable model for the synthesis of 3D faces

Proceedings of the 26th annual conference on Computer graphics and interactive techniques
Multiple view geometry in computer visiond

Multiple view geometry in computer visiond
Active Appearance Models

IEEE Transactions on Pattern Analysis and Machine Intelligence
Three D-Dynamic Scene Analysis: A Stereo Based Approach

Three D-Dynamic Scene Analysis: A Stereo Based Approach
Active Appearance Models

ECCV '98 Proceedings of the 5th European Conference on Computer Vision-Volume II - Volume II
Optimal Structure from Motion: Local Ambiguities and Global Estimates

CVPR '98 Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
In defence of the 8-point algorithm

ICCV '95 Proceedings of the Fifth International Conference on Computer Vision
Using the Active Appearance Algorithm for Face and Facial Feature Tracking

RATFG-RTS '01 Proceedings of the IEEE ICCV Workshop on Recognition, Analysis, and Tracking of Faces and Gestures in Real-Time Systems (RATFG-RTS'01)
Active Blobs

ICCV '98 Proceedings of the Sixth International Conference on Computer Vision
Multidimensional Morphable Models

ICCV '98 Proceedings of the Sixth International Conference on Computer Vision
Active blobs: region-based, deformable appearance models

Computer Vision and Image Understanding - Special issue on nonrigid image registration
Efficient, Robust and Accurate Fitting of a 3D Morphable Model

ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
Capturing Subtle Facial Motions in 3D Face Tracking

ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
Lucas-Kanade 20 Years On: A Unifying Framework

International Journal of Computer Vision
Appearance-Based Face Recognition and Light-Fields

IEEE Transactions on Pattern Analysis and Machine Intelligence
Active Appearance Models Revisited

International Journal of Computer Vision
Automatic Construction of Active Appearance Models as an Image Coding Problem

IEEE Transactions on Pattern Analysis and Machine Intelligence
Multi-View AAM Fitting and Camera Calibration

ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1 - Volume 01
Uncalibrated Perspective Reconstruction of Deformable Structures

ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision - Volume 2
2D vs. 3D Deformable Face Models: Representational Power, Construction, and Real-Time Fitting

International Journal of Computer Vision
Active appearance models with occlusion

Image and Vision Computing
Real-time combined 2D+3D active appearance models

CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
Fast and reliable active appearance model search for 3-D face tracking

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics

Real time head pose estimation from consumer depth cameras

DAGM'11 Proceedings of the 33rd international conference on Pattern recognition
A realistic dynamic facial expression transfer method

Neurocomputing
Random Forests for Real Time 3D Face Analysis

International Journal of Computer Vision
Exploiting depth and intensity information for head pose estimation with random forests and tensor models

ACCV'12 Proceedings of the 11th international conference on Computer Vision - Volume 2

Quantified Score

Hi-index	0.00

Visualization

Abstract

Active Appearance Models (AAMs) are generative, parametric models that have been successfully used in the past to model deformable objects such as human faces. The original AAMs formulation was 2D, but they have recently been extended to include a 3D shape model. A variety of single-view algorithms exist for fitting and constructing 3D AAMs but one area that has not been studied is multi-view algorithms. In this paper we present multi-view algorithms for both fitting and constructing 3D AAMs. Fitting an AAM to an image consists of minimizing the error between the input image and the closest model instance; i.e. solving a nonlinear optimization problem. In the first part of the paper we describe an algorithm for fitting a single AAM to multiple images, captured simultaneously by cameras with arbitrary locations, rotations, and response functions. This algorithm uses the scaled orthographic imaging model used by previous authors, and in the process of fitting computes, or calibrates, the scaled orthographic camera matrices. In the second part of the paper we describe an extension of this algorithm to calibrate weak perspective (or full perspective) camera models for each of the cameras. In essence, we use the human face as a (non-rigid) calibration grid. We demonstrate that the performance of this algorithm is roughly comparable to a standard algorithm using a calibration grid. In the third part of the paper, we show how camera calibration improves the performance of AAM fitting. A variety of non-rigid structure-from-motion algorithms, both single-view and multi-view, have been proposed that can be used to construct the corresponding 3D non-rigid shape models of a 2D AAM. In the final part of the paper, we show that constructing a 3D face model using non-rigid structure-from-motion suffers from the Bas-Relief ambiguity and may result in a "scaled" (stretched/compressed) model. We outline a robust non-rigid motion-stereo algorithm for calibrated multi-view 3D AAM construction and show how using calibrated multi-view motion-stereo can eliminate the Bas-Relief ambiguity and yield face models with higher 3D fidelity.