Audio-guided video-based face recognition

Authors:
Xiaoou Tang;Zhifeng Li
Affiliations:
Department of Information Engineering, The Chinese University of Hong Kong, Hong Kong, China;Human-Computer Communications Laboratory, Department of Systems Engineering and Engineering Management, Chinese University of Hong Kong, Hong Kong, China
Venue:
IEEE Transactions on Circuits and Systems for Video Technology
Year:
2009

Citing 24
Cited 1

Decision Combination in Multiple Classifier Systems

IEEE Transactions on Pattern Analysis and Machine Intelligence
Eigenfaces vs. Fisherfaces: Recognition Using Class Specific Linear Projection

IEEE Transactions on Pattern Analysis and Machine Intelligence
Face Recognition by Elastic Bunch Graph Matching

IEEE Transactions on Pattern Analysis and Machine Intelligence
Active Appearance Models

IEEE Transactions on Pattern Analysis and Machine Intelligence
Empirical Performance Analysis of Linear Discriminant Classifiers

CVPR '98 Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
Face Recognition Using Temporal Image Sequence

FG '98 Proceedings of the 3rd. International Conference on Face & Gesture Recognition
Comparative Evaluation of Face Sequence Matching for Content-Based Video Access

FG '00 Proceedings of the Fourth IEEE International Conference on Automatic Face and Gesture Recognition 2000
Exemplar-Based Face Recognition from Video

FGR '02 Proceedings of the Fifth IEEE International Conference on Automatic Face and Gesture Recognition
Linear Dimensionality Reduction via a Heteroscedastic Extension of LDA: The Chernoff Criterion

IEEE Transactions on Pattern Analysis and Machine Intelligence
A Unified Framework for Subspace Face Recognition

IEEE Transactions on Pattern Analysis and Machine Intelligence
Face Recognition Using Laplacianfaces

IEEE Transactions on Pattern Analysis and Machine Intelligence
Online Learning of Probabilistic Appearance Manifolds for Video-Based Recognition and Tracking

CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 1 - Volume 01
Nonparametric Subspace Analysis for Face Recognition

CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 2 - Volume 02
Capitalize on Dimensionality Increasing Techniques for Improving Face Recognition Grand Challenge Performance

IEEE Transactions on Pattern Analysis and Machine Intelligence
Globally Maximizing, Locally Minimizing: Unsupervised Discriminant Projection with Applications to Face and Palm Biometrics

IEEE Transactions on Pattern Analysis and Machine Intelligence
Bayesian face recognition using support vector machine and face clustering

CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
Frame synchronization and multi-level subspace analysis for video based face recognition

CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
Video based face recognition using multiple classifiers

FGR' 04 Proceedings of the Sixth IEEE international conference on Automatic face and gesture recognition
Video-based face recognition using probabilistic appearance manifolds

CVPR'03 Proceedings of the 2003 IEEE computer society conference on Computer vision and pattern recognition
Video-based face recognition using adaptive hidden markov models

CVPR'03 Proceedings of the 2003 IEEE computer society conference on Computer vision and pattern recognition
Using Support Vector Machines to Enhance the Performance of Bayesian Face Recognition

IEEE Transactions on Information Forensics and Security
Texture information in run-length matrices

IEEE Transactions on Image Processing
Model-based temporal object verification using video

IEEE Transactions on Image Processing
A generic approach to simultaneous tracking and verification in video

IEEE Transactions on Image Processing

A new multi-purpose audio-visual UNMC-VIER database with multiple variabilities

Pattern Recognition Letters

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, we develop a new video-to-video face recognition algorithm. The major advantage of the videobased method is that more information is available in a video sequence than in a single image. In order to take advantage of the large amount of information in the video sequence and at the same time overcome the processing speed and data size problems, we develop several new techniques including temporal and spatial frame synchronization, multilevel discriminant subspace analysis, and multiclassifier integration for video sequence processing. An aligned video sequence for each person is first obtained by applying temporal and spatial synchronization, which effectively establishes the face correspondence using both audio and video information; then multilevel discriminant subspace analysis or multiclassifier integration is employed for further analysis based on the synchronized sequence. The method preserves most of the temporal-spatial information contained in a video sequence. Extensive experiments on the XM2VTS database clearly show the superiority of our new algorithms with near-perfect classification results (99.3%) obtained.