Spatio-temporal embedding for statistical face recognition from video

Authors:
Wei Liu;Zhifeng Li;Xiaoou Tang
Affiliations:
Department of Information Engineering, The Chinese University of Hong Kong, Hong Kong, China;Department of Information Engineering, The Chinese University of Hong Kong, Hong Kong, China;Department of Information Engineering, The Chinese University of Hong Kong, Hong Kong, China
Venue:
ECCV'06 Proceedings of the 9th European conference on Computer Vision - Volume Part II
Year:
2006

Citing 14
Cited 3

Introduction to statistical pattern recognition (2nd ed.)

Introduction to statistical pattern recognition (2nd ed.)
Eigenfaces vs. Fisherfaces: Recognition Using Class Specific Linear Projection

IEEE Transactions on Pattern Analysis and Machine Intelligence
Limits on Super-Resolution and How to Break Them

IEEE Transactions on Pattern Analysis and Machine Intelligence
Exemplar-Based Face Recognition from Video

ECCV '02 Proceedings of the 7th European Conference on Computer Vision-Part IV
Face Recognition Using Temporal Image Sequence

FG '98 Proceedings of the 3rd. International Conference on Face & Gesture Recognition
Comparative Evaluation of Face Sequence Matching for Content-Based Video Access

FG '00 Proceedings of the Fourth IEEE International Conference on Automatic Face and Gesture Recognition 2000
Probabilistic recognition of human faces from video

Computer Vision and Image Understanding - Special issue on Face recognition
Pattern Classification (2nd Edition)

Pattern Classification (2nd Edition)
A Unified Framework for Subspace Face Recognition

IEEE Transactions on Pattern Analysis and Machine Intelligence
Hallucinating Faces: TensorPatch Super-Resolution and Coupled Residue Compensation

CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 2 - Volume 02
Learning a Mahalanobis Metric from Equivalence Constraints

The Journal of Machine Learning Research
Frame synchronization and multi-level subspace analysis for video based face recognition

CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
Video-based face recognition using probabilistic appearance manifolds

CVPR'03 Proceedings of the 2003 IEEE computer society conference on Computer vision and pattern recognition
Video-based face recognition using adaptive hidden markov models

CVPR'03 Proceedings of the 2003 IEEE computer society conference on Computer vision and pattern recognition

An evaluation of video-to-video face verification

IEEE Transactions on Information Forensics and Security
Multi-eigenspace learning for video-based face recognition

ICB'07 Proceedings of the 2007 international conference on Advances in Biometrics
Dual-Feature bayesian MAP classification: exploiting temporal information for video-based face recognition

ICONIP'12 Proceedings of the 19th international conference on Neural Information Processing - Volume Part V

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper addresses the problem of how to learn an appropriate feature representation from video to benefit video-based face recognition. By simultaneously exploiting the spatial and temporal information, the problem is posed as learning Spatio-Temporal Embedding (STE) from raw video. STE of a video sequence is defined as its condensed version capturing the essence of space-time characteristics of the video. Relying on the co-occurrence statistics and supervised signatures provided by training videos, STE preserves the intrinsic temporal structures hidden in video volume, meanwhile encodes the discriminative cues into the spatial domain. To conduct STE, we propose two novel techniques, Bayesian keyframe learning and nonparametric discriminant embedding (NDE), for temporal and spatial learning, respectively. In terms of learned STEs, we derive a statistical formulation to the recognition problem with a probabilistic fusion model. On a large face video database containing more than 200 training and testing sequences, our approach consistently outperforms state-of-the-art methods, achieving a perfect recognition accuracy.