Spatio-temporal embedding for statistical face recognition from video

  • Authors:
  • Wei Liu;Zhifeng Li;Xiaoou Tang

  • Affiliations:
  • Department of Information Engineering, The Chinese University of Hong Kong, Hong Kong, China;Department of Information Engineering, The Chinese University of Hong Kong, Hong Kong, China;Department of Information Engineering, The Chinese University of Hong Kong, Hong Kong, China

  • Venue:
  • ECCV'06 Proceedings of the 9th European conference on Computer Vision - Volume Part II
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper addresses the problem of how to learn an appropriate feature representation from video to benefit video-based face recognition. By simultaneously exploiting the spatial and temporal information, the problem is posed as learning Spatio-Temporal Embedding (STE) from raw video. STE of a video sequence is defined as its condensed version capturing the essence of space-time characteristics of the video. Relying on the co-occurrence statistics and supervised signatures provided by training videos, STE preserves the intrinsic temporal structures hidden in video volume, meanwhile encodes the discriminative cues into the spatial domain. To conduct STE, we propose two novel techniques, Bayesian keyframe learning and nonparametric discriminant embedding (NDE), for temporal and spatial learning, respectively. In terms of learned STEs, we derive a statistical formulation to the recognition problem with a probabilistic fusion model. On a large face video database containing more than 200 training and testing sequences, our approach consistently outperforms state-of-the-art methods, achieving a perfect recognition accuracy.