Contextual motion field-based distance for video analysis

  • Authors:
  • Yadong Mu;Shuicheng Yan;Thomas Huang;Bingfeng Zhou

  • Affiliations:
  • Peking University, Institute of Computer Science and Technology, 100871, Beijing, P.R. China;National University of Singapore, ECE Department, 117576, Singapore, Singapore;University of Illinois at Urbana-Champaign, ECE Department, 61801, Urbana, IL, USA;Peking University, Institute of Computer Science and Technology, 100871, Beijing, P.R. China

  • Venue:
  • The Visual Computer: International Journal of Computer Graphics
  • Year:
  • 2008

Abstract

In this work, we propose a general method for computing the distance between video frames or sequences. Unlike conventional appearance-based methods, we first extract motion fields from the original videos. To avoid the huge memory requirement of previous approaches, we adopt a “bag of motion vectors” model and select a Gaussian mixture model as the compact representation. Estimating the distance between two frames is thus equivalent to calculating the distance between their corresponding Gaussian mixture models, which we compute via the earth mover's distance (EMD). On the basis of this inter-frame distance, we further develop distance measures for full video sequences. Our main contribution is fourfold. First, we operate on the tangent vector field of the spatio-temporal 2D surface manifold generated by video motions, rather than in the intensity gradient space; we argue that the former space is more fundamental. Second, the correlations between frames are explicitly exploited using a generative model named dynamic conditional random fields (DCRF). Under this framework, motion fields are estimated by Markov volumetric regression, which is more robust and may avoid the rank deficiency problem. Third, our definition of video distance accords with human intuition and makes a better tradeoff between frame dissimilarity and chronological ordering. Lastly, our definition of frame distance allows for partial distance.
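
To make the frame-distance idea in the abstract concrete, the following is a minimal sketch of an earth mover's distance between two Gaussian mixture models, each summarizing one frame's bag of motion vectors. It is an illustration under stated assumptions, not the authors' implementation: each mixture is reduced to its component weights and means, the ground distance between two components is taken to be the Euclidean distance between their means (covariances are ignored), and the function name `emd` and its parameters are hypothetical. The transport problem is solved with a small successive-shortest-path min-cost flow so that the block needs only the standard library. When the two mixtures carry different total weight, only the smaller mass is moved, which gives one possible reading of the partial distance mentioned in the abstract.

```python
import math


def emd(weights_a, means_a, weights_b, means_b):
    """Earth mover's distance between two weighted sets of component means.

    Assumption: ground distance = Euclidean distance between means.
    """
    m, n = len(weights_a), len(weights_b)
    # Node layout: 0 = source, 1..m = components of A,
    # m+1..m+n = components of B, m+n+1 = sink.
    src, snk = 0, m + n + 1
    # Each edge is [target, residual capacity, cost, index of reverse edge].
    graph = [[] for _ in range(m + n + 2)]

    def add_edge(u, v, cap, cost):
        graph[u].append([v, cap, cost, len(graph[v])])
        graph[v].append([u, 0.0, -cost, len(graph[u]) - 1])

    for i, w in enumerate(weights_a):
        add_edge(src, 1 + i, w, 0.0)
    for j, w in enumerate(weights_b):
        add_edge(1 + m + j, snk, w, 0.0)
    for i in range(m):
        for j in range(n):
            cost = math.dist(means_a[i], means_b[j])
            add_edge(1 + i, 1 + m + j, math.inf, cost)

    eps = 1e-12  # tolerance guarding against floating-point noise
    total_flow = 0.0
    total_cost = 0.0
    while True:
        # Bellman-Ford: cheapest source-to-sink path in the residual graph.
        dist = [math.inf] * len(graph)
        prev = [None] * len(graph)
        dist[src] = 0.0
        for _ in range(len(graph) - 1):
            updated = False
            for u in range(len(graph)):
                if dist[u] == math.inf:
                    continue
                for k, (v, cap, cost, _) in enumerate(graph[u]):
                    if cap > eps and dist[u] + cost < dist[v] - eps:
                        dist[v] = dist[u] + cost
                        prev[v] = (u, k)
                        updated = True
            if not updated:
                break
        if prev[snk] is None:
            break  # no augmenting path left: the smaller mass is fully moved

        # Find the bottleneck capacity along the path.
        push = math.inf
        v = snk
        while v != src:
            u, k = prev[v]
            push = min(push, graph[u][k][1])
            v = u
        # Push that much flow and update residual capacities.
        v = snk
        while v != src:
            u, k = prev[v]
            edge = graph[u][k]
            edge[1] -= push
            graph[v][edge[3]][1] += push
            v = u
        total_flow += push
        total_cost += push * dist[snk]

    # Normalize by the amount of mass actually moved.
    return total_cost / total_flow if total_flow > 0 else 0.0


# Two hypothetical frames, each a 2-component mixture over 2D motion vectors.
frame_a = ([0.5, 0.5], [(0.0, 0.0), (1.0, 0.0)])
frame_b = ([0.5, 0.5], [(0.0, 0.0), (3.0, 0.0)])
print(emd(*frame_a, *frame_b))
```

In this toy example the components at the origin match at zero cost and the remaining half of the mass travels a distance of 2, so the normalized cost is 1. A ground distance that also accounts for component covariances could be substituted inside `add_edge` without changing the rest of the sketch.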