Geometric video approximation using weighted matching pursuit

  • Authors:
  • Òscar Divorra Escoda;Gianluca Monaci;Rosa M. Figueras i Ventura;Pierre Vandergheynst;Michel Bierlaire

  • Affiliations:
  • Telefonica Research, Barcelona, Spain and Signal Processing Institute, Ecole Polytechnique Fédérale de Lausanne, Switzerland;Philips Research, Eindhoven, The Netherlands and Signal Processing Institute, Ecole Polytechnique Fédérale de Lausanne, Switzerland;Group of Interactive Coding of Image, Universitat Autonoma de Barcelona, Catalonia, Spain and Signal Processing Institute, Ecole Polytechnique Fédérale de Lausanne, Switzerland;Signal Processing Laboratory 2, Ecole Polytechnique Fédérale de Lausanne, Lausanne, Switzerland;Transport and Mobility Laboratory, EPFL and Mathematics Institute, Ecole Polytechnique Fédérale de Lausanne, Switzerland

  • Venue:
  • IEEE Transactions on Image Processing
  • Year:
  • 2009

Quantified Score

Hi-index 0.01

Visualization

Abstract

In recent years, works on geometric multidimensional signal representations have established a close relation with signal expansions on redundant dictionaries. For this purpose, matching pursuits (MP) have shown to be an interesting tool. Recently, most important limitations of MP have been underlined, and alternative algorithms like weighted-MP have been proposed. This work explores the use of weighted-MP as a new framework for motion-adaptive geometric video approximations. We study a novel algorithm to decompose video sequences in terms of few, salient video components that jointly represent the geometric and motion content of a scene. Experimental coding results on highly geometric content reflect how the proposed paradigm exploits spatio-temporal video geometry. Two-dimensional weighted-MP improves the representation compared to those based on 2-D MP. Furthermore, the extracted video components represent relevant visual structures with high saliency. In an example application, such components are effectively used as video descriptors for the joint audio-video analysis of multimedia sequences.