Object and Action Classification with Latent Window Parameters

  • Authors:
  • Hakan Bilen;Vinay P. Namboodiri;Luc J. Gool

  • Affiliations:
  • ESAT-PSI/iMinds, Ku Leuven, Heverlee, Belgium 3001;Alcatel-Lucent Bell Labs, Antwerp, Belgium 2018;ESAT-PSI/iMinds, Ku Leuven, Heverlee, Belgium 3001 and Computer Vision Laboratory, ETH Zürich, Zurich, Switzerland 8092

  • Venue:
  • International Journal of Computer Vision
  • Year:
  • 2014

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper we propose a generic framework to incorporate unobserved auxiliary information for classifying objects and actions. This framework allows us to automatically select a bounding box and its quadrants from which best to extract features. These spatial subdivisions are learnt as latent variables. The paper is an extended version of our earlier work Bilen et al. (Proceedings of The British Machine Vision Conference, 2011), complemented with additional ideas, experiments and analysis. We approach the classification problem in a discriminative setting, as learning a max-margin classifier that infers the class label along with the latent variables. Through this paper we make the following contributions: (a) we provide a method for incorporating latent variables into object and action classification; (b) these variables determine the relative focus on foreground versus background information that is taken account of; (c) we design an objective function to more effectively learn in unbalanced data sets; (d) we learn a better classifier by iterative expansion of the latent parameter space. We demonstrate the performance of our approach through experimental evaluation on a number of standard object and action recognition data sets.