Visual object detection with deformable part models

  • Authors:
  • Pedro Felzenszwalb;Ross Girshick;David McAllester;Deva Ramanan

  • Affiliations:
  • Brown University;EECS, UC Berkeley;Toyota Technological Institute at Chicago;UC Irvine

  • Venue:
  • Communications of the ACM
  • Year:
  • 2013

Quantified Score

Hi-index 48.22

Visualization

Abstract

We describe a state-of-the-art system for finding objects in cluttered images. Our system is based on deformable models that represent objects using local part templates and geometric constraints on the locations of parts. We reduce object detection to classification with latent variables. The latent variables introduce invariances that make it possible to detect objects with highly variable appearance. We use a generalization of support vector machines to incorporate latent information during training. This has led to a general framework for discriminative training of classifiers with latent variables. Discriminative training benefits from large training datasets. In practice we use an iterative algorithm that alternates between estimating latent values for positive examples and solving a large convex optimization problem. Practical optimization of this large convex problem can be done using active set techniques for adaptive subsampling of the training data.