A data-driven detection optimization framework

  • Authors:
  • William Robson Schwartz;Victor Hugo Cunha De Melo;Helio Pedrini;Larry S. Davis

  • Affiliations:
  • Department of Computer Science, Universidade Federal de Minas Gerais, Belo Horizonte, MG, 31270-010 Brazil;Department of Computer Science, Universidade Federal de Minas Gerais, Belo Horizonte, MG, 31270-010 Brazil;Institute of Computing, University of Campinas, Campinas, SP, 13083-852, Brazil;Department of Computer Science, University of Maryland, College Park 20740, USA

  • Venue:
  • Neurocomputing
  • Year:
  • 2013

Quantified Score

Hi-index 0.01

Visualization

Abstract

Due to the large amount of data to be processed by visual applications aiming at extracting high-level understanding of the scene, low-level methods such as object detection are required to have not only high accuracy but also low computational cost in order to provide fast and reliable information. Training sets containing samples representing multiple scenes are used to learn object detectors that can be reliably used in different scenarios. In general, information extracted from multiple feature channels is combined to capture the large variability present in these different environments. Although this approach provides accurate detection results, it usually leads to a high computational cost. On the other hand, if characteristics of the scene are known before-hand, a set of simple and fast computing features might be sufficient to provide high accuracy at a low computational cost. Therefore, it is valuable to seek a balance between these two extremes such that the detection method not only works well in different scenarios but also is able to extract enough information from a scene. We integrate a set of data-driven regression models with a multi-stage based human detection method trained to be used in different environments. The regressions are used to estimate the detector response at each stage and the location of the objects. The use of the regression models allows the method to reject large number of detection windows quickly. Experimental results based on human detection show that the addition of the regression models reduces the computational cost by as much as ten times with very small or no degradation on detection accuracy.