Histograms of Oriented Gradients for Human Detection
CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 1 - Volume 01
Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories
CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
The Pascal Visual Object Classes (VOC) Challenge
International Journal of Computer Vision
Evaluating Color Descriptors for Object and Scene Recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence
Object Detection with Discriminatively Trained Part-Based Models
IEEE Transactions on Pattern Analysis and Machine Intelligence
Discriminative mixture-of-templates for viewpoint classification
ECCV'10 Proceedings of the 11th European conference on Computer vision: Part V
Category independent object proposals
ECCV'10 Proceedings of the 11th European conference on Computer vision: Part V
Detecting people using mutually consistent poselet activations
ECCV'10 Proceedings of the 11th European conference on Computer vision: Part VI
Object segmentation by alignment of poselet activations to image contours
CVPR '11 Proceedings of the 2011 IEEE Conference on Computer Vision and Pattern Recognition
Segmentation as selective search for object recognition
ICCV '11 Proceedings of the 2011 International Conference on Computer Vision
Ensemble of exemplar-SVMs for object detection and beyond
ICCV '11 Proceedings of the 2011 International Conference on Computer Vision
ICCV '11 Proceedings of the 2011 International Conference on Computer Vision
Object-Centric spatial pooling for image classification
ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part II
Hi-index | 0.00 |
In this paper, we propose a multi-component approach for object detection. Rather than attempting to represent an object category with a monolithic model, or pre-defining a reduced set of aspects, we form visual clusters from the data that are tight in appearance and configuration spaces. We train individual classifiers for each component, and then learn a second classifier that operates at the category level by aggregating responses from multiple components. In order to reduce computation cost during detection, we adopt the idea of object window selection, and our segmentation-based selection mechanism produces fewer than 500 windows per image while preserving high object recall. When compared to the leading methods in the challenging VOC PASCAL 2010 dataset, our multi-component approach obtains highly competitive results. Furthermore, unlike monolithic detection methods, our approach allows the transfer of finer-grained semantic information from the components, such as keypoint location and segmentation masks.