Learning an Alphabet of Shape and Appearance for Multi-Class Object Detection

  • Authors:
  • Andreas Opelt; Axel Pinz; Andrew Zisserman

  • Affiliations:
  • Institute of Electrical Measurement and Measurement Signal Processing, Graz University of Technology, Graz, Austria; Institute of Electrical Measurement and Measurement Signal Processing, Graz University of Technology, Graz, Austria; Department of Engineering Science, University of Oxford, Oxford, UK

  • Venue:
  • International Journal of Computer Vision
  • Year:
  • 2008


Abstract

We present a novel algorithmic approach to object categorization and detection that can learn category-specific detectors, using Boosting, from a visual alphabet of shape and appearance. The alphabet itself is learnt incrementally during this process. The resulting representation consists of a set of category-specific descriptors--basic shape features are represented by boundary-fragments, and appearance is represented by patches--where each descriptor, in combination with centroid vectors for possible object centroids (geometry), forms an alphabet entry. Our experimental results highlight several qualities of this novel representation. First, we demonstrate the power of a purely shape-based representation with excellent categorization and detection results using a Boundary-Fragment-Model (BFM), and investigate the capabilities of such a model to handle changes in scale and viewpoint, as well as intra- and inter-class variability. Second, we show that incremental learning of a BFM for many categories leads to a sub-linear growth in the number of visual alphabet entries through sharing of shape features, and that this generalization over categories often also improves categorization performance compared with learning the categories independently. Finally, combining basic shape and appearance features (boundary-fragments and patches) can further improve results. Certain feature types are preferred by certain categories, and for some categories we achieve the lowest error rates that have been reported so far.
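
To make the representation concrete, the sketch below is a minimal Python illustration (not the authors' code; all class names, function names, and parameters are hypothetical) of an alphabet entry pairing a descriptor with centroid vectors, together with a Hough-style accumulation of centroid votes of the kind the Boundary-Fragment-Model uses at detection time. The matching cost is a placeholder; the paper matches boundary fragments and patches with dedicated measures rather than a simple Euclidean distance.

```python
import numpy as np

class AlphabetEntry:
    """Hypothetical alphabet entry: a category-specific descriptor
    (boundary fragment or appearance patch) plus centroid vectors,
    i.e. offsets from the feature location to possible object centroids."""

    def __init__(self, descriptor, centroid_vectors, category):
        self.descriptor = np.asarray(descriptor, dtype=float)
        self.centroid_vectors = [np.asarray(v, dtype=float) for v in centroid_vectors]
        self.category = category

    def distance(self, candidate):
        # Placeholder matching cost (the paper uses, e.g., Chamfer matching
        # for boundary fragments); assumption for illustration only.
        return np.linalg.norm(self.descriptor - np.asarray(candidate, dtype=float))


def vote_for_centroids(alphabet, detections, image_shape, match_thresh=1.0):
    """Accumulate Hough-style votes for object centroids.

    `detections` is a list of ((x, y), candidate_descriptor) pairs found in a
    test image; every matched alphabet entry casts one vote per centroid
    vector at location + vector. Maxima in the vote map indicate likely
    object centroids."""
    votes = np.zeros(image_shape, dtype=float)
    for (x, y), candidate in detections:
        for entry in alphabet:
            if entry.distance(candidate) < match_thresh:
                for v in entry.centroid_vectors:
                    cx = int(round(x + v[0]))
                    cy = int(round(y + v[1]))
                    if 0 <= cy < image_shape[0] and 0 <= cx < image_shape[1]:
                        votes[cy, cx] += 1.0
    return votes


if __name__ == "__main__":
    # Toy usage: one entry, one detected feature, a 32x32 vote map.
    alphabet = [AlphabetEntry([0.2, 0.8], [(5, -3)], "cow")]
    detections = [((10, 12), [0.25, 0.78])]
    votes = vote_for_centroids(alphabet, detections, image_shape=(32, 32))
    print("peak vote at", np.unravel_index(votes.argmax(), votes.shape))
```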