Learning compositional categorization models

Authors:
Björn Ommer;Joachim M. Buhmann
Affiliations:
Institute of Computational Science, ETH Zurich, Zurich, Switzerland;Institute of Computational Science, ETH Zurich, Zurich, Switzerland
Venue:
ECCV'06 Proceedings of the 9th European conference on Computer Vision - Volume Part III
Year:
2006

Citing 15
Cited 11

Classification by pairwise coupling

NIPS '97 Proceedings of the 1997 conference on Advances in neural information processing systems 10
Perceptual Organization and Visual Recognition

Perceptual Organization and Visual Recognition
Distortion Invariant Object Recognition in the Dynamic Link Architecture

IEEE Transactions on Computers
Unsupervised Learning of Models for Recognition

ECCV '00 Proceedings of the 6th European Conference on Computer Vision-Part I
Scale & Affine Invariant Interest Point Detectors

International Journal of Computer Vision
Distinctive Image Features from Scale-Invariant Keypoints

International Journal of Computer Vision
Pictorial Structures for Object Recognition

International Journal of Computer Vision
Learning to Detect Objects in Images via a Sparse, Part-Based Representation

IEEE Transactions on Pattern Analysis and Machine Intelligence
Combining Top-Down and Bottom-Up Segmentation

CVPRW '04 Proceedings of the 2004 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'04) Volume 4 - Volume 04
Learning Generative Visual Models from Few Training Examples: An Incremental Bayesian Approach Tested on 101 Object Categories

CVPRW '04 Proceedings of the 2004 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'04) Volume 12 - Volume 12
Shape Matching and Object Recognition Using Low Distortion Correspondences

CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 1 - Volume 01
Combining Generative Models and Fisher Kernels for Object Recognition

ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1 - Volume 01
Image Analysis, Random Fields and Markov Chain Monte Carlo Methods: A Mathematical Introduction (Stochastic Modelling and Applied Probability)

Image Analysis, Random Fields and Markov Chain Monte Carlo Methods: A Mathematical Introduction (Stochastic Modelling and Applied Probability)
The Representation and Matching of Pictorial Structures

IEEE Transactions on Computers
Object categorization by compositional graphical models

EMMCVPR'05 Proceedings of the 5th international conference on Energy Minimization Methods in Computer Vision and Pattern Recognition

A stochastic grammar of images

Foundations and Trends® in Computer Graphics and Vision
Object retrieval using configurations of salient regions

CIVR '08 Proceedings of the 2008 international conference on Content-based image and video retrieval
Learning an Alphabet of Shape and Appearance for Multi-Class Object Detection

International Journal of Computer Vision
Object Tracking Using Grayscale Appearance Models and Swarm Based Particle Filter

HAIS '08 Proceedings of the 3rd international workshop on Hybrid Artificial Intelligence Systems
Seeing the Objects Behind the Dots: Recognition in Videos from a Moving Camera

International Journal of Computer Vision
Object segmentation in video via graph cut built on superpixels

Fundamenta Informaticae - Cognitive Informatics, Cognitive Computing, and Their Denotational Mathematical Foundations (II)
A new compositional technique for hand posture recognition

ICCOMP'09 Proceedings of the WSEAES 13th international conference on Computers
A compositional technique for hand posture recognition: new results

WSEAS TRANSACTIONS on COMMUNICATIONS
Compositional object recognition, segmentation, and tracking in video

EMMCVPR'07 Proceedings of the 6th international conference on Energy minimization methods in computer vision and pattern recognition
Exploiting low-level image segmentation for object recognition

DAGM'06 Proceedings of the 28th conference on Pattern Recognition
Object segmentation in video via graph cut built on superpixels

Fundamenta Informaticae - Cognitive Informatics, Cognitive Computing, and Their Denotational Mathematical Foundations (II)

Quantified Score

Hi-index	0.01

Visualization

Abstract

This contribution proposes a compositional approach to visual object categorization of scenes. Compositions are learned from the Caltech 101 database and form intermediate abstractions of images that are semantically situated between low-level representations and the high-level categorization. Salient regions, which are described by localized feature histograms, are detected as image parts. Subsequently compositions are formed as bags of parts with a locality constraint. After performing a spatial binding of compositions by means of a shape model, coupled probabilistic kernel classifiers are applied thereupon to establish the final image categorization. In contrast to the discriminative training of the categorizer, intermediate compositions are learned in a generative manner yielding relevant part agglomerations, i.e. groupings which are frequently appearing in the dataset while simultaneously supporting the discrimination between sets of categories. Consequently, compositionality simplifies the learning of a complex categorization model for complete scenes by splitting it up into simpler, sharable compositions. The architecture is evaluated on the highly challenging Caltech 101 database which exhibits large intra-category variations. Our compositional approach shows competitive retrieval rates in the range of 53.6 ± 0.88% or, with a multi-scale feature set, rates of 57.8 ± 0.79%.