Weakly Supervised Scale-Invariant Learning of Models for Visual Recognition

  • Authors:
  • R. Fergus; P. Perona; A. Zisserman

  • Affiliations:
  • Department of Engineering Science, University of Oxford, Oxford, U.K. OX1 3PJ; Department of Electrical Engineering, California Institute of Technology, MC 136-93, Pasadena, U.S.A. 91125; Department of Engineering Science, University of Oxford, Oxford, U.K. OX1 3PJ

  • Venue:
  • International Journal of Computer Vision
  • Year:
  • 2007

Abstract

We investigate a method for learning object categories in a weakly supervised manner. Given a set of images known to contain the target category from a similar viewpoint, learning is translation and scale invariant, does not require alignment or correspondence between the training images, and is robust to clutter and occlusion. Category models are probabilistic constellations of parts, and their parameters are estimated by maximizing the likelihood of the training data. The appearance of the parts, as well as their mutual position, relative scale and probability of detection, are explicitly described in the model. Recognition takes place in two stages. First, a feature-finder identifies promising locations for the model's parts. Second, the category model is used to compare the likelihood that the observed features were generated by the category model against the likelihood that they arose from background clutter. The flexible nature of the model is demonstrated by results over six diverse object categories, including geometrically constrained categories (e.g. faces, cars) and flexible objects (such as animals).
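The two-stage recognition described in the abstract amounts to a likelihood-ratio test: features proposed by the feature-finder are scored under the category model and under a background-clutter model, and the category is accepted if the ratio exceeds a threshold. The Python sketch below illustrates that decision rule under simplifying assumptions (isotropic Gaussian part models, a uniform clutter model, greedy part-to-feature assignment); it is not the authors' implementation, which additionally models occlusion, relative scale, and part assignment probabilistically.

```python
import numpy as np

def gaussian_logpdf(x, mean, var):
    """Log density of an isotropic Gaussian, evaluated per feature."""
    x, mean = np.asarray(x, float), np.asarray(mean, float)
    d = x.shape[-1]
    return -0.5 * (np.sum((x - mean) ** 2, axis=-1) / var
                   + d * np.log(2 * np.pi * var))

def log_likelihood_object(features, part_means, part_var):
    """Score features under the object hypothesis.

    Greedy sketch: each model part is explained by its best-matching
    feature (the full constellation model sums over assignments).
    """
    total = 0.0
    for mean in part_means:
        total += gaussian_logpdf(features, mean, part_var).max()
    return total

def log_likelihood_background(features, image_area):
    """Score features under a simple clutter model: uniform over the image."""
    return len(features) * -np.log(image_area)

def recognise(features, part_means, part_var, image_area, threshold=0.0):
    """Accept the category if the log likelihood ratio exceeds a threshold."""
    ratio = (log_likelihood_object(features, part_means, part_var)
             - log_likelihood_background(features, image_area))
    return ratio > threshold

# Usage: three detected feature locations scored against a two-part model.
features = np.array([[10.0, 12.0], [55.0, 40.0], [90.0, 88.0]])
part_means = np.array([[11.0, 11.0], [54.0, 41.0]])
print(recognise(features, part_means, part_var=4.0, image_area=100 * 100))
```

In this toy setting, two of the detected features lie close to the hypothetical part means, so the object hypothesis wins and the function returns True; the names and parameters here are illustrative assumptions, not quantities from the paper.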