Discovering Objects and their Localization in Images

Authors:
Josef Sivic;Bryan C. Russell;Alexei A. Efros;Andrew Zisserman;William T. Freeman
Affiliations:
University of Oxford;Massachusetts Institute of Technology;Carnegie Mellon University;University of Oxford;Massachusetts Institute of Technology
Venue:
ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision (ICCV'05) - Volume 1
Year:
2005

Citing 0
Cited 189

Dynamic topic models

ICML '06 Proceedings of the 23rd international conference on Machine learning
MILES: Multiple-Instance Learning via Embedded Instance Selection

IEEE Transactions on Pattern Analysis and Machine Intelligence
Local Features and Kernels for Classification of Texture and Object Categories: A Comprehensive Study

International Journal of Computer Vision
Review: Which is the best way to organize/classify images by content?

Image and Vision Computing
The Pyramid Match Kernel: Efficient Learning with Sets of Features

The Journal of Machine Learning Research
Efficient topic-based unsupervised name disambiguation

Proceedings of the 7th ACM/IEEE-CS joint conference on Digital libraries
Image retrieval on large-scale image databases

Proceedings of the 6th ACM international conference on Image and video retrieval
Multi-level local descriptor quantization for bag-of-visterms image representation

Proceedings of the 6th ACM international conference on Image and video retrieval
Matching ottoman words: an image retrieval approach to historical document indexing

Proceedings of the 6th ACM international conference on Image and video retrieval
Image classification using tensor representation

Proceedings of the 15th international conference on Multimedia
Object categorization

Foundations and Trends® in Computer Graphics and Vision
Generic object recognition with regional statistical models and layer joint boosting

Pattern Recognition Letters
Modeling Semantic Aspects for Cross-Media Image Indexing

IEEE Transactions on Pattern Analysis and Machine Intelligence
LabelMe: A Database and Web-Based Tool for Image Annotation

International Journal of Computer Vision
Learning to Recognize Objects with Little Supervision

International Journal of Computer Vision
Describing Visual Scenes Using Transformed Objects and Parts

International Journal of Computer Vision
Unsupervised texture classification: Automatically discover and classify texture patterns

Image and Vision Computing
Making colors worth more than a thousand words

Proceedings of the 2008 ACM symposium on Applied computing
Object recognition and segmentation in videos by connecting heterogeneous visual features

Computer Vision and Image Understanding
Unsupervised Learning of Human Action Categories Using Spatial-Temporal Words

International Journal of Computer Vision
Patch-based image classification through conditional random field model

Proceedings of the 3rd international conference on Mobile multimedia communications
Identifying relevant frames in weakly labeled videos for training concept detectors

CIVR '08 Proceedings of the 2008 international conference on Content-based image and video retrieval
Language modeling for bag-of-visual words image categorization

CIVR '08 Proceedings of the 2008 international conference on Content-based image and video retrieval
Continuous visual vocabulary models for pLSA-based scene recognition

CIVR '08 Proceedings of the 2008 international conference on Content-based image and video retrieval
Semantic image classification using statistical local spatial relations model

Multimedia Tools and Applications
Boosting with incomplete information

Proceedings of the 25th international conference on Machine learning
Learning an Alphabet of Shape and Appearance for Multi-Class Object Detection

International Journal of Computer Vision
Comparing Local Feature Descriptors in pLSA-Based Image Models

Proceedings of the 30th DAGM symposium on Pattern Recognition
Latent dirichlet allocation in web spam filtering

AIRWeb '08 Proceedings of the 4th international workshop on Adversarial information retrieval on the web
Unsupervised modeling and recognition of object categories with combination of visual contents and geometric similarity links

MIR '08 Proceedings of the 1st ACM international conference on Multimedia information retrieval
Categorizing bi-object video activities using bag of segments and causality features

VNBA '08 Proceedings of the 1st ACM workshop on Vision networks for behavior analysis
Some Objects Are More Equal Than Others: Measuring and Predicting Importance

ECCV '08 Proceedings of the 10th European Conference on Computer Vision: Part I
Contour Based Multi-object Classification Technology

ICIRA '08 Proceedings of the First International Conference on Intelligent Robotics and Applications: Part I
3D Object Recognition Using Hyper-Graphs and Ranked Local Invariant Features

SSPR & SPR '08 Proceedings of the 2008 Joint IAPR International Workshop on Structural, Syntactic, and Statistical Pattern Recognition
3D Object Modeling and Segmentation Based on Edge-Point Matching with Local Descriptors

ISVC '08 Proceedings of the 4th International Symposium on Advances in Visual Computing
An Integrated Method for Multiple Object Detection and Localization

ISVC '08 Proceedings of the 4th International Symposium on Advances in Visual Computing, Part II
Improving Recognition through Object Sub-categorization

ISVC '08 Proceedings of the 4th International Symposium on Advances in Visual Computing, Part II
Video retrieval based on object discovery

Computer Vision and Image Understanding
A New Multiple Kernel Approach for Visual Concept Learning

MMM '09 Proceedings of the 15th International Multimedia Modeling Conference on Advances in Multimedia Modeling
Can Geotags Help Image Recognition?

PSIVT '09 Proceedings of the 3rd Pacific Rim Symposium on Advances in Image and Video Technology
Recognizing Multiple Objects via Regression Incorporating the Co-occurrence of Categories

PSIVT '09 Proceedings of the 3rd Pacific Rim Symposium on Advances in Image and Video Technology
Region-based image retrieval using color-size features of watershed regions

Journal of Visual Communication and Image Representation
Seeing the Objects Behind the Dots: Recognition in Videos from a Moving Camera

International Journal of Computer Vision
Improving object detection with boosted histograms

Image and Vision Computing
Latent mixture vocabularies for object categorization and segmentation

Image and Vision Computing
Histogram of oriented rectangles: A new pose descriptor for human action recognition

Image and Vision Computing
Regional category parsing in undirected graphical models

Pattern Recognition Letters
Unsupervised modeling of objects and their hierarchical contextual interactions

Journal on Image and Video Processing - Special issue on patches in vision
Contextual classification of image patches with latent aspect models

Journal on Image and Video Processing - Special issue on patches in vision
Multi-view Object Detection Based on Spatial Consistency in a Low Dimensional Space

Proceedings of the 31st DAGM Symposium on Pattern Recognition
Foreground Focus: Unsupervised Learning from Partially Matching Images

International Journal of Computer Vision
Latent Dirichlet Allocation for Automatic Document Categorization

ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part II
Unsupervised categorization (filtering) of Google images based on visual consistency

AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 3
Sparse B-spline polynomial descriptors for human activity recognition

Image and Vision Computing
A new compositional technique for hand posture recognition

ICCOMP'09 Proceedings of the WSEAS 13th international conference on Computers
Semantics-preserving bag-of-words models for efficient image annotation

LS-MMRM '09 Proceedings of the First ACM workshop on Large-scale multimedia retrieval and mining
Tagging and retrieving images with co-occurrence models: from corel to flickr

LS-MMRM '09 Proceedings of the First ACM workshop on Large-scale multimedia retrieval and mining
Visual ContextRank for web image re-ranking

LS-MMRM '09 Proceedings of the First ACM workshop on Large-scale multimedia retrieval and mining
Vision for Robotics

Foundations and Trends in Robotics
Places clustering of full-length film key-frames using latent aspect modeling over SIFT matches

IEEE Transactions on Circuits and Systems for Video Technology
Image annotation within the context of personal photo collections using hierarchical event and scene models

IEEE Transactions on Multimedia - Special issue on integration of context and content
Scale-invariant visual language modeling for object categorization

IEEE Transactions on Multimedia - Special issue on integration of context and content
Robust and efficient feature tracking for indoor navigation

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Learning color names for real-world applications

IEEE Transactions on Image Processing
Automatic fruit and vegetable classification from images

Computers and Electronics in Agriculture
Relational indexing of vectorial primitives for symbol spotting in line-drawing images

Pattern Recognition Letters
A Bag of Features Approach for 3D Shape Retrieval

ISVC '09 Proceedings of the 5th International Symposium on Advances in Visual Computing: Part I
Efficient Hypothesis Generation through Sub-categorization for Multiple Object Detection

ISVC '09 Proceedings of the 5th International Symposium on Advances in Visual Computing: Part II
Structural Context for Object Categorization

PCM '09 Proceedings of the 10th Pacific Rim Conference on Multimedia: Advances in Multimedia Information Processing
Visual word pairs for automatic image annotation

ICME'09 Proceedings of the 2009 IEEE international conference on Multimedia and Expo
Leveraging social media for training object detectors

DSP'09 Proceedings of the 16th international conference on Digital Signal Processing
Unsupervised object of interest discovery in multi-view video sequence

ICACT'09 Proceedings of the 11th international conference on Advanced Communication Technology - Volume 3
Bayesian surprise and landmark detection

ICRA'09 Proceedings of the 2009 IEEE international conference on Robotics and Automation
A Study of Parts-Based Object Class Detection Using Complete Graphs

International Journal of Computer Vision
A compositional technique for hand posture recognition: new results

WSEAS TRANSACTIONS on COMMUNICATIONS
Statistical Methods and Models for Video-Based Tracking, Modeling, and Recognition

Foundations and Trends in Signal Processing
Region-based automatic web image selection

Proceedings of the international conference on Multimedia information retrieval
OPTIMOL: Automatic Online Picture Collection via Incremental Model Learning

International Journal of Computer Vision
Unsupervised Object Discovery: A Comparison

International Journal of Computer Vision
A Hierarchical and Contextual Model for Aerial Image Parsing

International Journal of Computer Vision
Learning natural scene categories by selective multi-scale feature extraction

Image and Vision Computing
Image classification using marginalized kernels for graphs

GbRPR'07 Proceedings of the 6th IAPR-TC-15 international conference on Graph-based representations in pattern recognition
Tutor-based learning of visual categories using different levels of supervision

Computer Vision and Image Understanding
Detecting, localizing and classifying visual traits from arbitrary viewpoints using probabilistic local feature modeling

AMFG'07 Proceedings of the 3rd international conference on Analysis and modeling of faces and gestures
Unsupervised identification of multiple objects of interest from multiple images: dISCOVER

ACCV'07 Proceedings of the 8th Asian conference on Computer vision - Volume Part II
Scene context modeling for foreground detection from a scene in remote monitoring

ISVC'07 Proceedings of the 3rd international conference on Advances in visual computing - Volume Part II
Kernel fusion for image classification using fuzzy structural information

ISVC'07 Proceedings of the 3rd international conference on Advances in visual computing - Volume Part II
Where are focused places of a photo?

VISUAL'07 Proceedings of the 9th international conference on Advances in visual information systems
Semi-latent Dirichlet allocation: a hierarchical model for human action recognition

Proceedings of the 2nd conference on Human motion: understanding, modeling, capture and animation
Selecting local region descriptors with a genetic algorithm for real-world place recognition

Evo'08 Proceedings of the 2008 conference on Applications of evolutionary computing
Embedding spatial information into image content description for scene retrieval

Pattern Recognition
A spatially aware generative model for image classification, topic discovery and segmentation

ICIP'09 Proceedings of the 16th IEEE international conference on Image processing
Action categorization by structural probabilistic latent semantic analysis

Computer Vision and Image Understanding
A bottom-up and top-down model for cell segmentation using multispectral data

ISBI'10 Proceedings of the 2010 IEEE international conference on Biomedical imaging: from nano to Macro
Semantics-preserving bag-of-words models and applications

IEEE Transactions on Image Processing
IPSILON: incremental parsing for semantic indexing of latent concepts

IEEE Transactions on Image Processing
Per-sample multiple kernel approach for visual concept learning

Journal on Image and Video Processing - Special issue on selected papers from multimedia modeling conference 2009
Fusing semantic aspects for image annotation and retrieval

Journal of Visual Communication and Image Representation
Vicept: link visual features to concepts for large-scale image understanding

Proceedings of the international conference on Multimedia
Unsupervised object category discovery via information bottleneck method

Proceedings of the international conference on Multimedia
Discriminative space-time voting for joint recognition and localization of actions.

Proceedings of the 2nd international workshop on Social signal processing
Automatic attribute discovery and characterization from noisy web data

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part I
Building compact local pairwise codebook with joint feature space clustering

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part I
Part-based feature synthesis for human detection

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part IV
Spatial statistics of visual keypoints for texture recognition

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part IV
Attribute-based transfer learning for object categorization with zero/one training example

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part V
Image classification using super-vector coding of local image descriptors

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part V
Voting by grouping dependent parts

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part V
Image segmentation with topic random field

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part V
Object recognition with hierarchical stel models

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part VI
Figure-ground image segmentation helps weakly-supervised learning of objects

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part VI
Large scale visual classification via learned dictionaries and sparse representation

AICI'10 Proceedings of the 2010 international conference on Artificial intelligence and computational intelligence: Part I
Learning visual object categories and their composition based on a probabilistic latent variable model

ICONIP'10 Proceedings of the 17th international conference on Neural information processing: theory and algorithms - Volume Part I
Bivariate feature localization for SIFT assuming a Gaussian feature shape

ISVC'10 Proceedings of the 6th international conference on Advances in visual computing - Volume Part I
Probabilistic learning of visual object composition from attended segments

ISVC'10 Proceedings of the 6th international conference on Advances in visual computing - Volume Part II
Regularized semi-supervised latent dirichlet allocation for visual concept learning

MMM'11 Proceedings of the 17th international conference on Advances in multimedia modeling - Volume Part I
Online learning for PLSA-based visual recognition

ACCV'10 Proceedings of the 10th Asian conference on Computer vision - Volume Part II
Identifying surprising events in videos using bayesian topic models

ACCV'10 Proceedings of the 10th Asian conference on Computer vision - Volume Part III
On feature combination and multiple kernel learning for object tracking

ACCV'10 Proceedings of the 10th Asian conference on Computer vision - Volume Part III
On the use of implicit shape models for recognition of object categories in 3D data

ACCV'10 Proceedings of the 10th Asian conference on Computer vision - Volume Part III
Reducing ambiguity in object recognition using relational information

ACCV'10 Proceedings of the 10th Asian conference on Computer vision - Volume Part IV
Online probabilistic topological mapping

International Journal of Robotics Research
Efficient clustering and quantisation of SIFT features: exploiting characteristics of the SIFT descriptor and interest region detectors under image inversion

Proceedings of the 1st ACM International Conference on Multimedia Retrieval
Event detection with spatial latent Dirichlet allocation

Proceedings of the 11th annual international ACM/IEEE joint conference on Digital libraries
Visual words on baggage X-ray images

CAIP'11 Proceedings of the 14th international conference on Computer analysis of images and patterns - Volume Part I
Unsupervised feature selection and category formation for generic object recognition

CAIP'11 Proceedings of the 14th international conference on Computer analysis of images and patterns - Volume Part I
Spectral clustering of ROIs for object discovery

DAGM'11 Proceedings of the 33rd international conference on Pattern recognition
Learning and classifying actions of construction workers and equipment using Bag-of-Video-Feature-Words and Bayesian network models

Advanced Engineering Informatics
Combined 2D-3D categorization and classification for multimodal perception systems

International Journal of Robotics Research
Hybrid generative-discriminative nucleus classification of renal cell carcinoma

SIMBAD'11 Proceedings of the First international conference on Similarity-based pattern recognition
Regionwise classification of building facade images

PIA'11 Proceedings of the 2011 ISPRS conference on Photogrammetric image analysis
A hierarchical latent topic model based on sparse coding

Neurocomputing
Low-dimensional and comprehensive color texture description

Computer Vision and Image Understanding
Towards unsupervised discovery of visual categories

DAGM'06 Proceedings of the 28th conference on Pattern Recognition
Learning of graphical models and efficient inference for object class recognition

DAGM'06 Proceedings of the 28th conference on Pattern Recognition
On the use of topic models for word completion

FinTAL'06 Proceedings of the 5th international conference on Advances in Natural Language Processing
Scene classification via pLSA

ECCV'06 Proceedings of the 9th European conference on Computer Vision - Volume Part IV
Learning pairwise image similarities for multi-classification using Kernel Regression Trees

Pattern Recognition
Sorting unorganized photo sets for urban reconstruction

Graphical Models
Learning semantic features for action recognition via diffusion maps

Computer Vision and Image Understanding
Integrating local action elements for action analysis

Computer Vision and Image Understanding
A robust approach for object recognition

PCM'06 Proceedings of the 7th Pacific Rim conference on Advances in Multimedia Information Processing
Incremental visual objects clustering with the growing vocabulary tree

Multimedia Tools and Applications
Accurate Object Recognition with Shape Masks

International Journal of Computer Vision
Interesting Interest Points

International Journal of Computer Vision
Incorporating spatial correlogram into bag-of-features model for scene categorization

ACCV'09 Proceedings of the 9th Asian conference on Computer Vision - Volume Part I
A boundary-fragment-model for object detection

ECCV'06 Proceedings of the 9th European conference on Computer Vision - Volume Part II
Single-Histogram class models for image segmentation

ICVGIP'06 Proceedings of the 5th Indian conference on Computer Vision, Graphics and Image Processing
Object localization by subspace clustering of local descriptors

ICVGIP'06 Proceedings of the 5th Indian conference on Computer Vision, Graphics and Image Processing
Encoding spatial arrangement of visual words

CIARP'11 Proceedings of the 16th Iberoamerican Congress conference on Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications
Semantic parsing of street scenes from video

International Journal of Robotics Research
Leveraging social media for scalable object detection

Pattern Recognition
Efficient storage and decoding of SURF feature points

MMM'12 Proceedings of the 18th international conference on Advances in Multimedia Modeling
Improving performance of topic models by variable grouping

IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Two
3D Scene interpretation by combining probability theory and logic: The tower of knowledge

Computer Vision and Image Understanding
Probabilistic semantic component descriptor

Multimedia Tools and Applications
Multi-modal region selection approach for training object detectors

Proceedings of the 2nd ACM International Conference on Multimedia Retrieval
Improving Image Classification Using Semantic Attributes

International Journal of Computer Vision
Incorporating shape into spatially-aware adaptive object segmentation algorithm

Proceedings of the Fifth International C* Conference on Computer Science and Software Engineering
Spatial-Based feature for locating objects

ICIC'12 Proceedings of the 8th international conference on Intelligent Computing Theories and Applications
Semi-supervised vehicle recognition: an approximate region constrained approach

RSKT'12 Proceedings of the 7th international conference on Rough Sets and Knowledge Technology
Stel Component Analysis: Joint Segmentation, Modeling and Recognition of Objects Classes

International Journal of Computer Vision
Human motion retrieval using topic model

Computer Animation and Virtual Worlds
Supervised learning probabilistic Latent Semantic Analysis for human motion analysis

Neurocomputing
Randomized spatial partition for scene recognition

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part II
Ensemble partitioning for unsupervised image categorization

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part III
"Clustering by composition": unsupervised discovery of image categories

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part VII
ISABoost: A weak classifier inner structure adjusting based AdaBoost algorithm-ISABoost based application in scene categorization

Neurocomputing
Unsupervised mining of long time series based on latent topic model

Neurocomputing
Visual categorization based on learning contextual probabilistic latent component tree

ICANN'12 Proceedings of the 22nd international conference on Artificial Neural Networks and Machine Learning - Volume Part I
An improved method of action recognition based on sparse spatio-temporal features

AIMSA'12 Proceedings of the 15th international conference on Artificial Intelligence: methodology, systems, and applications
Are buildings only instances?: exploration in architectural style categories

Proceedings of the Eighth Indian Conference on Computer Vision, Graphics and Image Processing
Latent Dirichlet allocation for image segmentation and source finding in radio astronomy images

Proceedings of the 27th Conference on Image and Vision Computing New Zealand
A novel shape-based non-redundant local binary pattern descriptor for object detection

Pattern Recognition
Beyond Independence: An Extension of the A Contrario Decision Procedure

International Journal of Computer Vision
A region-centered topic model for object discovery and category-based image segmentation

Pattern Recognition
Learning hierarchical bag of words using naive bayes clustering

ACCV'12 Proceedings of the 11th Asian conference on Computer Vision - Volume Part I
Image upscaling using multiple dictionaries of natural image patches

ACCV'12 Proceedings of the 11th Asian conference on Computer Vision - Volume Part III
Sparse online topic models

Proceedings of the 22nd international conference on World Wide Web
Biomedical time series clustering based on non-negative sparse coding and probabilistic topic model

Computer Methods and Programs in Biomedicine
Indoor scene recognition by a mobile robot through adaptive object detection

Robotics and Autonomous Systems
Object class detection: A survey

ACM Computing Surveys (CSUR)
Aesthetic capital: what makes london look beautiful, quiet, and happy?

Proceedings of the 17th ACM conference on Computer supported cooperative work & social computing
The multi-feature information bottleneck with application to unsupervised image categorization

IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Learning group-based dictionaries for discriminative image representation

Pattern Recognition
Visual word spatial arrangement for image retrieval and classification

Pattern Recognition
A co-boost framework for learning object categories from Google Images with 1st and 2nd order features

The Visual Computer: International Journal of Computer Graphics
Image categorization using a semantic hierarchy model with sparse set of salient regions

Frontiers of Computer Science: Selected Publications from Chinese Universities
An Improved Hierarchical Dirichlet Process-Hidden Markov Model and Its Application to Trajectory Modeling and Retrieval

International Journal of Computer Vision

Abstract

We seek to discover the object categories depicted in a set of unlabelled images. We achieve this using a model developed in the statistical text literature: probabilistic Latent Semantic Analysis (pLSA). In text analysis this is used to discover topics in a corpus using the bag-of-words document representation. Here we treat object categories as topics, so that an image containing instances of several categories is modeled as a mixture of topics. The model is applied to images by using a visual analogue of a word, formed by vector quantizing SIFT-like region descriptors. The topic discovery approach successfully translates to the visual domain: for a small set of objects, we show that both the object categories and their approximate spatial layout are found without supervision. Performance of this unsupervised method is compared to the supervised approach of Fergus et al. [8] on a set of unseen images containing only one object per image. We also extend the bag-of-words vocabulary to include "doublets" which encode spatially local co-occurring regions. It is demonstrated that this extended vocabulary gives a cleaner image segmentation. Finally, the classification and segmentation methods are applied to a set of images containing multiple objects per image. These results demonstrate that we can successfully build object class models from an unsupervised analysis of images.
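
To make the modelling idea in the abstract concrete, the following is a minimal sketch of pLSA fitted by EM on an image-by-visual-word count table. In pLSA each image d is a mixture of topics z, so that P(w|d) = sum over z of P(w|z) P(z|d). This is not the authors' implementation: the function names (`plsa`, `log_likelihood`), the random initialisation, the small smoothing constant and the iteration count are all assumptions made for illustration, and the toy counts stand in for histograms of vector-quantized region descriptors.

```python
import math
import random


def normalize(row):
    """Scale a list of non-negative numbers so that it sums to 1."""
    total = sum(row)
    return [x / total for x in row]


def plsa(counts, n_topics, n_iter=100, seed=0):
    """Fit pLSA by EM (illustrative sketch, not the paper's code).

    counts[d][w] is how often visual word w occurs in image d.
    Returns (p_w_z, p_z_d): p_w_z[z][w] = P(w|z), p_z_d[d][z] = P(z|d).
    """
    rng = random.Random(seed)
    n_docs, n_words = len(counts), len(counts[0])
    # Random initialisation (an assumption; other schemes are possible).
    p_w_z = [normalize([rng.random() + 1e-3 for _ in range(n_words)])
             for _ in range(n_topics)]
    p_z_d = [normalize([rng.random() + 1e-3 for _ in range(n_topics)])
             for _ in range(n_docs)]
    for _ in range(n_iter):
        # Accumulators start at a tiny constant to avoid division by zero.
        acc_w_z = [[1e-12] * n_words for _ in range(n_topics)]
        acc_z_d = [[1e-12] * n_topics for _ in range(n_docs)]
        for d in range(n_docs):
            for w in range(n_words):
                n = counts[d][w]
                if n == 0:
                    continue
                # E-step: posterior P(z|d,w) proportional to P(w|z) P(z|d).
                post = normalize([p_w_z[z][w] * p_z_d[d][z]
                                  for z in range(n_topics)])
                # M-step accumulation: expected counts per topic.
                for z in range(n_topics):
                    acc_w_z[z][w] += n * post[z]
                    acc_z_d[d][z] += n * post[z]
        p_w_z = [normalize(row) for row in acc_w_z]
        p_z_d = [normalize(row) for row in acc_z_d]
    return p_w_z, p_z_d


def log_likelihood(counts, p_w_z, p_z_d):
    """Data log-likelihood: sum of n(d,w) * log sum_z P(w|z) P(z|d)."""
    total = 0.0
    for d, row in enumerate(counts):
        for w, n in enumerate(row):
            if n:
                p = sum(p_w_z[z][w] * p_z_d[d][z] for z in range(len(p_w_z)))
                total += n * math.log(p)
    return total


if __name__ == "__main__":
    # Hypothetical toy data: 4 "images" over a 6-word visual vocabulary.
    toy = [[5, 4, 6, 0, 0, 0],
           [6, 5, 4, 0, 0, 0],
           [0, 0, 0, 5, 6, 4],
           [0, 0, 0, 4, 5, 6]]
    p_w_z, p_z_d = plsa(toy, n_topics=2)
    for d, mix in enumerate(p_z_d):
        print(d, [round(p, 3) for p in mix])
```

In a setting like the one the abstract describes, each row of `p_z_d` would be read as the topic (object category) mixture of one image, and the posterior P(z|d,w) computed in the E-step is the kind of per-word quantity that could be used to label regions for approximate localization. The sketch uses only the standard library so that it stays self-contained; a practical version would more likely vectorize the updates.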