Describing Visual Scenes Using Transformed Objects and Parts

  • Authors:
  • Erik B. Sudderth; Antonio Torralba; William T. Freeman; Alan S. Willsky

  • Affiliations:
  • Erik B. Sudderth: Computer Science Division, University of California, Berkeley, USA
  • Antonio Torralba, William T. Freeman, Alan S. Willsky: Electrical Engineering & Computer Science, Massachusetts Institute of Technology, Cambridge, USA

  • Venue:
  • International Journal of Computer Vision
  • Year:
  • 2008


Abstract

We develop hierarchical, probabilistic models for objects, the parts composing them, and the visual scenes surrounding them. Our approach couples topic models originally developed for text analysis with spatial transformations, and thus consistently accounts for geometric constraints. By building integrated scene models, we may discover contextual relationships, and better exploit partially labeled training images. We first consider images of isolated objects, and show that sharing parts among object categories improves detection accuracy when learning from few examples. Turning to multiple object scenes, we propose nonparametric models which use Dirichlet processes to automatically learn the number of parts underlying each object category, and objects composing each scene. The resulting transformed Dirichlet process (TDP) leads to Monte Carlo algorithms which simultaneously segment and recognize objects in street and office scenes.
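The abstract couples a Dirichlet process (which lets the number of mixture components grow with the data) with per-object spatial transformations. The following is a minimal illustrative sketch of that idea, not the authors' algorithm: it draws part weights via a truncated stick-breaking construction, samples 2-D part locations in an object-centered frame, and then applies a random translation as the "transformation" placing the object in the scene. All function names and parameter values here are hypothetical choices for illustration.

```python
import numpy as np

def stick_breaking_weights(alpha, num_sticks, rng):
    """Truncated stick-breaking construction of Dirichlet process weights."""
    betas = rng.beta(1.0, alpha, size=num_sticks)
    # Remaining stick length before each break: 1, (1-b1), (1-b1)(1-b2), ...
    remaining = np.concatenate([[1.0], np.cumprod(1.0 - betas)[:-1]])
    weights = betas * remaining
    return weights / weights.sum()  # renormalize to absorb the truncation

def sample_transformed_object(alpha, num_features, rng, truncation=20):
    """Sample feature positions from a DP mixture of Gaussian 'parts',
    then shift the whole object by a random scene-level translation."""
    weights = stick_breaking_weights(alpha, truncation, rng)
    part_means = rng.normal(0.0, 5.0, size=(truncation, 2))  # parts in object frame
    assignments = rng.choice(truncation, size=num_features, p=weights)
    features = part_means[assignments] + rng.normal(0.0, 0.5, size=(num_features, 2))
    translation = rng.normal(0.0, 10.0, size=2)  # the object's pose in the scene
    return features + translation, assignments

rng = np.random.default_rng(0)
features, parts = sample_transformed_object(alpha=2.0, num_features=100, rng=rng)
```

Because the stick-breaking weights decay stochastically, only a handful of the 20 truncated parts typically receive most of the features, mimicking how a DP infers an effective number of parts per object category.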