Image Parsing: Unifying Segmentation, Detection, and Recognition

Authors:
Zhuowen Tu;Xiangrong Chen;Alan L. Yuille;Song-Chun Zhu
Affiliations:
Departments of Statistics, University of California, Los Angeles, Los Angeles, USA 90095;Departments of Statistics, University of California, Los Angeles, Los Angeles, USA 90095;Departments of `Statistics' and `Psychology', University of California, Los Angeles, Los Angeles, USA 90095;Departments of `Statistics' and `Computer Science', University of California, Los Angeles, Los Angeles, USA 90095
Venue:
International Journal of Computer Vision
Year:
2005

Citing 26
Cited 66

An introduction to digital image processing

An introduction to digital image processing
A Computational Approach to Edge Detection

IEEE Transactions on Pattern Analysis and Machine Intelligence
Features and objects in visual processing

Scientific American
Diffusions for global optimizations

SIAM Journal on Control and Optimization
Elements of information theory

Elements of information theory
The Helmholtz machine

Neural Computation
Region Competition: Unifying Snakes, Region Growing, and Bayes/MDL for Multiband Image Segmentation

IEEE Transactions on Pattern Analysis and Machine Intelligence
Probabilistic Visual Learning for Object Representation

IEEE Transactions on Pattern Analysis and Machine Intelligence
Two- and three-dimensional patterns of the face

Two- and three-dimensional patterns of the face
Foundations of statistical natural language processing

Foundations of statistical natural language processing
Normalized Cuts and Image Segmentation

IEEE Transactions on Pattern Analysis and Machine Intelligence
Active Appearance Models

IEEE Transactions on Pattern Analysis and Machine Intelligence
Image Segmentation by Data-Driven Markov Chain Monte Carlo

IEEE Transactions on Pattern Analysis and Machine Intelligence
Contour and Texture Analysis for Image Segmentation

International Journal of Computer Vision
Edge detector evaluation using empirical ROC curves

Computer Vision and Image Understanding - Special issue on empirical evaluation of computer vision algorithms
Shape Matching and Object Recognition Using Shape Contexts

IEEE Transactions on Pattern Analysis and Machine Intelligence
Statistical Edge Detection: Learning and Evaluating Edge Cues

IEEE Transactions on Pattern Analysis and Machine Intelligence
Parsing Images into Region and Curve Processes

ECCV '02 Proceedings of the 7th European Conference on Computer Vision-Part III
Mean Shift Analysis and Applications

ICCV '99 Proceedings of the International Conference on Computer Vision-Volume 2 - Volume 2
Bayesian Reconstruction of 3D Shapes and Scenes From A Single Image

HLK '03 Proceedings of the First IEEE International Workshop on Higher-Level Knowledge in 3D Modeling and Motion Analysis
Graph Partition by Swendsen-Wang Cuts

ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
Discriminative Random Fields: A Discriminative Framework for Contextual Interaction in Classification

ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
Distinctive Image Features from Scale-Invariant Keypoints

International Journal of Computer Vision
A generative constituent-context model for improved grammar induction

ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Detecting and reading text in natural scenes

CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
Multigrid and multi-level Swendsen-Wang cuts for hierarchic graph partition

CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition

Segmentation and description of natural outdoor scenes

Image and Vision Computing
Robust model-based scene interpretation by multilayered context information

Computer Vision and Image Understanding
Primal sketch: Integrating structure and texture

Computer Vision and Image Understanding
Efficient Shape Modeling: ⋮-Entropy, Adaptive Coding, and Boundary Curves -vs- Blum's Medial Axis

International Journal of Computer Vision
Recovering Surface Layout from an Image

International Journal of Computer Vision
POP: Patchwork of Parts Models for Object Recognition

International Journal of Computer Vision
A stochastic grammar of images

Foundations and Trends® in Computer Graphics and Vision
Shape matching and registration by data-driven EM

Computer Vision and Image Understanding
Learning Probabilistic Models for Contour Completion in Natural Images

International Journal of Computer Vision
Describing Visual Scenes Using Transformed Objects and Parts

International Journal of Computer Vision
Putting Objects in Perspective

International Journal of Computer Vision
An efficient algorithm for attention-driven image interpretation from segments

Pattern Recognition
Contour Grouping with Partial Shape Similarity

PSIVT '09 Proceedings of the 3rd Pacific Rim Symposium on Advances in Image and Video Technology
A stochastic graph grammar for compositional object representation and recognition

Pattern Recognition
Contour Grouping Based on Contour-Skeleton Duality

International Journal of Computer Vision
Qualitative spatial relationships for image interpretation by using a conceptual graph

Image and Vision Computing
Regional category parsing in undirected graphical models

Pattern Recognition Letters
Unsupervised modeling of objects and their hierarchical contextual interactions

Journal on Image and Video Processing - Special issue on patches in vision
The generalized A* architecture

Journal of Artificial Intelligence Research
A new compositional technique for hand posture recognition

ICCOMP'09 Proceedings of the WSEAES 13th international conference on Computers
From image parsing to painterly rendering

ACM Transactions on Graphics (TOG)
Textual description of shapes

Journal of Visual Communication and Image Representation
Using visual context and region semantics for high-level concept detection

IEEE Transactions on Multimedia - Special issue on integration of context and content
Preferential image segmentation using trees of shapes

IEEE Transactions on Image Processing
Object detection using spatial histogram features

Image and Vision Computing
The framework of target recognition in the field of remote sensing image based on knowledge

CCDC'09 Proceedings of the 21st annual international conference on Chinese control and decision conference
An improved edge-based text region segmentation algorithm applied to slab image data from steel plant

CGIM '08 Proceedings of the Tenth IASTED International Conference on Computer Graphics and Imaging
Coupled region-edge shape priors for simultaneous localization and figure-ground segmentation

Pattern Recognition
Learning 3D mesh segmentation and labeling

ACM SIGGRAPH 2010 papers
Introduction to a large-scale general purpose ground truth database: methodology, annotation tool and benchmarks

EMMCVPR'07 Proceedings of the 6th international conference on Energy minimization methods in computer vision and pattern recognition
Compositional object recognition, segmentation, and tracking in video

EMMCVPR'07 Proceedings of the 6th international conference on Energy minimization methods in computer vision and pattern recognition
Bottom-up and top-down object matching using asynchronous agents and a contrario principles

ICVS'08 Proceedings of the 6th international conference on Computer vision systems
Sisley the abstract painter

NPAR '10 Proceedings of the 8th International Symposium on Non-Photorealistic Animation and Rendering
Detecting object boundaries using low-, mid-, and high-level information

Computer Vision and Image Understanding
Geometric image parsing in man-made environments

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part II
What, where and how many? combining object detectors and CRFs

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part IV
Monocular 3D scene modeling and inference: understanding multi-object traffic scenes

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part IV
Thinking inside the box: using appearance models and context based on room geometry

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part VI
Inference scene labeling by incorporating object detection with explicit shape model

ACCV'10 Proceedings of the 10th Asian conference on Computer vision - Volume Part III
A Numerical Study of the Bottom-Up and Top-Down Inference Processes in And-Or Graphs

International Journal of Computer Vision
Inference and Learning with Hierarchical Shape Models

International Journal of Computer Vision
Fast object detection using steiner tree

Machine Graphics & Vision International Journal
The complex wave representation of distance transforms

EMMCVPR'11 Proceedings of the 8th international conference on Energy minimization methods in computer vision and pattern recognition
Geometric Latent Dirichlet Allocation on a Matching Graph for Large-scale Image Datasets

International Journal of Computer Vision
What can we learn from biological vision studies for human motion segmentation?

ISVC'06 Proceedings of the Second international conference on Advances in Visual Computing - Volume Part II
Learning and incorporating top-down cues in image segmentation

ECCV'06 Proceedings of the 9th European conference on Computer Vision - Volume Part I
Harmony Potentials

International Journal of Computer Vision
Automatic image segmentation by positioning a seed

ECCV'06 Proceedings of the 9th European conference on Computer Vision - Volume Part II
Image label completion by pursuing contextual decomposability

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Skull-closed autonomous development

ICONIP'11 Proceedings of the 18th international conference on Neural Information Processing - Volume Part I
A probabilistic model for component-based shape synthesis

ACM Transactions on Graphics (TOG) - SIGGRAPH 2012 Conference Proceedings
Structured Learning and Prediction in Computer Vision

Foundations and Trends® in Computer Graphics and Vision
Understanding web images by object relation network

Proceedings of the 21st international conference on World Wide Web
Geometric Image Parsing in Man-Made Environments

International Journal of Computer Vision
Learning a generative model of images by factoring appearance and shape

Neural Computation
Discriminative Appearance Models for Pictorial Structures

International Journal of Computer Vision
Hough regions for joining instance localization and segmentation

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part III
Segmentation propagation in imagenet

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part VII
Abstract painting with interactive control of perceptual entropy

ACM Transactions on Applied Perception (TAP)
Simultaneous Bayesian clustering and feature selection using RJMCMC-based learning of finite generalized Dirichlet mixture models

Signal Processing
A constraint propagation approach to structural model based image segmentation and recognition

Information Sciences: an International Journal
Selective Search for Object Recognition

International Journal of Computer Vision
Automated segmentation of the cerebellar lobules using boundary specific classification and evolution

IPMI'13 Proceedings of the 23rd international conference on Information Processing in Medical Imaging
Learning discriminative localization from weakly labeled data

Pattern Recognition
Object Bank: An Object-Level Image Representation for High-Level Visual Recognition

International Journal of Computer Vision
Learning what is where from unlabeled images: joint localization and clustering of foreground objects

Machine Learning

Quantified Score

Hi-index	0.02

Visualization

Abstract

In this paper we present a Bayesian framework for parsing images into their constituent visual patterns. The parsing algorithm optimizes the posterior probability and outputs a scene representation as a "parsing graph", in a spirit similar to parsing sentences in speech and natural language. The algorithm constructs the parsing graph and re-configures it dynamically using a set of moves, which are mostly reversible Markov chain jumps. This computational framework integrates two popular inference approaches--generative (top-down) methods and discriminative (bottom-up) methods. The former formulates the posterior probability in terms of generative models for images defined by likelihood functions and priors. The latter computes discriminative probabilities based on a sequence (cascade) of bottom-up tests/filters. In our Markov chain algorithm design, the posterior probability, defined by the generative models, is the invariant (target) probability for the Markov chain, and the discriminative probabilities are used to construct proposal probabilities to drive the Markov chain. Intuitively, the bottom-up discriminative probabilities activate top-down generative models. In this paper, we focus on two types of visual patterns--generic visual patterns, such as texture and shading, and object patterns including human faces and text. These types of patterns compete and cooperate to explain the image and so image parsing unifies image segmentation, object detection, and recognition (if we use generic visual patterns only then image parsing will correspond to image segmentation (Tu and Zhu, 2002. IEEE Trans. PAMI, 24(5):657--673). We illustrate our algorithm on natural images of complex city scenes and show examples where image segmentation can be improved by allowing object specific knowledge to disambiguate low-level segmentation cues, and conversely where object detection can be improved by using generic visual patterns to explain away shadows and occlusions.