Bottom-up/Top-Down Image Parsing by Attribute Graph Grammar

Authors:
Feng Han;Song-Chun Zhu
Affiliations:
University of California at Los Angeles;University of California at Los Angeles
Venue:
ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision - Volume 2
Year:
2005

Citing 0
Cited 24

Efficient search and verification for function based classification from real range images
Computer Vision and Image Understanding

Image-based procedural modeling of facades
ACM SIGGRAPH 2007 papers

Recovering Surface Layout from an Image
International Journal of Computer Vision

A stochastic grammar of images
Foundations and Trends® in Computer Graphics and Vision

Static and dynamic abstract formal models for 3D sensor images
WSEAS TRANSACTIONS on SYSTEMS

Semantic event representation and recognition using syntactic attribute graph grammar
Pattern Recognition Letters

Analysis of Building Textures for Reconstructing Partially Occluded Facades
ECCV '08 Proceedings of the 10th European Conference on Computer Vision: Part I

A stochastic graph grammar for compositional object representation and recognition
Pattern Recognition

A perception oriented formal model for 3D sensor depth images
ICS'08 Proceedings of the 12th WSEAS international conference on Systems

Multiple 3D sensor views object models correspondence
ICS'09 Proceedings of the 13th WSEAS international conference on Systems

A new compositional technique for hand posture recognition
ICCOMP'09 Proceedings of the WSEAS 13th international conference on Computers

A Hierarchical and Contextual Model for Aerial Image Parsing
International Journal of Computer Vision

Object category recognition using generative template boosting
EMMCVPR'07 Proceedings of the 6th international conference on Energy minimization methods in computer vision and pattern recognition

Contour based object detection using part bundles
Computer Vision and Image Understanding

Insertion of 3-D-primitives in mesh-based representations: towards compact models preserving the details
IEEE Transactions on Image Processing

Visual alphabets on different levels of abstraction for the recognition of deformable objects
SSPR&SPR'10 Proceedings of the 2010 joint IAPR international conference on Structural, syntactic, and statistical pattern recognition

Towards recognition of degraded words by probabilistic parsing
Proceedings of the Seventh Indian Conference on Computer Vision, Graphics and Image Processing

A visual grammar for face detection
IBERAMIA'10 Proceedings of the 12th Ibero-American conference on Advances in artificial intelligence

Inference and Learning with Hierarchical Shape Models
International Journal of Computer Vision

Detecting instances of shape classes that exhibit variable structure
ECCV'06 Proceedings of the 9th European conference on Computer Vision - Volume Part I

Explaining Activities as Consistent Groups of Events
International Journal of Computer Vision

Object categorization with sketch representation and generalized samples
Pattern Recognition

Supervised geodesic propagation for semantic label transfer
ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part III

Using grammars for pattern recognition in images: A systematic review
ACM Computing Surveys (CSUR)

Abstract

In this paper, we present an attribute graph grammar for image parsing on scenes with man-made objects, such as buildings, hallways, kitchens, and living rooms. We choose one class of primitives, 3D planar rectangles projected on images, and six graph grammar production rules. Each production rule not only expands a node into its components, but also includes a number of equations that constrain the attributes of a parent node and those of its children. Thus our graph grammar is context sensitive. The grammar rules are used recursively to produce a large number of objects and patterns in images, and thus the whole graph grammar is a type of generative model. The inference algorithm integrates bottom-up rectangle detection, which activates top-down prediction using the grammar rules. The final results are validated in a Bayesian framework. The output of the inference is a hierarchical parsing graph with objects, surfaces, rectangles, and their spatial relations. In the inference, the acceptance of a grammar rule means a recognition of an object, and actions are taken to pass the attributes between a node and its parent through the constraint equations associated with this production rule. When an attribute is passed from a child node to a parent node, it is called bottom-up, and the opposite is called top-down.
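
The idea of a production rule that carries constraint equations, with attributes flowing in both directions, can be illustrated with a small sketch. This is not the paper's implementation: it assumes axis-aligned 2D rectangles instead of projected 3D planar rectangles, and the `Rect`, `Row`, `bottom_up`, and `top_down` names, as well as the "equally spaced row" rule itself, are hypothetical and chosen only for illustration. The Bayesian validation step mentioned in the abstract is omitted.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Rect:
    """Toy primitive: an axis-aligned 2D rectangle (assumption, not the paper's 3D model)."""
    x: float
    y: float
    w: float
    h: float


@dataclass
class Row:
    """Hypothetical parent node: a row of equally sized, equally spaced rectangles."""
    x0: float    # x of the first rectangle
    y: float     # shared y of all rectangles
    w: float     # shared width
    h: float     # shared height
    step: float  # horizontal distance between consecutive rectangles
    children: List[Rect] = field(default_factory=list)


def constraints_hold(children: List[Rect], tol: float = 1e-6) -> bool:
    """Constraint equations of the toy rule: same y, w, h and constant spacing."""
    if len(children) < 2:
        return False
    first = children[0]
    step = children[1].x - first.x
    for i, c in enumerate(children):
        same_shape = (abs(c.y - first.y) <= tol and
                      abs(c.w - first.w) <= tol and
                      abs(c.h - first.h) <= tol)
        on_grid = abs(c.x - (first.x + i * step)) <= tol
        if not (same_shape and on_grid):
            return False
    return True


def bottom_up(detected: List[Rect]) -> Optional[Row]:
    """Accept the rule if the constraints hold; attributes pass child -> parent."""
    children = sorted(detected, key=lambda r: r.x)
    if not constraints_hold(children):
        return None
    first = children[0]
    return Row(first.x, first.y, first.w, first.h,
               children[1].x - first.x, children)


def top_down(row: Row) -> Rect:
    """Attributes pass parent -> child: predict the next rectangle in the row."""
    n = len(row.children)
    return Rect(row.x0 + n * row.step, row.y, row.w, row.h)


detected = [Rect(0, 0, 2, 3), Rect(4, 0, 2, 3), Rect(8, 0, 2, 3)]
row = bottom_up(detected)
if row is not None:
    print(top_down(row))
```

In this sketch, accepting the rule in `bottom_up` corresponds to recognizing the composite pattern, and `top_down` uses the parent's attributes to propose where an undetected rectangle might be. A fuller system would score such proposals against image evidence before accepting them.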