Superparsing

Authors:
Joseph Tighe;Svetlana Lazebnik
Affiliations:
Computer Science Department, University of North Carolina, Chapel Hill, USA;Computer Science Department, University of North Carolina, Chapel Hill, USA
Venue:
International Journal of Computer Vision
Year:
2013

Citing 26
Cited 2

Fast Approximate Energy Minimization via Graph Cuts

IEEE Transactions on Pattern Analysis and Machine Intelligence
Learning a Classification Model for Segmentation

ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
What Energy Functions Can Be Minimizedvia Graph Cuts?

IEEE Transactions on Pattern Analysis and Machine Intelligence
Efficient Graph-Based Image Segmentation

International Journal of Computer Vision
An Experimental Comparison of Min-Cut/Max-Flow Algorithms for Energy Minimization in Vision

IEEE Transactions on Pattern Analysis and Machine Intelligence
Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories

CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
Recovering Surface Layout from an Image

International Journal of Computer Vision
LabelMe: A Database and Web-Based Tool for Image Annotation

International Journal of Computer Vision
80 Million Tiny Images: A Large Data Set for Nonparametric Object and Scene Recognition

IEEE Transactions on Pattern Analysis and Machine Intelligence
Learning Spatial Context: Using Stuff to Find Things

ECCV '08 Proceedings of the 10th European Conference on Computer Vision: Part I
Segmentation and Recognition Using Structure from Motion Point Clouds

ECCV '08 Proceedings of the 10th European Conference on Computer Vision: Part I
Context based object categorization: A critical survey

Computer Vision and Image Understanding
What, where and how many? combining object detectors and CRFs

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part IV
Semantic segmentation of urban scenes using dense depth maps

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part IV
Superparsing: scalable nonparametric image parsing with superpixels

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part V
Thinking inside the box: using appearance models and context based on room geometry

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part VI
Multiscale conditional random fields for image labeling

CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
SIFT Flow: Dense Correspondence across Scenes and Its Applications

IEEE Transactions on Pattern Analysis and Machine Intelligence
Nonparametric Scene Parsing via Label Transfer

IEEE Transactions on Pattern Analysis and Machine Intelligence
TextonBoost: joint appearance, shape and context modeling for multi-class object recognition and segmentation

ECCV'06 Proceedings of the 9th European conference on Computer Vision - Volume Part I
From 3D scene geometry to human workspace

CVPR '11 Proceedings of the 2011 IEEE Conference on Computer Vision and Pattern Recognition
Evaluation of super-voxel methods for early video processing

CVPR '12 Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Nonparametric image parsing using adaptive neighbor sets

CVPR '12 Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Understanding scenes on many levels

ICCV '11 Proceedings of the 2011 International Conference on Computer Vision
Ensemble of exemplar-SVMs for object detection and beyond

ICCV '11 Proceedings of the 2011 International Conference on Computer Vision
Decision tree fields

ICCV '11 Proceedings of the 2011 International Conference on Computer Vision

Class-Specified segmentation with multi-scale superpixels

ACCV'12 Proceedings of the 11th international conference on Computer Vision - Volume Part I
Hand segmentation for gesture recognition in EGO-vision

Proceedings of the 3rd ACM international workshop on Interactive multimedia on mobile & portable devices

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper presents a simple and effective nonparametric approach to the problem of image parsing, or labeling image regions (in our case, superpixels produced by bottom-up segmentation) with their categories. This approach is based on lazy learning, and it can easily scale to datasets with tens of thousands of images and hundreds of labels. Given a test image, it first performs global scene-level matching against the training set, followed by superpixel-level matching and efficient Markov random field (MRF) optimization for incorporating neighborhood context. Our MRF setup can also compute a simultaneous labeling of image regions into semantic classes (e.g., tree, building, car) and geometric classes (sky, vertical, ground). Our system outperforms the state-of-the-art nonparametric method based on SIFT Flow on a dataset of 2,688 images and 33 labels. In addition, we report per-pixel rates on a larger dataset of 45,676 images and 232 labels. To our knowledge, this is the first complete evaluation of image parsing on a dataset of this size, and it establishes a new benchmark for the problem. Finally, we present an extension of our method to video sequences and report results on a video dataset with frames densely labeled at 1 Hz.