Fast Approximate Energy Minimization via Graph Cuts
IEEE Transactions on Pattern Analysis and Machine Intelligence
Learning a Classification Model for Segmentation
ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
What Energy Functions Can Be Minimizedvia Graph Cuts?
IEEE Transactions on Pattern Analysis and Machine Intelligence
Efficient Graph-Based Image Segmentation
International Journal of Computer Vision
An Experimental Comparison of Min-Cut/Max-Flow Algorithms for Energy Minimization in Vision
IEEE Transactions on Pattern Analysis and Machine Intelligence
Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories
CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
Recovering Surface Layout from an Image
International Journal of Computer Vision
LabelMe: A Database and Web-Based Tool for Image Annotation
International Journal of Computer Vision
80 Million Tiny Images: A Large Data Set for Nonparametric Object and Scene Recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence
Learning Spatial Context: Using Stuff to Find Things
ECCV '08 Proceedings of the 10th European Conference on Computer Vision: Part I
Segmentation and Recognition Using Structure from Motion Point Clouds
ECCV '08 Proceedings of the 10th European Conference on Computer Vision: Part I
Context based object categorization: A critical survey
Computer Vision and Image Understanding
What, where and how many? combining object detectors and CRFs
ECCV'10 Proceedings of the 11th European conference on Computer vision: Part IV
Semantic segmentation of urban scenes using dense depth maps
ECCV'10 Proceedings of the 11th European conference on Computer vision: Part IV
Superparsing: scalable nonparametric image parsing with superpixels
ECCV'10 Proceedings of the 11th European conference on Computer vision: Part V
Thinking inside the box: using appearance models and context based on room geometry
ECCV'10 Proceedings of the 11th European conference on Computer vision: Part VI
Multiscale conditional random fields for image labeling
CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
SIFT Flow: Dense Correspondence across Scenes and Its Applications
IEEE Transactions on Pattern Analysis and Machine Intelligence
Nonparametric Scene Parsing via Label Transfer
IEEE Transactions on Pattern Analysis and Machine Intelligence
ECCV'06 Proceedings of the 9th European conference on Computer Vision - Volume Part I
From 3D scene geometry to human workspace
CVPR '11 Proceedings of the 2011 IEEE Conference on Computer Vision and Pattern Recognition
Evaluation of super-voxel methods for early video processing
CVPR '12 Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Nonparametric image parsing using adaptive neighbor sets
CVPR '12 Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Understanding scenes on many levels
ICCV '11 Proceedings of the 2011 International Conference on Computer Vision
Ensemble of exemplar-SVMs for object detection and beyond
ICCV '11 Proceedings of the 2011 International Conference on Computer Vision
ICCV '11 Proceedings of the 2011 International Conference on Computer Vision
Class-Specified segmentation with multi-scale superpixels
ACCV'12 Proceedings of the 11th international conference on Computer Vision - Volume Part I
Hand segmentation for gesture recognition in EGO-vision
Proceedings of the 3rd ACM international workshop on Interactive multimedia on mobile & portable devices
Hi-index | 0.00 |
This paper presents a simple and effective nonparametric approach to the problem of image parsing, or labeling image regions (in our case, superpixels produced by bottom-up segmentation) with their categories. This approach is based on lazy learning, and it can easily scale to datasets with tens of thousands of images and hundreds of labels. Given a test image, it first performs global scene-level matching against the training set, followed by superpixel-level matching and efficient Markov random field (MRF) optimization for incorporating neighborhood context. Our MRF setup can also compute a simultaneous labeling of image regions into semantic classes (e.g., tree, building, car) and geometric classes (sky, vertical, ground). Our system outperforms the state-of-the-art nonparametric method based on SIFT Flow on a dataset of 2,688 images and 33 labels. In addition, we report per-pixel rates on a larger dataset of 45,676 images and 232 labels. To our knowledge, this is the first complete evaluation of image parsing on a dataset of this size, and it establishes a new benchmark for the problem. Finally, we present an extension of our method to video sequences and report results on a video dataset with frames densely labeled at 1 Hz.