Object Bank: An Object-Level Image Representation for High-Level Visual Recognition

Authors:
Li-Jia Li;Hao Su;Yongwhan Lim;Li Fei-Fei
Affiliations:
Yahoo! Research, Sunnyvale, USA 94089;Computer Science Department, Stanford University, Stanford, USA 94305;Computer Science Department, Stanford University, Stanford, USA 94305;Computer Science Department, Stanford University, Stanford, USA 94305
Venue:
International Journal of Computer Vision
Year:
2014

Abstract

Images are closely related to the objects that constitute them. In this paper, we propose to represent images by the objects appearing in them. We introduce the novel concept of object bank (OB), a high-level image representation that encodes object appearance and spatial location information in images. OB represents an image by its response to a large number of pre-trained object detectors, or 'object filters', which are blind to the testing dataset and the visual recognition task. Our OB representation demonstrates promising potential in high-level image recognition tasks. It significantly outperforms traditional low-level image representations in image classification on various benchmark image datasets while using simple, off-the-shelf classification algorithms such as linear SVM and logistic regression. In this paper, we analyze OB in detail, explaining the design choices that let OB achieve its best potential on different types of datasets. We demonstrate that object bank is a high-level representation from which we can easily discover semantic information about unknown images. We provide guidelines for effectively applying OB to high-level image recognition tasks, where it can be easily compressed for efficient computation in practice and is very robust to the choice of classifier.
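
The idea of turning detector responses into a single image descriptor can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the pooling scheme, so the max pooling over a 1x1, 2x2, and 4x4 grid is an assumption made for illustration, the function names `pool_response_map` and `object_bank_vector` are hypothetical, and the response maps are random numbers standing in for the output of real pre-trained object detectors.

```python
import random


def pool_response_map(rmap, grid):
    """Max-pool one detector's response map over a grid x grid layout.

    rmap is a list of rows (height x width) of detector scores.
    Returns grid * grid values, one per spatial cell, so the output
    keeps coarse information about where the object responded.
    """
    h, w = len(rmap), len(rmap[0])
    pooled = []
    for i in range(grid):
        r0, r1 = i * h // grid, (i + 1) * h // grid
        for j in range(grid):
            c0, c1 = j * w // grid, (j + 1) * w // grid
            pooled.append(
                max(rmap[r][c] for r in range(r0, r1) for c in range(c0, c1))
            )
    return pooled


def object_bank_vector(response_maps, grids=(1, 2, 4)):
    """Concatenate pooled responses of every detector into one vector.

    The grid sizes are an illustrative assumption, not taken from the source.
    """
    vec = []
    for rmap in response_maps:
        for g in grids:
            vec.extend(pool_response_map(rmap, g))
    return vec


# Placeholder data: 5 hypothetical detectors, each with a 32 x 32 response map.
random.seed(0)
maps = [[[random.random() for _ in range(32)] for _ in range(32)] for _ in range(5)]

vector = object_bank_vector(maps)
print(len(vector))  # 5 detectors * (1 + 4 + 16) cells = 105
```

A vector built this way has a fixed length regardless of image content, which is what allows it to be passed directly to a simple off-the-shelf classifier such as a linear SVM or logistic regression.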