Scale-Space and Edge Detection Using Anisotropic Diffusion
IEEE Transactions on Pattern Analysis and Machine Intelligence
The Design and Use of Steerable Filters
IEEE Transactions on Pattern Analysis and Machine Intelligence
WordNet: a lexical database for English
Communications of the ACM
Saliency, Scale and Image Description
International Journal of Computer Vision
Representing and Recognizing the Visual Appearance of Materials using Three-dimensional Textons
International Journal of Computer Vision
Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope
International Journal of Computer Vision
Shape Matching and Object Recognition Using Shape Contexts
IEEE Transactions on Pattern Analysis and Machine Intelligence
An Affine Invariant Interest Point Detector
ECCV '02 Proceedings of the 7th European Conference on Computer Vision-Part I
Object Recognition from Local Scale-Invariant Features
ICCV '99 Proceedings of the International Conference on Computer Vision-Volume 2 - Volume 2
Histograms of Oriented Gradients for Human Detection
CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 1 - Volume 01
ACM SIGGRAPH 2005 Papers
Learning Hierarchical Models of Scenes, Objects, and Parts
ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision - Volume 2
One-Shot Learning of Object Categories
IEEE Transactions on Pattern Analysis and Machine Intelligence
Putting Objects in Perspective
CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
Context and Hierarchy in a Probabilistic Image Model
CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
Multimedia semantic indexing using model vectors
ICME '03 Proceedings of the 2003 International Conference on Multimedia and Expo - Volume 1
Computer
ECCV'06 Proceedings of the 9th European conference on Computer Vision - Volume Part IV
Can High-Level Concepts Fill the Semantic Gap in Video Retrieval? A Case Study With Broadcast News
IEEE Transactions on Multimedia
Synthetically trained multi-view object class and viewpoint detection for advanced image retrieval
Proceedings of the 1st ACM International Conference on Multimedia Retrieval
Scene classification using a multi-resolution bag-of-features model
Pattern Recognition
Script data for attribute-based recognition of composite activities
ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part I
Latent pyramidal regions for recognizing scenes
ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part V
Atomic action features: a new feature for action recognition
ECCV'12 Proceedings of the 12th international conference on Computer Vision - Volume Part I
Multi-Level structured image coding on high-dimensional image representation
ACCV'12 Proceedings of the 11th Asian conference on Computer Vision - Volume Part II
Personalized image recommendation and retrieval via latent SVM based model
Proceedings of the Fifth International Conference on Internet Multimedia Computing and Service
Object coding on the semantic graph for scene classification
Proceedings of the 21st ACM international conference on Multimedia
Multimedia search reranking: A literature survey
ACM Computing Surveys (CSUR)
Violent scene detection using mid-level feature
Proceedings of the Fourth Symposium on Information and Communication Technology
Object Bank: An Object-Level Image Representation for High-Level Visual Recognition
International Journal of Computer Vision
Hi-index | 0.00 |
Robust low-level image features have proven to be effective representations for a variety of high-level visual recognition tasks, such as object recognition and scene classification. But as the visual recognition tasks become more challenging, the semantic gap between low-level feature representation and the meaning of the scenes increases. In this paper, we propose to use objects as attributes of scenes for scene classification. We represent images by collecting their responses to a large number of object detectors, or "object filters". Such representation carries high-level semantic information rather than low-level image feature information, making it more suitable for high-level visual recognition tasks. Using very simple, off-the-shelf classifiers such as SVM, we show that this object-level image representation can be used effectively for high-level visual tasks such as scene classification. Our results are superior to reported state-of-the-art performance on a number of standard datasets.