IEEE Transactions on Pattern Analysis and Machine Intelligence
Representing and Recognizing the Visual Appearance of Materials using Three-dimensional Textons
International Journal of Computer Vision
Video Google: A Text Retrieval Approach to Object Matching in Videos
ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
Discovering Objects and their Localization in Images
ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1 - Volume 01
Object Categorization by Learned Universal Visual Dictionary
ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision - Volume 2
Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories
CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
Semantic Modeling of Natural Scenes for Content-Based Image Retrieval
International Journal of Computer Vision
IEEE Transactions on Pattern Analysis and Machine Intelligence
Building compact local pairwise codebook with joint feature space clustering
ECCV'10 Proceedings of the 11th European conference on Computer vision: Part I
Efficient object category recognition using classemes
ECCV'10 Proceedings of the 11th European conference on Computer vision: Part I
Image classification using super-vector coding of local image descriptors
ECCV'10 Proceedings of the 11th European conference on Computer vision: Part V
LIBSVM: A library for support vector machines
ACM Transactions on Intelligent Systems and Technology (TIST)
Probabilistic latent semantic analysis
UAI'99 Proceedings of the Fifteenth conference on Uncertainty in artificial intelligence
ECCV'06 Proceedings of the 9th European conference on Computer Vision - Volume Part IV
Visual and semantic similarity in ImageNet
CVPR '11 Proceedings of the 2011 IEEE Conference on Computer Vision and Pattern Recognition
Support vector machines for histogram-based image classification
IEEE Transactions on Neural Networks
Learning attribute-aware dictionary for image classification and search
Proceedings of the 3rd ACM conference on International conference on multimedia retrieval
Violent scene detection using mid-level feature
Proceedings of the Fourth Symposium on Information and Communication Technology
Hi-index | 0.00 |
The Bag-of-Words (BoW) model--commonly used for image classification--has two strong limitations: on one hand, visual words are lacking of explicit meanings, on the other hand, they are usually polysemous. This paper proposes to address these two limitations by introducing an intermediate representation based on the use of semantic attributes. Specifically, two different approaches are proposed. Both approaches consist in predicting a set of semantic attributes for the entire images as well as for local image regions, and in using these predictions to build the intermediate level features. Experiments on four challenging image databases (PASCAL VOC 2007, Scene-15, MSRCv2 and SUN-397) show that both approaches improve performance of the BoW model significantly. Moreover, their combination achieves the state-of-the-art results on several of these image databases.