Modeling visual attention via selective tuning
Artificial Intelligence - Special volume on computer vision
Artificial Intelligence Review - Special issue on lazy learning
A Model of Saliency-Based Visual Attention for Rapid Scene Analysis
IEEE Transactions on Pattern Analysis and Machine Intelligence
Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope
International Journal of Computer Vision
Selection of Scale-Invariant Parts for Object Class Recognition
ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
Video Google: A Text Retrieval Approach to Object Matching in Videos
ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
Distinctive Image Features from Scale-Invariant Keypoints
International Journal of Computer Vision
A Sparse Texture Representation Using Local Affine Regions
IEEE Transactions on Pattern Analysis and Machine Intelligence
A Performance Evaluation of Local Descriptors
IEEE Transactions on Pattern Analysis and Machine Intelligence
Creating Efficient Codebooks for Visual Recognition
ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1 - Volume 01
Modeling Scenes with Local Descriptors and Latent Aspects
ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1 - Volume 01
Object Categorization by Learned Universal Visual Dictionary
ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision - Volume 2
Early versus late fusion in semantic video analysis
Proceedings of the 13th annual ACM international conference on Multimedia
Early versus late fusion in semantic video analysis
Proceedings of the 13th annual ACM international conference on Multimedia
Boosting Color Saliency in Image Feature Detection
IEEE Transactions on Pattern Analysis and Machine Intelligence
A Comparison of Affine Region Detectors
International Journal of Computer Vision
A Coherent Computational Approach to Model Bottom-Up Visual Attention
IEEE Transactions on Pattern Analysis and Machine Intelligence
Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories
CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
A Visual Vocabulary for Flower Classification
CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
Semantic Modeling of Natural Scenes for Content-Based Image Retrieval
International Journal of Computer Vision
2006 Special Issue: Modeling attention to salient proto-objects
Neural Networks
More efficiency in multiple kernel learning
Proceedings of the 24th international conference on Machine learning
Representing shape with a spatial pyramid kernel
Proceedings of the 6th ACM international conference on Image and video retrieval
Scene Classification Using a Hybrid Generative/Discriminative Approach
IEEE Transactions on Pattern Analysis and Machine Intelligence
Universal and Adapted Vocabularies for Generic Visual Categorization
IEEE Transactions on Pattern Analysis and Machine Intelligence
Performance evaluation of local colour invariants
Computer Vision and Image Understanding
Localizing Objects with Smart Dictionaries
ECCV '08 Proceedings of the 10th European Conference on Computer Vision: Part I
Automated Flower Classification over a Large Number of Classes
ICVGIP '08 Proceedings of the 2008 Sixth Indian Conference on Computer Vision, Graphics & Image Processing
IEEE Transactions on Pattern Analysis and Machine Intelligence
Supervised Learning of Quantizer Codebooks by Information Loss Minimization
IEEE Transactions on Pattern Analysis and Machine Intelligence
More generality in efficient multiple kernel learning
ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
Assessing the contribution of color in visual attention
Computer Vision and Image Understanding - Special issue: Attention and performance in computer vision
Learning color names for real-world applications
IEEE Transactions on Image Processing
Evaluating Color Descriptors for Object and Scene Recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence
Object classification using heterogeneous co-occurrence features
ECCV'10 Proceedings of the 11th European conference on Computer vision: Part II
Image classification using super-vector coding of local image descriptors
ECCV'10 Proceedings of the 11th European conference on Computer vision: Part V
Event Recognition Based on Top-Down Motion Attention
ICPR '10 Proceedings of the 2010 20th International Conference on Pattern Recognition
Top-down cues for event recognition
ACCV'10 Proceedings of the 10th Asian conference on Computer vision - Volume Part III
Sampling strategies for bag-of-features image classification
ECCV'06 Proceedings of the 9th European conference on Computer Vision - Volume Part IV
ECCV'06 Proceedings of the 9th European conference on Computer Vision - Volume Part IV
Natural scene image modeling using color and texture visterms
CIVR'06 Proceedings of the 5th international conference on Image and Video Retrieval
Coloring local feature extraction
ECCV'06 Proceedings of the 9th European conference on Computer Vision - Volume Part II
Texton theory revisited: A bag-of-words approach to combine textons
Pattern Recognition
A new biologically inspired color image descriptor
ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part V
Fusing color and shape for bag-of-words based object recognition
CCIW'13 Proceedings of the 4th international conference on Computational Color Imaging
Coloring Action Recognition in Still Images
International Journal of Computer Vision
Hi-index | 0.00 |
Bag-of-words based image representation is a successful approach for object recognition. Generally, the subsequent stages of the process: feature detection, feature description, vocabulary construction and image representation are performed independent of the intentioned object classes to be detected. In such a framework, it was found that the combination of different image cues, such as shape and color, often obtains below expected results.This paper presents a novel method for recognizing object categories when using multiple cues by separately processing the shape and color cues and combining them by modulating the shape features by category-specific color attention. Color is used to compute bottom-up and top-down attention maps. Subsequently, these color attention maps are used to modulate the weights of the shape features. In regions with higher attention shape features are given more weight than in regions with low attention.We compare our approach with existing methods that combine color and shape cues on five data sets containing varied importance of both cues, namely, Soccer (color predominance), Flower (color and shape parity), PASCAL VOC 2007 and 2009 (shape predominance) and Caltech-101 (color co-interference). The experiments clearly demonstrate that in all five data sets our proposed framework significantly outperforms existing methods for combining color and shape information.