Modulating Shape Features by Color Attention for Object Recognition

Authors:
Fahad Shahbaz Khan;Joost van de Weijer;Maria Vanrell
Affiliations:
Computer Vision Centre Barcelona, Universitat Autonoma de Barcelona, Barcelona, Spain;Computer Vision Centre Barcelona, Universitat Autonoma de Barcelona, Barcelona, Spain;Computer Vision Centre Barcelona, Universitat Autonoma de Barcelona, Barcelona, Spain
Venue:
International Journal of Computer Vision
Year:
2012

Abstract

Bag-of-words based image representation is a successful approach to object recognition. Generally, the successive stages of the pipeline (feature detection, feature description, vocabulary construction and image representation) are performed independently of the object classes to be detected. In such a framework, combining different image cues, such as shape and color, often yields results below expectations.

This paper presents a novel method for recognizing object categories from multiple cues: shape and color are processed separately and then combined by modulating the shape features with category-specific color attention. Color is used to compute bottom-up and top-down attention maps. These color attention maps are then used to modulate the weights of the shape features, so that shape features in regions with high attention receive more weight than those in regions with low attention.

We compare our approach with existing methods that combine color and shape cues on five data sets in which the relative importance of the two cues varies: Soccer (color predominance), Flower (color and shape parity), PASCAL VOC 2007 and 2009 (shape predominance) and Caltech-101 (color co-interference). The experiments demonstrate that on all five data sets the proposed framework significantly outperforms existing methods for combining color and shape information.
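
The modulation idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation and rests on assumptions that the abstract does not state: each local feature is taken to carry a shape-word index and a color-word index, the attention weight of a feature for a class is read from a hypothetical table `p_class_given_color`, and the image representation is assumed to be the concatenation of one weighted shape histogram per class. The function names are illustrative, and normalization is left out.

```python
def color_attention_histogram(shape_words, attention, vocab_size):
    """Build a shape-word histogram in which each local feature adds its
    color-attention weight to its bin instead of a fixed count of 1."""
    if len(shape_words) != len(attention):
        raise ValueError("need exactly one attention weight per feature")
    hist = [0.0] * vocab_size
    for word, weight in zip(shape_words, attention):
        hist[word] += weight
    return hist


def class_specific_representation(shape_words, color_words,
                                  p_class_given_color, vocab_size):
    """Concatenate one attention-weighted shape histogram per class.

    p_class_given_color[k][c] is a hypothetical attention table: the weight
    given to a feature with color word c when attending to class k.
    """
    representation = []
    for class_weights in p_class_given_color:
        # Look up the attention weight of every feature from its color word.
        attention = [class_weights[c] for c in color_words]
        representation.extend(
            color_attention_histogram(shape_words, attention, vocab_size)
        )
    return representation


# Toy image: four local features, three shape words, two color words.
shape_words = [0, 2, 2, 1]
color_words = [1, 0, 0, 1]
# Assumed attention table: class 0 favors color word 0, class 1 favors color word 1.
p_class_given_color = [[0.9, 0.2],
                       [0.1, 0.8]]

rep = class_specific_representation(
    shape_words, color_words, p_class_given_color, vocab_size=3
)
print(rep)
```

In this toy example the two features with shape word 2 both carry color word 0, so they dominate the histogram for class 0 and contribute little to the histogram for class 1. With uniform attention weights of 1, each per-class histogram reduces to a plain bag-of-words count.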