Images as sets of locally weighted features

Authors:
Teófilo de Campos;Gabriela Csurka;Florent Perronnin
Affiliations:
CVSSP, University of Surrey, Guildford, Surrey GU2 7XH, UK and Xerox Research Centre Europe, 6, chemin de Maupertuis, 38240 Meylan, France;CVSSP, University of Surrey, Guildford, Surrey GU2 7XH, UK and Xerox Research Centre Europe, 6, chemin de Maupertuis, 38240 Meylan, France;CVSSP, University of Surrey, Guildford, Surrey GU2 7XH, UK and Xerox Research Centre Europe, 6, chemin de Maupertuis, 38240 Meylan, France
Venue:
Computer Vision and Image Understanding
Year:
2012

Citing 28
Cited 4

Saliency, Scale and Image Description

International Journal of Computer Vision
Eigenfaces Versus Eigeneyes: First Steps Toward Performance Assessment of Representations for Face Recognition

MICAI '00 Proceedings of the Mexican International Conference on Artificial Intelligence: Advances in Artificial Intelligence
Object Recognition with Informative Features and Linear Classification

ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
Video Google: A Text Retrieval Approach to Object Matching in Videos

ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
Automatic thumbnail cropping and its effectiveness

Proceedings of the 16th annual ACM symposium on User interface software and technology
Scale & Affine Invariant Interest Point Detectors

International Journal of Computer Vision
Distinctive Image Features from Scale-Invariant Keypoints

International Journal of Computer Vision
Sparse Multinomial Logistic Regression: Fast Algorithms and Generalization Bounds

IEEE Transactions on Pattern Analysis and Machine Intelligence
A Performance Evaluation of Local Descriptors

IEEE Transactions on Pattern Analysis and Machine Intelligence
Object Categorization by Learned Universal Visual Dictionary

ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision - Volume 2
2006 Special Issue: Modeling attention to salient proto-objects

Neural Networks
Attention-based similarity

Pattern Recognition
Local Features and Kernels for Classification of Texture and Object Categories: A Comprehensive Study

International Journal of Computer Vision
Asirra: a CAPTCHA that exploits interest-aligned manual image categorization

Proceedings of the 14th ACM conference on Computer and communications security
Universal and Adapted Vocabularies for Generic Visual Categorization

IEEE Transactions on Pattern Analysis and Machine Intelligence
Determining Patch Saliency Using Low-Level Context

ECCV '08 Proceedings of the 10th European Conference on Computer Vision: Part II
Automated Flower Classification over a Large Number of Classes

ICVGIP '08 Proceedings of the 2008 Sixth Indian Conference on Computer Vision, Graphics & Image Processing
A Comparison of Feature Detectors with Passive and Task-Based Visual Saliency

SCIA '09 Proceedings of the 16th Scandinavian Conference on Image Analysis
Spatial extensions to bag of visual words

Proceedings of the ACM International Conference on Image and Video Retrieval
Computational visual attention systems and their cognitive foundations: A survey

ACM Transactions on Applied Perception (TAP)
The Pascal Visual Object Classes (VOC) Challenge

International Journal of Computer Vision
Visual Word Ambiguity

IEEE Transactions on Pattern Analysis and Machine Intelligence
Improving the fisher kernel for large-scale image classification

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part IV
Image classification using super-vector coding of local image descriptors

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part V
Is bottom-up attention useful for object recognition?

CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
Use of image regions in context-adaptive image classification

SAMT'06 Proceedings of the First international conference on Semantic and Digital Media Technologies
Sampling strategies for bag-of-features image classification

ECCV'06 Proceedings of the 9th European conference on Computer Vision - Volume Part IV
Modeling the spatial layout of images beyond spatial pyramids

Pattern Recognition Letters

Modeling the spatial layout of images beyond spatial pyramids

Pattern Recognition Letters
Enhanced representation and multi-task learning for image annotation

Computer Vision and Image Understanding
Efficient image signatures and similarities using tensor products of local descriptors

Computer Vision and Image Understanding
Multi-spectral dataset and its application in saliency detection

Computer Vision and Image Understanding

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper presents a generic framework in which images are modelled as order-less sets of weighted visual features. Each visual feature is associated with a weight factor that may inform its relevance. This framework can be applied to various bag-of-features approaches such as the bag-of-visual-word or the Fisher kernel representations. We suggest that if dense sampling is used, different schemes to weight local features can be evaluated, leading to results that are often better than the combination of multiple sampling schemes, at a much lower computational cost, because the features are extracted only once. This allows our framework to be a test-bed for saliency estimation methods in image categorisation tasks. We explored two main possibilities for the estimation of local feature relevance. The first one is based on the use of saliency maps obtained from human feedback, either by gaze tracking or by mouse clicks. The method is able to profit from such maps, leading to a significant improvement in categorisation performance. The second possibility is based on automatic saliency estimation methods, including Itti & Koch's method and SIFT's DoG. We evaluated the proposed framework and saliency estimation methods using an in house dataset and the PASCAL VOC 2008/2007 dataset, showing that some of the saliency estimation methods lead to a significant performance improvement in comparison to the standard unweighted representation.