Sampling strategies for bag-of-features image classification

Authors:
Eric Nowak;Frédéric Jurie;Bill Triggs
Affiliations:
GRAVIR-CNRS-INRIA, Montbonnot, France;GRAVIR-CNRS-INRIA, Montbonnot, France;GRAVIR-CNRS-INRIA, Montbonnot, France
Venue:
ECCV'06 Proceedings of the 9th European conference on Computer Vision - Volume Part IV
Year:
2006

Citing 17
Cited 116

Detecting salient blob-like image structures and their scales with a scale-space primal sketch: a method for focus-of-attention

International Journal of Computer Vision
The Earth Mover's Distance as a Metric for Image Retrieval

International Journal of Computer Vision
Representing and Recognizing the Visual Appearance of Materials using Three-dimensional Textons

International Journal of Computer Vision
An Affine Invariant Interest Point Detector

ECCV '02 Proceedings of the 7th European Conference on Computer Vision-Part I
Unsupervised Learning of Models for Recognition

ECCV '00 Proceedings of the 6th European Conference on Computer Vision-Part I
Text Categorization with Suport Vector Machines: Learning with Many Relevant Features

ECML '98 Proceedings of the 10th European Conference on Machine Learning
Affine-Invariant Local Descriptors and Neighborhood Statistics for Texture Recognition

ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
Video Google: A Text Retrieval Approach to Object Matching in Videos

ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
Distinctive Image Features from Scale-Invariant Keypoints

International Journal of Computer Vision
Learning to Detect Objects in Images via a Sparse, Part-Based Representation

IEEE Transactions on Pattern Analysis and Machine Intelligence
Hierarchical Part-Based Visual Object Categorization

CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 1 - Volume 01
Efficient Image Matching with Distributions of Local Invariant Features

CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 2 - Volume 02
Creating Efficient Codebooks for Visual Recognition

ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1 - Volume 01
Learning Object Categories from Google"s Image Search

ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision - Volume 2
Object Categorization by Learned Universal Visual Dictionary

ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision - Volume 2
A Comparison of Affine Region Detectors

International Journal of Computer Vision
Hyperfeatures – multilevel local coding for visual recognition

ECCV'06 Proceedings of the 9th European conference on Computer Vision - Volume Part I

Towards optimal bag-of-features for object categorization and semantic video retrieval

Proceedings of the 6th ACM international conference on Image and video retrieval
Searching for repeated video sequences

Proceedings of the international workshop on Workshop on multimedia information retrieval
Object categorization

Foundations and Trends® in Computer Graphics and Vision
Context-Based Object-Class Recognition and Retrieval by Generalized Correlograms

IEEE Transactions on Pattern Analysis and Machine Intelligence
A comprehensive review of current local features for computer vision

Neurocomputing
Object retrieval using configurations of salient regions

CIVR '08 Proceedings of the 2008 international conference on Content-based image and video retrieval
A comparison of color features for visual concept classification

CIVR '08 Proceedings of the 2008 international conference on Content-based image and video retrieval
Fast nearest neighbor retrieval for bregman divergences

Proceedings of the 25th international conference on Machine learning
Local invariant feature detectors: a survey

Foundations and Trends® in Computer Graphics and Vision
Discriminative cue integration for medical image annotation

Pattern Recognition Letters
Learning Distance Functions for Automatic Annotation of Images

Adaptive Multimedial Retrieval: Retrieval, User, and Semantics
Sliding-Windows for Rapid Object Class Localization: A Parallel Technique

Proceedings of the 30th DAGM symposium on Pattern Recognition
Spirittagger: a geo-aware tag suggestion tool mined from flickr

MIR '08 Proceedings of the 1st ACM international conference on Multimedia information retrieval
Semantic lattices for multiple annotation of images

MIR '08 Proceedings of the 1st ACM international conference on Multimedia information retrieval
Video object matching across multiple independent views using local descriptors and adaptive learning

Pattern Recognition Letters
Learning to Localize Objects with Structured Output Regression

ECCV '08 Proceedings of the 10th European Conference on Computer Vision: Part I
Material-specific adaptation of color invariant features

Pattern Recognition Letters
Visual word proximity and linguistics for semantic video indexing and near-duplicate retrieval

Computer Vision and Image Understanding
Mining the web for visual concepts

Proceedings of the 9th International Workshop on Multimedia Data Mining: held in conjunction with the ACM SIGKDD 2008
Object localisation using the Generative Template of Features

Computer Vision and Image Understanding
Multi-class image segmentation using conditional random fields and global classification

ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
Drosophila gene expression pattern annotation using sparse features and term-term interactions

Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
Multi-Region Probabilistic Histograms for Robust and Scalable Identity Inference

ICB '09 Proceedings of the Third International Conference on Advances in Biometrics
Bag-of-Features Codebook Generation by Self-Organisation

WSOM '09 Proceedings of the 7th International Workshop on Advances in Self-Organizing Maps
Histopathology Image Classification Using Bag of Features and Kernel Functions

AIME '09 Proceedings of the 12th Conference on Artificial Intelligence in Medicine: Artificial Intelligence in Medicine
Foreground Focus: Unsupervised Learning from Partially Matching Images

International Journal of Computer Vision
Kernel Methods in Computer Vision

Foundations and Trends® in Computer Graphics and Vision
Global annotation on georeferenced photographs

Proceedings of the ACM International Conference on Image and Video Retrieval
A visual analysis of the relationship between word concepts and geographical locations

Proceedings of the ACM International Conference on Image and Video Retrieval
Dense sampling low-level statistics of local features

Proceedings of the ACM International Conference on Image and Video Retrieval
Semi-supervised learning of visual classifiers from web images and text

IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
Efficient Object Pixel-Level Categorization Using Bag of Features

ISVC '09 Proceedings of the 5th International Symposium on Advances in Visual Computing: Part I
Web image gathering with region-based bag-of-features and multiple instance learning

ICME'09 Proceedings of the 2009 IEEE international conference on Multimedia and Expo
An analysis of the relation between visual concepts and geo-locations using geotagged images on the web

ICME'09 Proceedings of the 2009 IEEE international conference on Multimedia and Expo
X-ray image categorization and retrieval using patch-based visual words representation

ISBI'09 Proceedings of the Sixth IEEE international conference on Symposium on Biomedical Imaging: From Nano to Macro
Data-driven grasping with partial sensor data

IROS'09 Proceedings of the 2009 IEEE/RSJ international conference on Intelligent robots and systems
Interactive image segmentation using probabilistic hypergraphs

Pattern Recognition
Chest x-ray characterization: from organ identification to pathology categorization

Proceedings of the international conference on Multimedia information retrieval
Region-based automatic web image selection

Proceedings of the international conference on Multimedia information retrieval
Towards pose-invariant 2D face classification for surveillance

AMFG'07 Proceedings of the 3rd international conference on Analysis and modeling of faces and gestures
Content-based image retrieval by indexing random subwindows with randomized trees

ACCV'07 Proceedings of the 8th Asian conference on Computer vision - Volume Part II
Web image gathering with a part-based object recognition method

MMM'08 Proceedings of the 14th international conference on Advances in multimedia modeling
A tale of two object recognition methods for mobile robots

ICVS'08 Proceedings of the 6th international conference on Computer vision systems
Multiple order gradient feature for macro-invertebrate identification using support vector machines

ICANNGA'09 Proceedings of the 9th international conference on Adaptive and natural computing algorithms
Web-scale computer vision using MapReduce for multimedia data mining

Proceedings of the Tenth International Workshop on Multimedia Data Mining
Expanded bag of words representation for object classification

ICIP'09 Proceedings of the 16th IEEE international conference on Image processing
Color constancy using stage classification

ICIP'09 Proceedings of the 16th IEEE international conference on Image processing
Novel local features with hybrid sampling technique for image retrieval

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Unsupervised object category discovery via information bottleneck method

Proceedings of the international conference on Multimedia
e-Silkroad: a sample of combining social media with cultural tourism

Proceedings of the 1st ACM international workshop on Connected multimedia
Dense simple features for fast and accurate medical X-ray annotation

CLEF'09 Proceedings of the 10th international conference on Cross-language evaluation forum: multimedia experiments
Weighted symbols-based edit distance for string-structured image classification

ECML PKDD'10 Proceedings of the 2010 European conference on Machine learning and knowledge discovery in databases: Part I
Real-Time Object Segmentation Using a Bag of Features Approach

Proceedings of the 2010 conference on Artificial Intelligence Research and Development: Proceedings of the 13th International Conference of the Catalan Association for Artificial Intelligence
Randomized locality sensitive vocabularies for bag-of-features model

ECCV'10 Proceedings of the 11th European conference on computer vision conference on Computer vision: Part III
Optimal operations for visual categorization

ICIMCS '10 Proceedings of the Second International Conference on Internet Multimedia Computing and Service
Event detection and recognition for semantic annotation of video

Multimedia Tools and Applications
Linear discriminant analysis for signatures

IEEE Transactions on Neural Networks
Histopathological image classification using stain component features on a pLSA model

CIARP'10 Proceedings of the 15th Iberoamerican congress conference on Progress in pattern recognition, image analysis, computer vision, and applications
A system for colorectal tumor classification in magnifying endoscopic NBI images

ACCV'10 Proceedings of the 10th Asian conference on Computer vision - Volume Part II
Robust tracking based on pixel-wise spatial pyramid and biased fusion

ACCV'10 Proceedings of the 10th Asian conference on Computer vision - Volume Part IV
Region Contextual Visual Words for scene categorization

Expert Systems with Applications: An International Journal
An efficient image classifier using discrete cosine transform

Proceedings of the International Conference & Workshop on Emerging Trends in Technology
Learning reconfigurable hashing for diverse semantics

Proceedings of the 1st ACM International Conference on Multimedia Retrieval
Graph-based methods for the automatic annotation and retrieval of art prints

Proceedings of the 1st ACM International Conference on Multimedia Retrieval
Data-driven grasping

Autonomous Robots
Measuring the coverage of interest point detectors

ICIAR'11 Proceedings of the 8th international conference on Image analysis and recognition - Volume Part I
Visual words on baggage X-ray images

CAIP'11 Proceedings of the 14th international conference on Computer analysis of images and patterns - Volume Part I
Random interest regions for object recognition based on texture descriptors and bag of features

Expert Systems with Applications: An International Journal
An extended-HCT semantic description for visual place recognition

International Journal of Robotics Research
On the spatial extents of SIFT descriptors for visual concept detection

ICVS'11 Proceedings of the 8th international conference on Computer vision systems
An evaluation of local interest regions for non-rigid object class recognition

Expert Systems with Applications: An International Journal
Recognizing clothes patterns for blind people by confidence margin based feature combination

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Semantic point detector

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Combining latent semantic learning and reduced hypergraph learning for semi-supervised image categorization

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Text and image subject classifiers: dense works better

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Images as sets of locally weighted features

Computer Vision and Image Understanding
New color image histogram-based detectors

IVIC'11 Proceedings of the Second international conference on Visual informatics: sustaining research and innovations - Volume Part I
Harmony Potentials

International Journal of Computer Vision
The Visual Extent of an Object

International Journal of Computer Vision
Image classification using probability higher-order local auto-correlations

ACCV'09 Proceedings of the 9th Asian conference on Computer Vision - Volume Part III
Johnny: An Autonomous Service Robot for Domestic Environments

Journal of Intelligent and Robotic Systems
Face recognition using the POEM descriptor

Pattern Recognition
Categorization of multiple objects in a scene without semantic segmentation

ACCV'09 Proceedings of the 9th Asian conference on Computer Vision - Volume Part I
A study on sampling strategies in space-time domain for recognition applications

MMM'10 Proceedings of the 16th international conference on Advances in Multimedia Modeling
Improving tracking algorithms using saliency

CIARP'11 Proceedings of the 16th Iberoamerican Congress conference on Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications
A local spectral distribution approach to face recognition

Computer Vision and Image Understanding
Modulating Shape Features by Color Attention for Object Recognition

International Journal of Computer Vision
Image representation for generic object recognition using higher-order local autocorrelation features on posterior probability images

Pattern Recognition
SUPER: towards real-time event recognition in internet videos

Proceedings of the 2nd ACM International Conference on Multimedia Retrieval
Segmentation-based multi-class semantic object detection

Multimedia Tools and Applications
Content-based image retrieval using color difference histogram

Pattern Recognition
Multi-view action recognition using local similarity random forests and sensor fusion

Pattern Recognition Letters
Stratified sampling for feature subspace selection in random forests for high dimensional data

Pattern Recognition
Bag of spatio-visual words for context inference in scene classification

Pattern Recognition
Nested sparse quantization for efficient feature coding

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part II
Scene recognition on the semantic manifold

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part IV
Exploring bag of words architectures in the facial expression domain

ECCV'12 Proceedings of the 12th international conference on Computer Vision - Volume 2
Enhanced representation and multi-task learning for image annotation

Computer Vision and Image Understanding
Heterogeneous bag-of-features for object/scene recognition

Applied Soft Computing
Action recognition using canonical correlation kernels

ACCV'12 Proceedings of the 11th Asian conference on Computer Vision - Volume Part III
Efficient development of user-defined image recognition systems

ACCV'12 Proceedings of the 11th international conference on Computer Vision - Volume Part I
Bubble space and place representation in topological maps

International Journal of Robotics Research
Automatic Annotation of Scientific Video Material based on Visual Concept Detection

Proceedings of the 13th International Conference on Knowledge Management and Knowledge Technologies
SmartVisionApp: A framework for computer vision applications on mobile devices

Expert Systems with Applications: An International Journal
Multiple instance classification: Review, taxonomy and comparative study

Artificial Intelligence
Object class detection: A survey

ACM Computing Surveys (CSUR)
Integrating cue descriptors in bubble space for place recognition

ICVS'13 Proceedings of the 9th international conference on Computer Vision Systems
Spatiotemporal bag-of-features for early wildfire smoke detection

Image and Vision Computing
Learning group-based dictionaries for discriminative image representation

Pattern Recognition
Towards robust object categorization for mobile robots with combination of classifiers

Robot Soccer World Cup XV
CM-BOF: visual similarity-based 3D shape retrieval using Clock Matching and Bag-of-Features

Machine Vision and Applications
Learning structured visual dictionary for object tracking

Image and Vision Computing
Structured representations in a content based image retrieval context

Journal of Visual Communication and Image Representation
Language-motivated approaches to action recognition

The Journal of Machine Learning Research
Visual words dictionaries and fusion techniques for searching people through textual and visual attributes

Pattern Recognition Letters
A co-boost framework for learning object categories from Google Images with 1st and 2nd order features

The Visual Computer: International Journal of Computer Graphics

Quantified Score

Hi-index	0.01

Visualization

Abstract

Bag-of-features representations have recently become popular for content based image classification owing to their simplicity and good performance. They evolved from texton methods in texture analysis. The basic idea is to treat images as loose collections of independent patches, sampling a representative set of patches from the image, evaluating a visual descriptor vector for each patch independently, and using the resulting distribution of samples in descriptor space as a characterization of the image. The four main implementation choices are thus how to sample patches, how to describe them, how to characterize the resulting distributions and how to classify images based on the result. We concentrate on the first issue, showing experimentally that for a representative selection of commonly used test databases and for moderate to large numbers of samples, random sampling gives equal or better classifiers than the sophisticated multiscale interest operators that are in common use. Although interest operators work well for small numbers of samples, the single most important factor governing performance is the number of patches sampled from the test image and ultimately interest operators can not provide enough patches to compete. We also study the influence of other factors including codebook size and creation method, histogram normalization method and minimum scale for feature extraction.