Creating Efficient Codebooks for Visual Recognition

  • Authors:
  • Frederic Jurie; Bill Triggs

  • Affiliations:
  • GRAVIR-INRIA-CNRS; GRAVIR-INRIA-CNRS

  • Venue:
  • ICCV '05: Proceedings of the Tenth IEEE International Conference on Computer Vision (ICCV'05), Volume 1
  • Year:
  • 2005

Abstract

Visual codebook based quantization of robust appearance descriptors extracted from local image patches is an effective means of capturing image statistics for texture analysis and scene classification. Codebooks are usually constructed by using a method such as k-means to cluster the descriptor vectors of patches sampled either densely ("textons") or sparsely ("bags of features" based on keypoints or salience measures) from a set of training images. This works well for texture analysis in homogeneous images, but the images that arise in natural object recognition tasks have far less uniform statistics. We show that for dense sampling, k-means over-adapts to this, clustering centres almost exclusively around the densest few regions in descriptor space and thus failing to code other informative regions. This gives suboptimal codes that are no better than using randomly selected centres. We describe a scalable acceptance-radius based clusterer that generates better codebooks and study its performance on several image classification tasks. We also show that dense representations outperform equivalent keypoint-based ones on these tasks, and that SVM or Mutual Information based feature selection starting from a dense codebook further improves performance.
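The abstract names an acceptance-radius based clusterer but does not give its details. As a rough illustration of the general idea only (not the authors' algorithm), the sketch below builds a codebook by greedy single-pass clustering: each descriptor joins the nearest existing centre if it lies within an acceptance radius r, and otherwise seeds a new centre. The function name, the Euclidean metric, the single pass over the data, and the radius value are all assumptions made for illustration.

```python
import numpy as np

def radius_clusterer(descriptors, r):
    """Hypothetical greedy single-pass clustering: each descriptor joins
    the nearest existing centre if it lies within radius r, otherwise it
    seeds a new centre. Centres are running means of assigned points."""
    sums, counts = [], []              # per-centre running sum and count
    for x in descriptors:
        if sums:
            means = np.array(sums) / np.array(counts)[:, None]
            dists = np.linalg.norm(means - x, axis=1)
            j = int(np.argmin(dists))
            if dists[j] <= r:          # accept: update the running mean
                sums[j] = sums[j] + x
                counts[j] += 1
                continue
        sums.append(x.astype(float))   # reject: seed a new centre
        counts.append(1)
    return np.array(sums) / np.array(counts)[:, None]

# Toy usage: 1000 random 32-D descriptors; the radius is an ad hoc choice.
codebook = radius_clusterer(np.random.rand(1000, 32), r=1.0)
print(codebook.shape)                  # (number of centres found, 32)
```

Unlike k-means, which fixes the number of centres in advance and lets them drift toward the densest regions of descriptor space, a radius-based scheme of this kind bounds how tightly centres can crowd together, which is consistent with the abstract's motivation for preferring it under non-uniform image statistics.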