Visual Word Ambiguity

Authors:
Jan C. van Gemert;Cor J. Veenman;Arnold W. M. Smeulders;Jan-Mark Geusebroek
Affiliations:
Ecole Normale Supérieure, Paris;University of Amsterdam, Amsterdam;University of Amsterdam, Amsterdam;University of Amsterdam, Amsterdam
Venue:
IEEE Transactions on Pattern Analysis and Machine Intelligence
Year:
2010

Abstract

This paper studies automatic image classification by modeling soft assignment in the popular codebook model. The codebook model describes an image as a bag of discrete visual words selected from a vocabulary, where the frequency distribution of visual words in an image allows classification. One inherent component of the codebook model is the assignment of discrete visual words to continuous image features. Despite the clear mismatch of this hard assignment with the nature of continuous features, the approach has been successfully applied for some years. In this paper, we investigate four types of soft assignment of visual words to image features. We demonstrate that explicitly modeling visual word assignment ambiguity improves classification performance compared to the hard assignment of the traditional codebook model. The traditional codebook model is compared against our method on five well-known data sets: 15 natural scenes, Caltech-101, Caltech-256, Pascal VOC 2007, and Pascal VOC 2008. We demonstrate that large codebook vocabulary sizes severely degrade the performance of the traditional model, whereas the proposed model performs consistently. Moreover, we show that our method benefits from high-dimensional feature spaces and gains more as the number of image categories increases.
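
The difference between hard and soft assignment can be illustrated with a minimal sketch. The abstract does not specify the four soft-assignment variants, so the Gaussian kernel, the per-feature normalization, the parameter `sigma`, and all function names below are assumptions made for illustration only, not the authors' formulation.

```python
import math


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def hard_histogram(features, codebook):
    """Traditional codebook: each feature votes only for its nearest word."""
    hist = [0.0] * len(codebook)
    for f in features:
        nearest = min(range(len(codebook)), key=lambda j: sq_dist(f, codebook[j]))
        hist[nearest] += 1.0
    return [h / len(features) for h in hist]


def soft_histogram(features, codebook, sigma=1.0):
    """Soft assignment (assumed variant): each feature spreads one unit of
    mass over all words, weighted by a Gaussian kernel on the distance."""
    hist = [0.0] * len(codebook)
    for f in features:
        dists = [sq_dist(f, w) for w in codebook]
        d_min = min(dists)
        # Subtracting d_min keeps exp() from underflowing to zero for all
        # words; it cancels out in the normalization below.
        weights = [math.exp(-(d - d_min) / (2.0 * sigma ** 2)) for d in dists]
        total = sum(weights)
        for j, k in enumerate(weights):
            hist[j] += k / total
    return [h / len(features) for h in hist]


# Toy example: one feature lying almost halfway between two visual words.
codebook = [(0.0, 0.0), (4.0, 0.0)]
features = [(1.9, 0.0)]
hard = hard_histogram(features, codebook)
soft = soft_histogram(features, codebook, sigma=1.0)
print("hard:", hard)
print("soft:", soft)
```

In this toy case the hard histogram puts all mass on the first word, discarding the fact that the feature is nearly as close to the second; the soft histogram retains that ambiguity. The value of `sigma` controls how far the mass spreads and would need tuning on real data.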