This paper studies automatic image classification by modeling soft assignment in the popular codebook model. The codebook model describes an image as a bag of discrete visual words selected from a vocabulary, where the frequency distribution of visual words in an image allows classification. One inherent component of the codebook model is the assignment of discrete visual words to continuous image features. Despite the clear mismatch between this hard assignment and the nature of continuous features, the approach has been applied successfully for some years. In this paper, we investigate four types of soft assignment of visual words to image features. We demonstrate that explicitly modeling visual word assignment ambiguity improves classification performance compared to the hard assignment of the traditional codebook model. The traditional codebook model is compared against our method on five well-known data sets: 15 natural scenes, Caltech-101, Caltech-256, Pascal VOC 2007, and Pascal VOC 2008. We demonstrate that large codebook vocabulary sizes completely deteriorate the performance of the traditional model, whereas the proposed model performs consistently. Moreover, we show that our method is advantageous in high-dimensional feature spaces and yields greater benefits as the number of image categories increases.
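The difference between hard and soft assignment can be illustrated with a minimal sketch. The abstract does not spell out the four soft-assignment types, so the Gaussian kernel weighting below, the per-feature normalization, the `sigma` parameter, and the function names are all assumptions chosen for illustration, not the authors' formulation.

```python
import math


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def hard_histogram(features, codebook):
    """Traditional codebook model: each feature votes for its nearest word."""
    hist = [0.0] * len(codebook)
    for f in features:
        nearest = min(range(len(codebook)),
                      key=lambda i: sq_dist(f, codebook[i]))
        hist[nearest] += 1.0
    return [h / len(features) for h in hist]


def soft_histogram(features, codebook, sigma=1.0):
    """Illustrative soft assignment (assumed Gaussian kernel):
    each feature spreads one unit of mass over all words,
    weighted by its closeness to each word."""
    hist = [0.0] * len(codebook)
    for f in features:
        dists = [sq_dist(f, w) for w in codebook]
        d_min = min(dists)  # shift for numerical stability; ratios are unchanged
        weights = [math.exp(-(d - d_min) / (2.0 * sigma ** 2)) for d in dists]
        total = sum(weights)
        for i, wt in enumerate(weights):
            hist[i] += wt / total
    return [h / len(features) for h in hist]


# Toy data (hypothetical): two visual words, two 2-D features.
codebook = [(0.0, 0.0), (1.0, 0.0)]
features = [(0.0, 0.0), (0.45, 0.0)]

hard = hard_histogram(features, codebook)
soft = soft_histogram(features, codebook, sigma=0.5)
print("hard:", hard)
print("soft:", soft)
```

In this toy example the second feature lies almost midway between the two words. Hard assignment gives its entire vote to the first word, whereas the soft variant splits the vote between both words, which is the kind of assignment ambiguity the abstract refers to.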