Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope
International Journal of Computer Vision
On the algorithmic implementation of multiclass kernel-based vector machines
The Journal of Machine Learning Research
Distinctive Image Features from Scale-Invariant Keypoints
International Journal of Computer Vision
In Defense of One-Vs-All Classification
The Journal of Machine Learning Research
Histograms of Oriented Gradients for Human Detection
CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 1 - Volume 01
Building a Classification Cascade for Visual Identification from One Example
ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1 - Volume 01
The Pyramid Match Kernel: Discriminative Classification with Sets of Image Features
ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision - Volume 2
One-Shot Learning of Object Categories
IEEE Transactions on Pattern Analysis and Machine Intelligence
Scalable Recognition with a Vocabulary Tree
CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories
CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
SVM-KNN: Discriminative Nearest Neighbor Classification for Visual Category Recognition
CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
A Visual Vocabulary for Flower Classification
CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
Near-Optimal Hashing Algorithms for Approximate Nearest Neighbor in High Dimensions
FOCS '06 Proceedings of the 47th Annual IEEE Symposium on Foundations of Computer Science
A note on Platt's probabilistic outputs for support vector machines
Machine Learning
Local invariant feature detectors: a survey
Foundations and Trends® in Computer Graphics and Vision
LIBLINEAR: A Library for Large Linear Classification
The Journal of Machine Learning Research
80 Million Tiny Images: A Large Data Set for Nonparametric Object and Scene Recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence
Large scale natural image classification by sparsity exploration
ICASSP '09 Proceedings of the 2009 IEEE International Conference on Acoustics, Speech and Signal Processing
The 2005 PASCAL visual object classes challenge
MLCW'05 Proceedings of the First international conference on Machine Learning Challenges: evaluating Predictive Uncertainty Visual Object Classification, and Recognizing Textual Entailment
A3P: adaptive policy prediction for shared images over popular content sharing sites
Proceedings of the 22nd ACM conference on Hypertext and hypermedia
Optimization of robust loss functions for weakly-labeled image taxonomies: an imagenet case study
EMMCVPR'11 Proceedings of the 8th international conference on Energy minimization methods in computer vision and pattern recognition
Multiple region categorization for scenery images
ICIAP'11 Proceedings of the 16th international conference on Image analysis and processing: Part I
Learning to judge image search results
MM '11 Proceedings of the 19th ACM international conference on Multimedia
Building semantic hierarchies faithful to image semantics
MMM'12 Proceedings of the 18th international conference on Advances in Multimedia Modeling
All vehicles are cars: subclass preferences in container concepts
Proceedings of the 2nd ACM International Conference on Multimedia Retrieval
Proceedings of the 21st ACM international conference on Information and knowledge management
An efficient two-stage framework for image annotation
Pattern Recognition
Efficient object categorization with the surface-approximation polynomials descriptor
SC'12 Proceedings of the 2012 international conference on Spatial Cognition VIII
Bottom-up perceptual organization of images into object part hypotheses
ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part I
Taxonomic multi-class prediction and person layout using efficient structured ranking
ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part II
Metric learning for large scale image classification: generalizing to new classes at near-zero cost
ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part II
Nested sparse quantization for efficient feature coding
ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part II
Learning compact visual attributes for large-scale image classification
ECCV'12 Proceedings of the 12th international conference on Computer Vision - Volume Part III
Learning attribute-aware dictionary for image classification and search
Proceedings of the 3rd ACM conference on International conference on multimedia retrieval
The pooled NBNN kernel: beyond image-to-class and image-to-image
ACCV'12 Proceedings of the 11th Asian conference on Computer Vision - Volume Part I
Underwater live fish recognition using a balance-guaranteed optimized tree
ACCV'12 Proceedings of the 11th Asian conference on Computer Vision - Volume Part I
Enabling low bitrate mobile visual recognition: a performance versus bandwidth evaluation
Proceedings of the 21st ACM international conference on Multimedia
Flickr-tag prediction using multi-modal fusion and meta information
Proceedings of the 21st ACM international conference on Multimedia
Large scale visual classification with many classes
MLDM'13 Proceedings of the 9th international conference on Machine Learning and Data Mining in Pattern Recognition
Learning group-based dictionaries for discriminative image representation
Pattern Recognition
Semantic context based refinement for news video annotation
Multimedia Tools and Applications
A framework for selection and fusion of pattern classifiers in multimedia recognition
Pattern Recognition Letters
Multimedia event detection with multimodal feature fusion and temporal concept localization
Machine Vision and Applications
Image categorization using a semantic hierarchy model with sparse set of salient regions
Frontiers of Computer Science: Selected Publications from Chinese Universities
Image Classification with the Fisher Vector: Theory and Practice
International Journal of Computer Vision
Hi-index | 0.00 |
Image classification is a critical task for both humans and computers. One of the challenges lies in the large scale of the semantic space. In particular, humans can recognize tens of thousands of object classes and scenes. No computer vision algorithm today has been tested at this scale. This paper presents a study of large scale categorization including a series of challenging experiments on classification with more than 10, 000 image classes. We find that a) computational issues become crucial in algorithm design; b) conventional wisdom from a couple of hundred image categories on relative performance of different classifiers does not necessarily hold when the number of categories increases; c) there is a surprisingly strong relationship between the structure of WordNet (developed for studying language) and the difficulty of visual categorization; d) classification can be improved by exploiting the semantic hierarchy. Toward the future goal of developing automatic vision algorithms to recognize tens of thousands or even millions of image categories, we make a series of observations and arguments about dataset scale, category density, and image hierarchy.