Metric learning for large scale image classification: generalizing to new classes at near-zero cost

Authors:
Thomas Mensink;Jakob Verbeek;Florent Perronnin;Gabriela Csurka
Affiliations:
LEAR, INRIA Grenoble, Montbonnot, France,TVPA, Xerox Research Centre Europe, Meylan, France;LEAR, INRIA Grenoble, Montbonnot, France;TVPA, Xerox Research Centre Europe, Meylan, France;TVPA, Xerox Research Centre Europe, Meylan, France
Venue:
ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part II
Year:
2012

Citing 17
Cited 2

LESS: A Model-Based Classifier for Sparse Subspaces

IEEE Transactions on Pattern Analysis and Machine Intelligence
One-Shot Learning of Object Categories

IEEE Transactions on Pattern Analysis and Machine Intelligence
Local Features and Kernels for Classification of Texture and Object Categories: A Comprehensive Study

International Journal of Computer Vision
SIFT-Bag kernel for video event analysis

MM '08 Proceedings of the 16th ACM international conference on Multimedia
Distance Metric Learning for Large Margin Nearest Neighbor Classification

The Journal of Machine Learning Research
Large margin nearest local mean classifier

Signal Processing
Zero-data learning of new tasks

AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 2
Large Scale Online Learning of Image Similarity Through Ranking

The Journal of Machine Learning Research
Learning to rank with (a lot of) word features

Information Retrieval
Improving the fisher kernel for large-scale image classification

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part IV
What does classifying more than 10,000 image categories tell us?

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part V
Product Quantization for Nearest Neighbor Search

IEEE Transactions on Pattern Analysis and Machine Intelligence
Evaluating knowledge transfer and zero-shot learning in a large-scale setting

CVPR '11 Proceedings of the 2011 IEEE Conference on Computer Vision and Pattern Recognition
High-dimensional signature compression for large-scale image classification

CVPR '11 Proceedings of the 2011 IEEE Conference on Computer Vision and Pattern Recognition
Quantization

IEEE Transactions on Information Theory
WSABIE: scaling up to large vocabulary image annotation

IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Three
Aggregating Local Image Descriptors into Compact Codes

IEEE Transactions on Pattern Analysis and Machine Intelligence

A Multi-View Embedding Space for Modeling Internet Images, Tags, and Their Semantics

International Journal of Computer Vision
Image Classification with the Fisher Vector: Theory and Practice

International Journal of Computer Vision

Quantified Score

Hi-index	0.00

Visualization

Abstract

We are interested in large-scale image classification and especially in the setting where images corresponding to new or existing classes are continuously added to the training set. Our goal is to devise classifiers which can incorporate such images and classes on-the-fly at (near) zero cost. We cast this problem into one of learning a metric which is shared across all classes and explore k-nearest neighbor (k-NN) and nearest class mean (NCM) classifiers. We learn metrics on the ImageNet 2010 challenge data set, which contains more than 1.2M training images of 1K classes. Surprisingly, the NCM classifier compares favorably to the more flexible k-NN classifier, and has comparable performance to linear SVMs. We also study the generalization performance, among others by using the learned metric on the ImageNet-10K dataset, and we obtain competitive performance. Finally, we explore zero-shot classification, and show how the zero-shot model can be combined very effectively with small training datasets.