Large-scale image classification: Fast feature extraction and SVM training

Authors:
Yuanqing Lin; Fengjun Lv; Shenghuo Zhu; Ming Yang; T. Cour; Kai Yu; Liangliang Cao; T. Huang
Affiliations:
NEC Labs. America, Cupertino, CA, USA; NEC Labs. America, Cupertino, CA, USA; NEC Labs. America, Cupertino, CA, USA; NEC Labs. America, Cupertino, CA, USA; NEC Labs. America, Cupertino, CA, USA; NEC Labs. America, Cupertino, CA, USA; Beckman Inst., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA; Beckman Inst., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
Venue:
CVPR '11 Proceedings of the 2011 IEEE Conference on Computer Vision and Pattern Recognition
Year:
2011

Abstract

Most research on image classification so far has focused on medium-scale datasets, often defined as datasets that fit into the memory of a desktop machine (typically 4 GB to 48 GB). There are two main reasons for the limited effort on large-scale image classification. First, until the emergence of the ImageNet dataset, there was almost no publicly available large-scale benchmark data for image classification, mostly because class labels are expensive to obtain. Second, large-scale classification is hard because it poses more challenges than its medium-scale counterpart. A key challenge is how to achieve efficiency in both feature extraction and classifier training without compromising performance. This paper shows how we address this challenge, using the ImageNet dataset as an example. For feature extraction, we develop a Hadoop scheme that performs feature extraction in parallel using hundreds of mappers. This allows us to extract fairly sophisticated features (with hundreds of thousands of dimensions) from 1.2 million images within one day. For SVM training, we develop a parallel averaging stochastic gradient descent (ASGD) algorithm for training one-against-all 1000-class SVM classifiers. The ASGD algorithm is capable of dealing with terabytes of training data and converges very fast: typically 5 epochs are sufficient. As a result, we achieve state-of-the-art performance on the ImageNet 1000-class classification task: 52.9% classification accuracy and 71.8% top-5 hit rate.
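
To make the SVM training idea in the abstract concrete, the following is a minimal single-process sketch of averaged SGD on the L2-regularized hinge loss for one-against-all linear SVMs. It is an illustration under stated assumptions, not the paper's implementation: the function names (`asgd_one_vs_all`, `predict`), the step-size schedule, the constants `lr0` and `lam`, and the omission of a bias term are all hypothetical choices made here, and the parallelization described in the abstract is not shown.

```python
import numpy as np


def asgd_one_vs_all(X, y, n_classes, epochs=5, lr0=0.1, lam=1e-4, seed=0):
    """Averaged SGD for one-against-all linear SVMs (hinge loss, no bias term).

    X: (n, d) feature matrix; y: (n,) integer labels in [0, n_classes).
    Returns the averaged weight matrix of shape (n_classes, d).
    All hyperparameter defaults are illustrative assumptions.
    """
    rng = np.random.default_rng(seed)
    n, d = X.shape
    W = np.zeros((n_classes, d))      # current SGD iterates, one row per class
    W_avg = np.zeros((n_classes, d))  # running average of all iterates
    t = 0
    for _ in range(epochs):
        for i in rng.permutation(n):
            t += 1
            # Decaying step size (an assumed schedule, not taken from the paper).
            lr = lr0 / (1.0 + lr0 * lam * t) ** 0.75
            # One-against-all targets: +1 for the true class, -1 for the rest.
            targets = -np.ones(n_classes)
            targets[y[i]] = 1.0
            margins = targets * (W @ X[i])
            active = margins < 1.0            # classifiers violating the margin
            W *= 1.0 - lr * lam               # shrinkage from the L2 regularizer
            W[active] += lr * targets[active, None] * X[i]  # hinge subgradient step
            W_avg += (W - W_avg) / t          # incremental mean of the iterates
    return W_avg


def predict(W, X):
    """Assign each row of X to the class whose classifier scores highest."""
    return np.argmax(X @ W.T, axis=1)
```

Averaging the iterates rather than returning the last one is what distinguishes ASGD from plain SGD; all class-wise classifiers share each sampled example, so one pass over the data updates every one-against-all classifier at once.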