Accelerating visual categorization with the GPU

Authors:
Koen E. A. van de Sande;Theo Gevers;Cees G. M. Snoek
Affiliations:
Intelligent Systems Lab Amsterdam (ISLA), University of Amsterdam, Amsterdam, The Netherlands;Intelligent Systems Lab Amsterdam (ISLA), University of Amsterdam, Amsterdam, The Netherlands;Intelligent Systems Lab Amsterdam (ISLA), University of Amsterdam, Amsterdam, The Netherlands
Venue:
ECCV'10 Proceedings of the 11th European conference on Trends and Topics in Computer Vision - Volume Part II
Year:
2010

Citing 22
Cited 0

Video Google: A Text Retrieval Approach to Object Matching in Videos

ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
Distinctive Image Features from Scale-Invariant Keypoints

International Journal of Computer Vision
A Comparison of Affine Region Detectors

International Journal of Computer Vision
Fast codebook search algorithms based on tree-structured vector quantization

Pattern Recognition Letters
Evaluation campaigns and TRECVid

MIR '06 Proceedings of the 8th ACM international workshop on Multimedia information retrieval
The challenge problem for automated detection of 101 semantic concepts in multimedia

MULTIMEDIA '06 Proceedings of the 14th annual ACM international conference on Multimedia
Local Features and Kernels for Classification of Texture and Object Categories: A Comprehensive Study

International Journal of Computer Vision
Scan primitives for GPU computing

Proceedings of the 22nd ACM SIGGRAPH/EUROGRAPHICS symposium on Graphics hardware
Video diver: generic video indexing with diverse features

Proceedings of the international workshop on Workshop on multimedia information retrieval
Image retrieval: Ideas, influences, and trends of the new age

ACM Computing Surveys (CSUR)
Speeded-Up Robust Features (SURF)

Computer Vision and Image Understanding
Fast support vector machine training and classification on graphics processors

Proceedings of the 25th international conference on Machine learning
Speed Up SVM Algorithm for Massive Classification Tasks

ADMA '08 Proceedings of the 4th international conference on Advanced Data Mining and Applications
Efficient Kernel Discriminant Analysis via Spectral Regression

ICDM '07 Proceedings of the 2007 Seventh IEEE International Conference on Data Mining
Real-time bag of words, approximately

Proceedings of the ACM International Conference on Image and Video Retrieval
The Pascal Visual Object Classes (VOC) Challenge

International Journal of Computer Vision
Visual Word Ambiguity

IEEE Transactions on Pattern Analysis and Machine Intelligence
Evaluating Color Descriptors for Object and Scene Recognition

IEEE Transactions on Pattern Analysis and Machine Intelligence
Feature tracking and matching in video using programmable graphics hardware

Machine Vision and Applications
Representations of Keypoint-Based Semantic Concept Detection: A Comprehensive Study

IEEE Transactions on Multimedia
Empowering Visual Categorization With the GPU

IEEE Transactions on Multimedia
Fast anisotropic Gauss filtering

IEEE Transactions on Image Processing

Quantified Score

Hi-index	0.00

Visualization

Abstract

Visual categorization is important to manage large collections of digital images and video, where textual meta-data is often incomplete or simply unavailable. The bag-of-words model has become the most powerful method for visual categorization of images and video. Despite its high accuracy, a severe drawback of this model is its high computational cost. As the trend to increase computational power in newer CPU and GPU architectures is to increase their level of parallelism, exploiting this parallelism becomes an important direction to handle the computational cost of the bag-of-words approach. In this paper, we analyze the bag-of-words model for visual categorization in terms of computational cost and identify two major bottlenecks: the quantization step and the classification step. We address these two bottlenecks by proposing two efficient algorithms for quantization and classification by exploiting the GPU hardware and the CUDA parallel programming model. The algorithms are designed to keep categorization accuracy intact and give the same numerical results. In the experiments on large scale datasets it is shown that, by using a parallel implementation on the GPU, quantization is 28 times faster and classification is 35 faster than a single-threaded CPU version, while giving the exact same numerical results. The GPU accelerations are applicable to both the learning phase and the testing phase of visual categorization systems. For software visit http://www.colordescriptors.com/ .