Video Google: A Text Retrieval Approach to Object Matching in Videos
ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
Distinctive Image Features from Scale-Invariant Keypoints
International Journal of Computer Vision
A Comparison of Affine Region Detectors
International Journal of Computer Vision
Fast codebook search algorithms based on tree-structured vector quantization
Pattern Recognition Letters
Evaluation campaigns and TRECVid
MIR '06 Proceedings of the 8th ACM international workshop on Multimedia information retrieval
The challenge problem for automated detection of 101 semantic concepts in multimedia
MULTIMEDIA '06 Proceedings of the 14th annual ACM international conference on Multimedia
International Journal of Computer Vision
Scan primitives for GPU computing
Proceedings of the 22nd ACM SIGGRAPH/EUROGRAPHICS symposium on Graphics hardware
Video diver: generic video indexing with diverse features
Proceedings of the international workshop on Workshop on multimedia information retrieval
Image retrieval: Ideas, influences, and trends of the new age
ACM Computing Surveys (CSUR)
Speeded-Up Robust Features (SURF)
Computer Vision and Image Understanding
Fast support vector machine training and classification on graphics processors
Proceedings of the 25th international conference on Machine learning
Speed Up SVM Algorithm for Massive Classification Tasks
ADMA '08 Proceedings of the 4th international conference on Advanced Data Mining and Applications
Efficient Kernel Discriminant Analysis via Spectral Regression
ICDM '07 Proceedings of the 2007 Seventh IEEE International Conference on Data Mining
Real-time bag of words, approximately
Proceedings of the ACM International Conference on Image and Video Retrieval
The Pascal Visual Object Classes (VOC) Challenge
International Journal of Computer Vision
IEEE Transactions on Pattern Analysis and Machine Intelligence
Evaluating Color Descriptors for Object and Scene Recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence
Feature tracking and matching in video using programmable graphics hardware
Machine Vision and Applications
Representations of Keypoint-Based Semantic Concept Detection: A Comprehensive Study
IEEE Transactions on Multimedia
Empowering Visual Categorization With the GPU
IEEE Transactions on Multimedia
Fast anisotropic Gauss filtering
IEEE Transactions on Image Processing
Hi-index | 0.00 |
Visual categorization is important to manage large collections of digital images and video, where textual meta-data is often incomplete or simply unavailable. The bag-of-words model has become the most powerful method for visual categorization of images and video. Despite its high accuracy, a severe drawback of this model is its high computational cost. As the trend to increase computational power in newer CPU and GPU architectures is to increase their level of parallelism, exploiting this parallelism becomes an important direction to handle the computational cost of the bag-of-words approach. In this paper, we analyze the bag-of-words model for visual categorization in terms of computational cost and identify two major bottlenecks: the quantization step and the classification step. We address these two bottlenecks by proposing two efficient algorithms for quantization and classification by exploiting the GPU hardware and the CUDA parallel programming model. The algorithms are designed to keep categorization accuracy intact and give the same numerical results. In the experiments on large scale datasets it is shown that, by using a parallel implementation on the GPU, quantization is 28 times faster and classification is 35 faster than a single-threaded CPU version, while giving the exact same numerical results. The GPU accelerations are applicable to both the learning phase and the testing phase of visual categorization systems. For software visit http://www.colordescriptors.com/ .