Real-Time Visual Concept Classification

Authors:
J. R.R. Uijlings;A. W.M. Smeulders;R. J.H. Scha
Affiliations:
Intell. Syst. Lab. Amsterdam, Univ. of Amsterdam, Amsterdam, Netherlands;-;-
Venue:
IEEE Transactions on Multimedia
Year:
2010

Citing 0
Cited 16

Social negative bootstrapping for visual categorization

Proceedings of the 1st ACM International Conference on Multimedia Retrieval
Instant Bag-of-Words served on a laptop

Proceedings of the 1st ACM International Conference on Multimedia Retrieval
Classroom video assessment and retrieval via multiple instance learning

AIED'11 Proceedings of the 15th international conference on Artificial intelligence in education
Text and image subject classifiers: dense works better

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Exploitation of time constraints for (sub-)event recognition

J-MRE '11 Proceedings of the 2011 joint ACM workshop on Modeling and representing events
Fusing concept detection and geo context for visual search

Proceedings of the 2nd ACM International Conference on Multimedia Retrieval
SUPER: towards real-time event recognition in internet videos

Proceedings of the 2nd ACM International Conference on Multimedia Retrieval
Categorization of a collection of pictures into structured events

Proceedings of the 2nd ACM International Conference on Multimedia Retrieval
Robust image retrieval using bag of visual words with fuzzy codebooks and fuzzy assignment

Proceedings of the 12th International Conference on Knowledge Management and Knowledge Technologies
In the eye of the beholder: employing statistical analysis and eye tracking for analyzing abstract paintings

Proceedings of the 20th ACM international conference on Multimedia
Distributional semantics with eyes: using image analysis to improve computational representations of word meaning

Proceedings of the 20th ACM international conference on Multimedia
Enhanced representation and multi-task learning for image annotation

Computer Vision and Image Understanding
Exploiting language models to recognize unseen actions

Proceedings of the 3rd ACM conference on International conference on multimedia retrieval
Content-based tag propagation and tensor factorization for personalized item recommendation based on social tagging

ACM Transactions on Interactive Intelligent Systems (TiiS)
Selective Search for Object Recognition

International Journal of Computer Vision
Continuous human action recognition in real time

Multimedia Tools and Applications

Quantified Score

Hi-index	0.00

Visualization

Abstract

As datasets grow increasingly large in content-based image and video retrieval, computational efficiency of concept classification is important. This paper reviews techniques to accelerate concept classification, where we show the trade-off between computational efficiency and accuracy. As a basis, we use the Bag-of-Words algorithm that in the 2008 benchmarks of TRECVID and PASCAL lead to the best performance scores. We divide the evaluation in three steps: 1) Descriptor Extraction, where we evaluate SIFT, SURF, DAISY, and Semantic Textons. 2) Visual Word Assignment, where we compare a k-means visual vocabulary with a Random Forest and evaluate subsampling, dimension reduction with PCA, and division strategies of the Spatial Pyramid. 3) Classification, where we evaluate the χ2, RBF, and Fast Histogram Intersection kernel for the SVM. Apart from the evaluation, we accelerate the calculation of densely sampled SIFT and SURF, accelerate nearest neighbor assignment, and improve accuracy of the Histogram Intersection kernel. We conclude by discussing whether further acceleration of the Bag-of-Words pipeline is possible. Our results lead to a 7-fold speed increase without accuracy loss, and a 70-fold speed increase with 3% accuracy loss. The latter system does classification in real-time, which opens up new applications for automatic concept classification. For example, this system permits five standard desktop PCs to automatically tag for 20 classes all images that are currently uploaded to Flickr.