Clustering Billions of Images with Large Scale Nearest Neighbor Search

Authors:
Ting Liu; Charles Rosenberg; Henry A. Rowley
Affiliations:
Google Inc., Mountain View, CA, USA (all authors)
Venue:
WACV '07 Proceedings of the Eighth IEEE Workshop on Applications of Computer Vision
Year:
2007

Abstract

The proliferation of the web and digital photography has made large-scale image collections containing billions of images a reality. Collections on this scale make even the most common and simple computer vision, image processing, and machine learning tasks non-trivial. One example is nearest neighbor search, which not only serves as a fundamental subproblem in many more sophisticated algorithms but also has direct applications such as image retrieval and image clustering. In this paper, we address the nearest neighbor problem as the first step towards scalable image processing. We describe a scalable version of an approximate nearest neighbor search algorithm and discuss how it can be used to find near duplicates among over a billion images.
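
To make the idea of finding near duplicates through approximate nearest neighbor search concrete, the following is a minimal single-machine sketch. It is not the algorithm described in the paper, which the abstract does not specify. As a stand-in it uses random-hyperplane locality-sensitive hashing to propose candidate pairs, checks each candidate with an exact distance test, and merges confirmed pairs with union-find. The function names, the feature vectors, and the parameters `num_planes`, `num_tables`, and `threshold` are all assumptions made for illustration.

```python
import random
from collections import defaultdict
from itertools import combinations


def signature(vec, planes):
    """Hash a vector to a tuple of bits, one bit per random hyperplane."""
    return tuple(
        sum(v * p for v, p in zip(vec, plane)) >= 0.0 for plane in planes
    )


def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def near_duplicate_clusters(vectors, num_planes=8, num_tables=4,
                            threshold=0.1, seed=0):
    """Group vectors whose Euclidean distance is at most `threshold`.

    Illustrative stand-in only: candidate pairs come from hash buckets, so
    close pairs that never share a bucket are missed (the search is
    approximate). All parameter names and defaults are hypothetical.
    """
    rng = random.Random(seed)
    dim = len(vectors[0])
    parent = list(range(len(vectors)))  # union-find forest

    def find(i):
        # Follow parent links to the root, halving the path as we go.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for _ in range(num_tables):
        # A fresh set of Gaussian hyperplanes per table raises recall.
        planes = [[rng.gauss(0.0, 1.0) for _ in range(dim)]
                  for _ in range(num_planes)]
        buckets = defaultdict(list)
        for idx, vec in enumerate(vectors):
            buckets[signature(vec, planes)].append(idx)
        # Only vectors that share a bucket are compared exactly.
        for members in buckets.values():
            for i, j in combinations(members, 2):
                if squared_distance(vectors[i], vectors[j]) <= threshold ** 2:
                    parent[find(i)] = find(j)

    clusters = defaultdict(list)
    for idx in range(len(vectors)):
        clusters[find(idx)].append(idx)
    return sorted(clusters.values())
```

Calling `near_duplicate_clusters(features)` on a list of equal-length feature vectors returns lists of indices, one list per group of near duplicates. The point of the sketch is that exact comparisons are confined to small buckets instead of all pairs, which is what makes this family of methods plausible at large scale. Distributing the buckets across machines is a separate concern that the sketch does not attempt.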