Scalable near identical image and shot detection

Authors:
Ondřej Chum;James Philbin;Michael Isard;Andrew Zisserman
Affiliations:
University of Oxford;University of Oxford;Microsoft Research, Silicon Valley;University of Oxford
Venue:
Proceedings of the 6th ACM international conference on Image and video retrieval
Year:
2007

Abstract

This paper proposes and compares two novel schemes for near-duplicate image and video-shot detection. The first approach is based on global hierarchical colour histograms, using Locality Sensitive Hashing for fast retrieval. The second approach uses local feature descriptors (SIFT) and, for retrieval, exploits techniques from the information retrieval community that compute approximate set intersections between documents using a min-Hash algorithm. The requirements for near-duplicate images vary according to the application, and we address two definitions of near-duplicate: (i) being perceptually identical (e.g., up to noise, discretization effects, and small photometric distortions); and (ii) being images of the same 3D scene (so allowing for viewpoint changes and partial occlusion). We define two shots to be near-duplicates if they share a large percentage of near-duplicate frames. We focus primarily on scalability to very large image and video databases, where fast query processing is necessary. Both methods are designed so that only a small amount of data needs to be stored for each image. In the case of near-duplicate shot detection, it is shown that a weak approximation to histogram matching, consuming substantially less storage, is sufficient for good results. We demonstrate our methods on the TRECVID 2006 data set, which contains approximately 165 hours of video (about 17.8M frames with 146K key frames), and also on feature films and pop videos.
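
The min-Hash idea mentioned in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: it assumes each image has already been reduced to a non-empty set of non-negative integer "visual word" ids (for example, quantized SIFT descriptors), and the function names, the linear hash family, and the prime modulus are all illustrative choices. The fraction of sketch positions on which two images agree approximates the overlap (Jaccard similarity) of their word sets, so only the short sketch has to be stored per image.

```python
import random

# Assumption: visual word ids are non-negative integers smaller than PRIME.
PRIME = 2_147_483_647  # Mersenne prime 2^31 - 1, used as the hash modulus


def make_hash_params(num_hashes, seed=0):
    """Draw (a, b) pairs for hash functions h(w) = (a * w + b) mod PRIME."""
    rng = random.Random(seed)
    return [(rng.randrange(1, PRIME), rng.randrange(0, PRIME))
            for _ in range(num_hashes)]


def minhash_sketch(words, params):
    """Return one minimum hash value per hash function for a non-empty word set."""
    return [min((a * w + b) % PRIME for w in words) for a, b in params]


def estimate_similarity(sketch_a, sketch_b):
    """Fraction of agreeing positions, an estimate of |A & B| / |A | B|."""
    matches = sum(1 for x, y in zip(sketch_a, sketch_b) if x == y)
    return matches / len(sketch_a)


if __name__ == "__main__":
    params = make_hash_params(num_hashes=512)
    # Hypothetical word sets: 150 random ids, two images sharing 50 of them.
    ids = random.Random(1).sample(range(1_000_000), 150)
    image_a, image_b = set(ids[:100]), set(ids[50:])
    estimate = estimate_similarity(minhash_sketch(image_a, params),
                                   minhash_sketch(image_b, params))
    print(f"estimated similarity: {estimate:.3f} (true value is 1/3)")
```

More hash functions give a tighter estimate at the cost of a longer sketch per image. A large-scale system would typically also group sketch values into hash keys so that candidate pairs can be found without comparing every pair of images; that step is omitted here.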