An efficient parts-based near-duplicate and sub-image retrieval system

Authors:
Yan Ke;Rahul Sukthankar;Larry Huston
Affiliations:
Carnegie Mellon University, Pittsburgh, PA;Carnegie Mellon University, Pittsburgh, PA and Intel Research Pittsburgh, Pittsburgh, PA;Intel Research Pittsburgh, Pittsburgh, PA
Venue:
Proceedings of the 12th annual ACM international conference on Multimedia
Year:
2004

Citing 12
Cited 131

Approximate nearest neighbors: towards removing the curse of dimensionality

STOC '98 Proceedings of the thirtieth annual ACM symposium on Theory of computing
Multi-scale sub-image search

MULTIMEDIA '99 Proceedings of the seventh ACM international conference on Multimedia (Part 2)
A software system for automatic albuming of consumer pictures

MULTIMEDIA '99 Proceedings of the seventh ACM international conference on Multimedia (Part 2)
Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography

Communications of the ACM
Affine/ Photometric Invariants for Planar Intensity Patterns

ECCV '96 Proceedings of the 4th European Conference on Computer Vision-Volume I - Volume I
Similarity Search in High Dimensions via Hashing

VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
Video Google: A Text Retrieval Approach to Object Matching in Videos

ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
Content based sub-image retrieval via hierarchical tree matching

MMDB '03 Proceedings of the 1st ACM international workshop on Multimedia databases
Robust content-based image searches for copyright protection

MMDB '03 Proceedings of the 1st ACM international workshop on Multimedia databases
Automated location matching in movies

Computer Vision and Image Understanding - Special isssue on video retrieval and summarization
Distinctive Image Features from Scale-Invariant Keypoints

International Journal of Computer Vision
PCA-SIFT: a more distinctive representation for local image descriptors

CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition

Enhanced Perceptual Distance Functions and Indexing for Image Replica Recognition

IEEE Transactions on Pattern Analysis and Machine Intelligence
IrisNet: an internet-scale architecture for multimedia sensors

Proceedings of the 13th annual ACM international conference on Multimedia
Semantic manifold learning for image retrieval

Proceedings of the 13th annual ACM international conference on Multimedia
A unified framework for resolving ambiguity in copy detection

Proceedings of the 13th annual ACM international conference on Multimedia
Photo-to-search: using multimodal queries to search the web from mobile devices

Proceedings of the 7th ACM SIGMM international workshop on Multimedia information retrieval
A case-study of scoring schemes for the PvS-index

Proceedings of the 2nd international workshop on Computer vision meets databases
Scalability of local image descriptors: a comparative study

MULTIMEDIA '06 Proceedings of the 14th annual ACM international conference on Multimedia
Robust voting algorithm based on labels of behavior for video copy detection

MULTIMEDIA '06 Proceedings of the 14th annual ACM international conference on Multimedia
Fast tracking of near-duplicate keyframes in broadcast domain with transitivity propagation

MULTIMEDIA '06 Proceedings of the 14th annual ACM international conference on Multimedia
Pruning SIFT for scalable near-duplicate image matching

ADC '07 Proceedings of the eighteenth conference on Australasian database - Volume 63
Near-duplicate keyframe retrieval with visual keywords and semantic context

Proceedings of the 6th ACM international conference on Image and video retrieval
Z-grid-based probabilistic retrieval for scaling up content-based copy detection

Proceedings of the 6th ACM international conference on Image and video retrieval
The use of temporal, semantic and visual partitioning model for efficient near-duplicate keyframe detection in large scale news corpus

Proceedings of the 6th ACM international conference on Image and video retrieval
Scalable near identical image and shot detection

Proceedings of the 6th ACM international conference on Image and video retrieval
Detection of near-duplicate images for web search

Proceedings of the 6th ACM international conference on Image and video retrieval
New local descriptors based on dissociated dipoles

Proceedings of the 6th ACM international conference on Image and video retrieval
Clustering near-duplicate images in large collections

Proceedings of the international workshop on Workshop on multimedia information retrieval
Novelty detection for cross-lingual news stories with visual duplicates and speech transcripts

Proceedings of the 15th international conference on Multimedia
Practical elimination of near-duplicates from web video search

Proceedings of the 15th international conference on Multimedia
UQLIPS: a real-time near-duplicate video clip detection system

VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Image retrieval: Ideas, influences, and trends of the new age

ACM Computing Surveys (CSUR)
Measuring novelty and redundancy with multiple modalities in cross-lingual broadcast news

Computer Vision and Image Understanding
Scalable landmark recognition using EXTENT

Multimedia Tools and Applications
Improving web information indexing and retrieval based on center block duplication detection

International Journal of Innovative Computing and Applications
Locality sensitive hash functions based on concomitant rank order statistics

Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Constructing visual phrases for effective and efficient object-based image retrieval

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Fast identification of visual documents using local descriptors

Proceedings of the eighth ACM symposium on Document engineering
How to Use SIFT Vectors to Analyze an Image with Database Templates

Adaptive Multimedial Retrieval: Retrieval, User, and Semantics
High-dimensional descriptor indexing for large multimedia databases

Proceedings of the 17th ACM conference on Information and knowledge management
Near-duplicate keyframe retrieval by nonrigid image matching

MM '08 Proceedings of the 16th ACM international conference on Multimedia
Object fingerprints for content analysis with applications to street landmark localization

MM '08 Proceedings of the 16th ACM international conference on Multimedia
A posteriori multi-probe locality sensitive hashing

MM '08 Proceedings of the 16th ACM international conference on Multimedia
Mixed-initiative photo collage authoring

MM '08 Proceedings of the 16th ACM international conference on Multimedia
Image near-duplicate retrieval using local dependencies in spatial-scale space

MM '08 Proceedings of the 16th ACM international conference on Multimedia
Automatic selection of representative photo and smart thumbnailing using near-duplicate detection

MM '08 Proceedings of the 16th ACM international conference on Multimedia
Accelerating near-duplicate video matching by combining visual similarity and alignment distortion

MM '08 Proceedings of the 16th ACM international conference on Multimedia
Finding near-duplicate images on the web using fingerprints

MM '08 Proceedings of the 16th ACM international conference on Multimedia
Large scale image copy detection evaluation

MIR '08 Proceedings of the 1st ACM international conference on Multimedia information retrieval
Spatio-temporal features for robust content-based video copy detection

MIR '08 Proceedings of the 1st ACM international conference on Multimedia information retrieval
A Fast and Effective Dichotomy Based Hash Algorithm for Image Matching

ISVC '08 Proceedings of the 4th International Symposium on Advances in Visual Computing
Visual word proximity and linguistics for semantic video indexing and near-duplicate retrieval

Computer Vision and Image Understanding
An Efficient Method for Near-Duplicate Video Detection

PCM '08 Proceedings of the 9th Pacific Rim Conference on Multimedia: Advances in Multimedia Information Processing
A New Framework for Constructing Accurate Affine Invariant Regions

IEICE - Transactions on Information and Systems
Automatic video tagging using content redundancy

Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
MESH-based active Monte Carlo recognition (MESH-AMCR)

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Indexing local configurations of features for scalable content-based video copy detection

LS-MMRM '09 Proceedings of the First ACM workshop on Large-scale multimedia retrieval and mining
Image copy detection using a robust gabor texture descriptor

LS-MMRM '09 Proceedings of the First ACM workshop on Large-scale multimedia retrieval and mining
An efficient key point quantization algorithm for large scale image retrieval

LS-MMRM '09 Proceedings of the First ACM workshop on Large-scale multimedia retrieval and mining
Query expansion for hash-based image object retrieval

MM '09 Proceedings of the 17th ACM international conference on Multimedia
Scalable detection of partial near-duplicate videos by visual-temporal consistency

MM '09 Proceedings of the 17th ACM international conference on Multimedia
Near-duplicate video matching with transformation recognition

MM '09 Proceedings of the 17th ACM international conference on Multimedia
Secure and robust SIFT

MM '09 Proceedings of the 17th ACM international conference on Multimedia
Fast near duplicate detection for personal image collections

MM '09 Proceedings of the 17th ACM international conference on Multimedia
MyFinder: near-duplicate detection for large image collections

MM '09 Proceedings of the 17th ACM international conference on Multimedia
Techniques for efficient and effective transformed image identification

Journal of Visual Communication and Image Representation
Places clustering of full-length film key-framesusing latent aspect modeling over SIFT matches

IEEE Transactions on Circuits and Systems for Video Technology
Video copy detection by fast sequence matching

Proceedings of the ACM International Conference on Image and Video Retrieval
Exploiting the human-machine gap in image recognition for designing CAPTCHAs

IEEE Transactions on Information Forensics and Security
Real-time near-duplicate elimination for web video search with content and context

IEEE Transactions on Multimedia - Special issue on integration of context and content
An efficient near-duplicate video shot detection method using shot-based interest points

IEEE Transactions on Multimedia
Content-based retrieval of logo and trademarks in unconstrained color image databases using Color Edge Gradient Co-occurrence Histograms

Computer Vision and Image Understanding
Scale-rotation invariant pattern entropy for keypoint-based near-duplicate detection

IEEE Transactions on Image Processing
Image replica detection system utilizing R-trees and linear discriminant analysis

Pattern Recognition
Fast min-hashing indexing and robust spatio-temporal matching for detecting video copies

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Multiple unordered wide-baseline image matching and grouping

ICME'09 Proceedings of the 2009 IEEE international conference on Multimedia and Expo
An online advertisement platform based on image content bidding

ICME'09 Proceedings of the 2009 IEEE international conference on Multimedia and Expo
Coherent phrase model for efficient image near-duplicate retrieval

IEEE Transactions on Multimedia
Querying spatial patterns

Proceedings of the 13th International Conference on Extending Database Technology
Scaling content-based video copy detection to very large databases

Multimedia Tools and Applications
Robust image copy detection using multi-resolution histogram

Proceedings of the international conference on Multimedia information retrieval
Stratification-based keyframe cliques for removal of near-duplicates in video search results

Proceedings of the international conference on Multimedia information retrieval
Consumer photo management and browsing facilitated by near-duplicate detection with feature filtering

Journal of Visual Communication and Image Representation
Fast approximate duplicate detection for 2D-NMR spectra

DILS'07 Proceedings of the 4th international conference on Data integration in the life sciences
Document retrieval using image features

Proceedings of the 2010 ACM Symposium on Applied Computing
Very fast concentric circle partition-based replica detection method

PSIVT'07 Proceedings of the 2nd Pacific Rim conference on Advances in image and video technology
Near-duplicate detection using a new framework of constructing accurate affine invariant regions

VISUAL'07 Proceedings of the 9th international conference on Advances in visual information systems
Using redundant bit vectors for near-duplicate image detection

DASFAA'07 Proceedings of the 12th international conference on Database systems for advanced applications
Computing parallel speeded-up robust features (P-SURF) via POSIX threads

ICIC'09 Proceedings of the 5th international conference on Emerging intelligent computing technology and applications
A low-dimensional local descriptor incorporating TPS warping for image matching

Image and Vision Computing
Locality sensitive hashing: A comparison of hash function types and querying mechanisms

Pattern Recognition Letters
Scalable clip-based near-duplicate video detection with ordinal measure

Proceedings of the ACM International Conference on Image and Video Retrieval
Automatic discovery of image families: global vs. local features

ICIP'09 Proceedings of the 16th IEEE international conference on Image processing
Comparative study of features for fingerprint indexing

ICIP'09 Proceedings of the 16th IEEE international conference on Image processing
Sub-image searching through intersection of local descriptors

Proceedings of the Third International Conference on SImilarity Search and APplications
Near-duplicate keyframe retrieval by semi-supervised learning and nonrigid image matching

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Boosting image object retrieval and indexing by automatically discovered pseudo-objects

Journal of Visual Communication and Image Representation
Monitoring near duplicates over video streams

Proceedings of the international conference on Multimedia
Bags of phrases with codebooks alignment for near duplicate image detection

Proceedings of the 2nd ACM workshop on Multimedia in forensics, security and intelligence
A large-scale performance study of cluster-based high-dimensional indexing

Proceedings of the international workshop on Very-large-scale multimedia corpus, mining and retrieval
Efficient incremental near duplicate detection based on locality sensitive hashing

DEXA'10 Proceedings of the 21st international conference on Database and expert systems applications: Part I
Partition min-hash for partial duplicate image discovery

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part I
A local bag-of-features model for large-scale object retrieval

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part VI
Beyond keypoints: novel techniques for content-based image matching and retrieval

ICAISC'10 Proceedings of the 10th international conference on Artificial intelligence and soft computing: Part I
Combination of local and global features for near-duplicate detection

MMM'11 Proceedings of the 17th international conference on Advances in multimedia modeling - Volume Part I
Advertisement image recognition for a location-based reminder system

MMM'11 Proceedings of the 17th international conference on Advances in multimedia modeling - Volume Part II
Fine-search for image copy detection based on local affine-invariant descriptor and spatial dependent matching

Multimedia Tools and Applications
A kernel density based approach for large scale image retrieval

Proceedings of the 1st ACM International Conference on Multimedia Retrieval
Beyond search: Event-driven summarization for web videos

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Structure tensor series-based matching for near-duplicate video retrieval

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Image collection summarization for search result overviewing on mobile devices

IMMPD '11 Proceedings of the 2011 international ACM workshop on Interactive multimedia on mobile and portable devices
Pursuing the holy grail by interrelating user intentions and bag of visual words to perform retrieval adaptation

SBNMA '11 Proceedings of the 2011 ACM workshop on Social and behavioural networked media access
Discovery of image versions in large collections

MMM'07 Proceedings of the 13th International conference on Multimedia Modeling - Volume Part II
Effective content tracking for digital rights management in digital libraries

ECDL'06 Proceedings of the 10th European conference on Research and Advanced Technology for Digital Libraries
Fast approximated SIFT

ACCV'06 Proceedings of the 7th Asian conference on Computer Vision - Volume Part I
BASIL: effective near-duplicate image detection using gene sequence alignment

ECIR'2010 Proceedings of the 32nd European conference on Advances in Information Retrieval
Clustering-Based descriptors for fingerprint indexing and fast retrieval

ACCV'09 Proceedings of the 9th Asian conference on Computer Vision - Volume Part I
Keyframe retrieval by keypoints: can point-to-point matching help?

CIVR'06 Proceedings of the 5th international conference on Image and Video Retrieval
A hierarchical approach to practical beverage package recognition

PSIVT'11 Proceedings of the 5th Pacific Rim conference on Advances in Image and Video Technology - Volume Part I
Near-duplicate video detection featuring coupled temporal and perceptual visual structures and logical inference based matching

Information Processing and Management: an International Journal
Proximity-Based order-respecting intersection for searching in image databases

AMR'10 Proceedings of the 8th international conference on Adaptive Multimedia Retrieval: context, exploration, and fusion
Improved SIFT-features matching for object recognition

VoCS'08 Proceedings of the 2008 international conference on Visions of Computer Science: BCS International Academic Conference
High-confidence near-duplicate image detection

Proceedings of the 2nd ACM International Conference on Multimedia Retrieval
Bayesian approach for near-duplicate image detection

Proceedings of the 2nd ACM International Conference on Multimedia Retrieval
Constrained keypoint quantization: towards better bag-of-words model for large-scale multimedia retrieval

Proceedings of the 2nd ACM International Conference on Multimedia Retrieval
Distributed KNN-graph approximation via hashing

Proceedings of the 2nd ACM International Conference on Multimedia Retrieval
A visual approach for video geocoding using bag-of-scenes

Proceedings of the 2nd ACM International Conference on Multimedia Retrieval
Submodular video hashing: a unified framework towards video pooling and indexing

Proceedings of the 20th ACM international conference on Multimedia
Constraint-optimized keypoint inhibition/insertion attack: security threat to scale-space image feature extraction

Proceedings of the 20th ACM international conference on Multimedia
Sparsity cue in image copy detection

Proceedings of the 20th ACM international conference on Multimedia
Towards indexing representative images on the web

Proceedings of the 20th ACM international conference on Multimedia
Quality assurance for document image collections in digital preservation

ACIVS'12 Proceedings of the 14th international conference on Advanced Concepts for Intelligent Vision Systems
Detection of near-duplicate patches in random images using keypoint-based features

ACIVS'12 Proceedings of the 14th international conference on Advanced Concepts for Intelligent Vision Systems
An expert system for quality assurance of document image collections

EuroMed'12 Proceedings of the 4th international conference on Progress in Cultural Heritage Preservation
Archive Film Comparison

International Journal of Multimedia Data Engineering & Management
Content-Based Keyframe Clustering Using Near Duplicate Keyframe Identification

International Journal of Multimedia Data Engineering & Management
An improved method of locality sensitive hashing for indexing large-scale and high-dimensional features

Signal Processing
Fast image copy detection approach based on local fingerprint defined visual words

Signal Processing
SIFT on manifold: An intrinsic description

Neurocomputing
Locality sensitive hashing revisited: filling the gap between theory and algorithm analysis

Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Saliency-Based region log covariance feature for image copy detection

IWDW'12 Proceedings of the 11th international conference on Digital Forensics and Watermaking
Duplicate detection approaches for quality assurance of document image collections

Proceedings of the Fifth International Conference on Management of Emergent Digital EcoSystems

Quantified Score

Hi-index	0.00

Visualization

Abstract

We introduce a system for near-duplicate detection and sub-image retrieval. Such a system is useful for finding copyright violations and detecting forged images. We define near-duplicate as images altered with common transformations such as changing contrast, saturation, scaling, cropping, framing, etc. Our system builds a parts-based representation of images using distinctive local descriptors which give high quality matches even under severe transformations. To cope with the large number of features extracted from the images, we employ locality-sensitive hashing to index the local descriptors. This allows us to make approximate similarity queries that only examine a small fraction of the database. Although locality-sensitive hashing has excellent theoretical performance properties, a standard implementation would still be unacceptably slow for this application. We show that, by optimizing layout and access to the index data on disk, we can efficiently query indices containing millions of keypoints. Our system achieves near-perfect accuracy (100% precision at 99.85% recall) on the tests presented in Meng et al. [16], and consistently strong results on our own, significantly more challenging experiments. Query times are interactive even for collections of thousands of images.