Near-duplicate keyframe retrieval with visual keywords and semantic context

  • Authors:
  • Xiao Wu; Wan-Lei Zhao; Chong-Wah Ngo

  • Affiliations:
  • City University of Hong Kong, Hong Kong; City University of Hong Kong, Hong Kong; City University of Hong Kong, Hong Kong

  • Venue:
  • Proceedings of the 6th ACM international conference on Image and video retrieval
  • Year:
  • 2007

Abstract

Near-duplicate keyframes (NDKs) play a unique role in large-scale video search and in news topic detection and tracking. In this paper, we propose a novel NDK retrieval approach that explores both visual and textual cues, drawn from a visual vocabulary and from semantic context respectively. The vocabulary, which provides entries for visual keywords, is formed by clustering local keypoints. The semantic context is inferred from the speech transcript surrounding a keyframe. We evaluate the usefulness of visual keywords and semantic context, separately and jointly, using cosine similarity and language models. By linearly fusing both modalities, we report a performance improvement over keypoint-matching techniques. While keypoint matching is computationally expensive because it requires online nearest-neighbor search, our approach is both effective and efficient enough for online video search.
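The retrieval scheme the abstract describes — visual-word histograms compared by cosine similarity, then linearly fused with a textual similarity — can be sketched as follows. This is a minimal illustration, not the paper's implementation: the visual-word IDs, transcript words, and the fusion weight `alpha` are all invented for the example, and the paper's language-model scoring is replaced here by cosine similarity on both modalities.

```python
import math
from collections import Counter

def cosine_similarity(a, b):
    """Cosine similarity between two sparse term-frequency vectors (dicts)."""
    dot = sum(a[k] * b.get(k, 0) for k in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)

def fused_score(visual_sim, text_sim, alpha=0.5):
    """Linear fusion of the two modality similarities.
    alpha is an assumed weight; the paper's actual value is not given here."""
    return alpha * visual_sim + (1 - alpha) * text_sim

# A keyframe as a bag of visual-word IDs (quantized local keypoints);
# the IDs below are hypothetical cluster indices.
kf1_visual = Counter([3, 7, 7, 12, 42])
kf2_visual = Counter([3, 7, 12, 12, 99])

# Semantic context: words from the speech transcript around each keyframe
# (toy text for illustration).
kf1_text = Counter("president talks at summit".split())
kf2_text = Counter("president summit meeting".split())

v = cosine_similarity(kf1_visual, kf2_visual)
t = cosine_similarity(kf1_text, kf2_text)
score = fused_score(v, t, alpha=0.6)
```

Ranking candidate keyframes by `score` against a query keyframe avoids the per-query nearest-neighbor search over raw keypoint descriptors that makes direct matching expensive.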