Fast similarity search and clustering of video sequences on the world-wide-web

Authors:
S.-S. Cheung; A. Zakhor
Affiliations:
Univ. of California, Berkeley, CA, USA
Venue:
IEEE Transactions on Multimedia
Year:
2005

Abstract

We define similar video content as video sequences with almost identical content that may have been compressed at different qualities, reformatted to different sizes and frame rates, subjected to minor editing in either the spatial or the temporal domain, or summarized into keyframe sequences. Building a search engine to identify such similar content on the World Wide Web requires: 1) robust video similarity measurements; 2) fast similarity search techniques on large databases; and 3) intuitive organization of search results. In a previous paper, we proposed a randomized technique called the video signature (ViSig) method for video similarity measurement. In this paper, we focus on the remaining two issues by proposing a feature extraction scheme for fast similarity search and a clustering algorithm for identifying clusters of similar video. Like many other content-based methods, the ViSig method uses high-dimensional feature vectors to represent video. To ensure a fast response time for similarity searches on high-dimensional vectors, we propose a novel nonlinear feature extraction scheme on arbitrary metric spaces that combines the triangle inequality with classical Principal Component Analysis (PCA). We show experimentally that the proposed technique outperforms PCA, Fastmap, Triangle-Inequality Pruning, and the Haar wavelet on signature data. To further improve retrieval performance and to better organize similarity search results, we introduce a new graph-theoretical clustering algorithm for large databases of signatures. This algorithm treats all signatures as an abstract threshold graph, where the distance threshold is determined from local data statistics. Clusters of similar video are then identified as highly connected regions in the graph. By measuring retrieval performance against a ground-truth set, we show that our proposed algorithm outperforms simple thresholding, single-link, and complete-link hierarchical clustering techniques.
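
The threshold-graph idea in the abstract can be illustrated with a rough sketch. This is not the authors' algorithm: it assumes a single global distance threshold (the paper derives the threshold from local data statistics), it uses plain connected components as a stand-in for "highly connected regions", and it treats signatures as short tuples compared by Euclidean distance. The function name `threshold_graph_clusters` and the sample data are hypothetical.

```python
from itertools import combinations
from math import dist


def threshold_graph_clusters(points, threshold):
    """Group points into connected components of a threshold graph.

    Two points share an edge when their Euclidean distance is at most
    `threshold`. ASSUMPTION: one global threshold and plain connected
    components, which simplifies the scheme the abstract describes.
    """
    parent = list(range(len(points)))

    def find(i):
        # Follow parent links to the root, halving the path on the way.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    # Add an edge (merge two components) for every pair within the threshold.
    for i, j in combinations(range(len(points)), 2):
        if dist(points[i], points[j]) <= threshold:
            parent[find(i)] = find(j)

    # Collect point indices by the root of their component.
    clusters = {}
    for i in range(len(points)):
        clusters.setdefault(find(i), []).append(i)
    return sorted(clusters.values())


# Hypothetical 2-D "signatures": two tight groups and one outlier.
signatures = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1),
              (5.0, 5.0), (5.1, 5.0), (9.0, 0.0)]
clusters = threshold_graph_clusters(signatures, threshold=0.5)
print(clusters)
```

The pairwise loop is quadratic in the number of signatures, so a sketch like this only suits small collections; on a large database the fast similarity search mentioned in the abstract would be needed to find candidate neighbors.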