The use of temporal, semantic and visual partitioning model for efficient near-duplicate keyframe detection in large scale news corpus

Authors:
Yan-Tao Zheng;Shi-Yong Neo;Tat-Seng Chua;Qi Tian
Affiliations:
National University of Singapore, Singapore;National University of Singapore, Singapore;National University of Singapore, Singapore;Institute for Infocomm Research, Singapore
Venue:
Proceedings of the 6th ACM international conference on Image and video retrieval
Year:
2007

Abstract

Near-duplicate keyframes (NDKs) are important visual cues for linking news stories across different TV channels, times, languages, etc. However, the quadratic complexity of NDK detection renders it intractable in a large-scale news video corpus. To address this issue, we propose a temporal, semantic and visual partitioning model that divides the corpus into small overlapping partitions by exploiting domain knowledge and corpus characteristics. This enables us to efficiently detect NDKs in each partition separately and then link them together across partitions. We divide the corpus temporally into sequential partitions and semantically into news story genre groups; within each partition, we visually group potential NDKs by applying asymmetric hierarchical k-means clustering to our proposed semi-global image features. In each visual group, we detect NDK pairs with our proposed SIFT-based fast keypoint matching scheme, which uses the local color information of keypoints. Finally, the NDK groups detected in each partition are linked via transitivity propagation of NDKs shared by different partitions. Testing on the TRECVID 06 corpus with 62k keyframes shows that the proposed approach can yield a multifold increase in speed over the best reported approach and complete NDK detection in a manageable time with satisfactory accuracy.
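The partition-then-link idea can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say how transitivity propagation is carried out, so union-find is used here as one plausible choice. The function names (`temporal_partitions`, `link_groups`, `pair_count`) and the window and overlap sizes are hypothetical. Keyframes are represented by integer indices in temporal order, and the per-partition NDK groups are assumed to be already detected.

```python
def pair_count(n):
    """Number of unordered keyframe pairs among n keyframes (the quadratic cost)."""
    return n * (n - 1) // 2


def temporal_partitions(n_items, size, overlap):
    """Split indices 0..n_items-1 into sequential windows sharing `overlap` items.

    Assumes size > overlap >= 0 (an illustrative constraint, not from the paper).
    """
    step = size - overlap
    parts = []
    start = 0
    while True:
        parts.append(list(range(start, min(start + size, n_items))))
        if start + size >= n_items:
            break
        start += step
    return parts


def link_groups(groups_per_partition):
    """Merge NDK groups that share a keyframe (transitivity), using union-find."""
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving keeps trees shallow
            x = parent[x]
        return x

    for groups in groups_per_partition:
        for group in groups:
            if not group:
                continue
            root = find(group[0])
            for other in group[1:]:
                parent[find(other)] = root

    merged = {}
    for x in parent:
        merged.setdefault(find(x), set()).add(x)
    return sorted(sorted(g) for g in merged.values())


# Toy example: 10 keyframes, windows of 4 that overlap by 1 keyframe.
parts = temporal_partitions(10, size=4, overlap=1)
# Hypothetical per-partition detections; keyframe 3 is shared by two windows.
detected = [[[0, 3]], [[3, 5]], [[7, 8]]]
linked = link_groups(detected)
```

In this toy run, keyframe 3 appears in both the first and second windows, so the groups `[0, 3]` and `[3, 5]` are merged into one NDK group, while `[7, 8]` stays separate. Comparing `pair_count` on the full corpus against the sum over partitions shows why smaller partitions cut the number of candidate pairs.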