Video shots retrieval using local invariant features

Authors:
Yuanjia Du;Ling Shao
Affiliations:
NXP Semiconductors, Eindhoven, Netherlands;Philips Research Laboratories, Eindhoven, Netherlands
Venue:
IMCE '09 Proceedings of the 1st international workshop on Interactive multimedia for consumer electronics
Year:
2009

Citing 14
Cited 0

A Computational Approach to Edge Detection

IEEE Transactions on Pattern Analysis and Machine Intelligence
The Design and Use of Steerable Filters

IEEE Transactions on Pattern Analysis and Machine Intelligence
Local Grayvalue Invariants for Image Retrieval

IEEE Transactions on Pattern Analysis and Machine Intelligence
Content-Based Image Retrieval at the End of the Early Years

IEEE Transactions on Pattern Analysis and Machine Intelligence
Shape Matching and Object Recognition Using Shape Contexts

IEEE Transactions on Pattern Analysis and Machine Intelligence
Automated Scene Matching in Movies

CIVR '02 Proceedings of the International Conference on Image and Video Retrieval
Video Google: A Text Retrieval Approach to Object Matching in Videos

ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
Scale & Affine Invariant Interest Point Detectors

International Journal of Computer Vision
Distinctive Image Features from Scale-Invariant Keypoints

International Journal of Computer Vision
Evaluation of Features Detectors and Descriptors Based on 3D Objects

ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1 - Volume 01
Robust Fragments-based Tracking using the Integral Histogram

CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 1
On the Use of SIFT Features for Face Authentication

CVPRW '06 Proceedings of the 2006 Conference on Computer Vision and Pattern Recognition Workshop
PCA-SIFT: a more distinctive representation for local image descriptors

CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
Flexible spatial models for grouping local image features

CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, we present an efficient video shots retrieval system based on local feature detection, description and matching. A face tracker is first used to obtain information on faces in different viewpoints. A visual vocabulary is built off-line using an invariant descriptor computed on tracked character face regions in all shots. The vocabulary is refined in two ways to make the retrieval system more efficient. Firstly, the visual vocabulary is minimized by only using facial features selected on face regions which are detected by an accurate face detector. Secondly, three criteria, namely Inverted-Occurrence-Frequency Weights, Average Feature Location Distance and Reliable Nearest-Neighbors, are calculated in advance to make the on-line retrieval procedure more efficient and precise. The proposed system is experimented on the movie "Groundhog Day". The results show that our technique is very effective and efficient on video shots retrieval.