Matching slides to presentation videos using SIFT and scene background matching

  • Authors:
  • Quanfu Fan; Kobus Barnard; Arnon Amir; Alon Efrat; Ming Lin

  • Affiliations:
  • University of Arizona, Tucson, AZ; University of Arizona, Tucson, AZ; IBM Almaden Research Center, San Jose, CA; University of Arizona, Tucson, AZ; University of Arizona, Tucson, AZ

  • Venue:
  • MIR '06: Proceedings of the 8th ACM International Workshop on Multimedia Information Retrieval
  • Year:
  • 2006

Abstract

We present a general approach for automatically matching electronic slides to videos of the corresponding presentations, for use in distance learning and video proceedings of conferences. We handle a large variety of videos with varied frame compositions and color balances, arbitrary slide sequences, and dynamic camera switching, panning, tilting, and zooming. To achieve high accuracy, we develop a two-phase process with unsupervised scene background modelling. In the first phase, scale invariant feature transform (SIFT) keypoints are used to match frames to slides under a constrained projective transformation (constrained homography) estimated with random sample consensus (RANSAC). Successful first-phase matches are then used to automatically build a scene background model. In the second phase, the background model is applied to the remaining unmatched frames to boost matching performance on difficult cases, such as wide-field-of-view camera shots in which the slide occupies only a small portion of the frame. We also show that color correction is helpful when color-related similarity measures are used to identify slides. We provide detailed quantitative experimental results characterizing the effect of each part of our approach. The results show that our approach is robust and achieves high performance in matching slides to a number of videos with different styles.
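
A minimal sketch of the first-phase pipeline described above, using OpenCV: SIFT keypoints are extracted from a slide image and a video frame, matched with a ratio test, and a projective transformation is estimated with RANSAC. This is an illustration under assumptions, not the paper's implementation: it fits a plain (unconstrained) homography rather than the constrained variant, the second-phase background model is omitted, and the function name, file paths, and thresholds (the 0.75 ratio, the 5.0-pixel reprojection error, and `min_inliers`) are hypothetical choices.

```python
import cv2
import numpy as np

def match_slide_to_frame(slide_path, frame_path, min_inliers=15):
    """Return a slide-to-frame homography, or None if no reliable match."""
    slide = cv2.imread(slide_path, cv2.IMREAD_GRAYSCALE)
    frame = cv2.imread(frame_path, cv2.IMREAD_GRAYSCALE)

    # Detect SIFT keypoints and compute descriptors in both images.
    sift = cv2.SIFT_create()
    kp_s, des_s = sift.detectAndCompute(slide, None)
    kp_f, des_f = sift.detectAndCompute(frame, None)
    if des_s is None or des_f is None:
        return None

    # Match descriptors; Lowe's ratio test discards ambiguous matches.
    matcher = cv2.BFMatcher(cv2.NORM_L2)
    knn = matcher.knnMatch(des_s, des_f, k=2)
    good = [m[0] for m in knn
            if len(m) == 2 and m[0].distance < 0.75 * m[1].distance]
    if len(good) < 4:  # a homography needs at least 4 correspondences
        return None

    src = np.float32([kp_s[m.queryIdx].pt for m in good]).reshape(-1, 1, 2)
    dst = np.float32([kp_f[m.trainIdx].pt for m in good]).reshape(-1, 1, 2)

    # Estimate the projective transformation with RANSAC; the inlier count
    # serves as the match confidence, as in the paper's first phase.
    H, mask = cv2.findHomography(src, dst, cv2.RANSAC, 5.0)
    if H is None or int(mask.sum()) < min_inliers:
        return None  # too few inliers: leave the frame for the second phase
    return H
```

Frames rejected here (e.g., wide shots where the slide region is small and yields few keypoint matches) are exactly the cases the paper's second phase revisits with the scene background model.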