Accurate alignment of presentation slides with educational video

  • Authors:
  • Quanfu Fan;Kobus Barnard;Arnon Amir;Alon Efrat

  • Affiliations:
  • Department of Computer Science, University of Arizona, Tucson, AZ and IBM T. J. Watson Research Center, Hawthorne, NY;Department of Computer Science, University of Arizona, Tucson, AZ;IBM Almaden Research Center, San Jose, CA;Department of Computer Science, University of Arizona, Tucson, AZ

  • Venue:
  • ICME'09 Proceedings of the 2009 IEEE international conference on Multimedia and Expo
  • Year:
  • 2009

Quantified Score

Hi-index 0.03

Visualization

Abstract

Spatio-temporal alignment of electronic slides with corresponding presentation video opens up a number of possibilities for making the instructional content more accessible and understandable, such as video quality improvement, better content analysis and novel compression approaches for low bandwidth access. However, these applications need finding accurate transformations between slides and video frames, which is quite challenging in capture settings using pan-tilt-zoom (PTZ) cameras. In this paper we present a nonlinear optimization approach for accurate registration of slide images to video frames. Instead of estimating the projective transformation (i.e., homography) between a single pair of slide and frame images, we solve a set of homographies jointly in a frame sequence that is associated with a given slide. Quantitative evaluation confirms that this substantively improves alignment accuracy.