Summarization of Visual Content in Instructional Videos

Authors:
C. Choudary;Tiecheng Liu
Affiliations:
Univ. of South Carolina, Columbia;-
Venue:
IEEE Transactions on Multimedia
Year:
2007

Citing 0
Cited 6

Automatic creation and evaluation of MPEG-7 compliant summary descriptions for generic audiovisual content

Image Communication
Rushes video summarization using audio-visual information and sequence alignment

TVS '08 Proceedings of the 2nd ACM TRECVid Video Summarization Workshop
A web-based lecture scene playback system using digital picture-card show with structured learning assistance function

International Journal of Web and Grid Services
A business model for mobile commerce applications using multimedia messaging service

International Journal of Business Information Systems
Hierarchical keyframe-based video summarization using QR-decomposition and modified k-means clustering

EURASIP Journal on Advances in Signal Processing
Digital learning video indexing using scene detection

ICHL'11 Proceedings of the 4th international conference on Hybrid learning

Quantified Score

Hi-index	0.00

Visualization

Abstract

In instructional videos of chalk board presentations, the visual content refers to the text and figures written on the boards. Existing methods on video summarization are not effective for this video domain because they are mainly based on low-level image features such as color and edges. In this work, we present a novel approach to summarizing the visual content in instructional videos using middle-level features. We first develop a robust algorithm to extract content text and figures from instructional videos by statistical modelling and clustering. This algorithm addresses the image noise, nonuniformity of the board regions, camera movements, occlusions, and other challenges in the instructional videos that are recorded in real classrooms. Using the extracted text and figures as the middle level features, we retrieve a set of key frames that contain most of the visual content. We further reduce content redundancy and build a mosaicked summary image by matching extracted content based on K-th Hausdorff distance and connected component decomposition. Performance evaluation on four full-length instructional videos shows that our algorithm is highly effective in summarizing instructional video content.