Structuring low-quality videotaped lectures for cross-reference browsing by video text analysis

Authors:
Feng Wang;Chong-Wah Ngo;Ting-Chuen Pong
Affiliations:
Department of Computer Science and Engineering, Hong Kong University of Science and Technology, Hong Kong;Department of Computer Science, City University of Hong Kong, Hong Kong;Department of Computer Science and Engineering, Hong Kong University of Science and Technology, Hong Kong
Venue:
Pattern Recognition
Year:
2008

Abstract

This paper presents a unified approach to analyzing and structuring the content of videotaped lectures for distance learning applications. By structuring lecture videos, we can support topic indexing and semantic querying of multimedia documents captured in traditional classrooms. Our goal in this paper is to automatically construct cross-references between lecture videos and textual documents so as to facilitate the synchronized browsing and presentation of multimedia information. The major issues involved in our approach are topical event detection, video text analysis, and the matching of slide shots to external documents. In topical event detection, a novel transition detector is proposed to rapidly locate slide shot boundaries by computing the changes of text and background regions in videos. For each detected topical event, multiple keyframes are extracted for video text detection, super-resolution reconstruction, binarization, and recognition. A new approach for the reconstruction of high-resolution textboxes, based on linear interpolation and multi-frame integration, is also proposed for effective binarization and recognition. The recognized characters are used to match the video slide shots to external documents based on our proposed title and content similarity measures.
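
To make the final matching step concrete, the following is a minimal sketch of how noisy recognized text might be matched against slide titles. The abstract does not define the title and content similarity measures, so this sketch substitutes a normalized edit distance as a stand-in; it is not the authors' method. The function names (`edit_distance`, `similarity`, `best_match`) and the example titles are hypothetical.

```python
from typing import List, Tuple


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance computed with a two-row dynamic program."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def similarity(ocr_text: str, slide_text: str) -> float:
    """Assumed stand-in measure: 1 - normalized edit distance, in [0, 1]."""
    a, b = ocr_text.lower().strip(), slide_text.lower().strip()
    if not a and not b:
        return 1.0
    return 1.0 - edit_distance(a, b) / max(len(a), len(b))


def best_match(ocr_title: str, slide_titles: List[str]) -> Tuple[int, float]:
    """Return (index, score) of the slide title most similar to the OCR text.

    Raises ValueError if slide_titles is empty.
    """
    scores = [similarity(ocr_title, t) for t in slide_titles]
    idx = max(range(len(scores)), key=scores.__getitem__)
    return idx, scores[idx]


if __name__ == "__main__":
    # Hypothetical slide titles and an OCR result with two misread characters.
    titles = ["Introduction", "Video Text Detection", "Super-Resolution"]
    print(best_match("Vide0 Text Detecti0n", titles))
```

An edit-distance score tolerates the isolated character errors that recognition on low-resolution video typically produces, which is why it is used here for illustration. A real system would likely combine title and body text evidence, as the abstract indicates.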