Automatic Closed Caption Detection and Font Size Differentiation in MPEG Video

Authors:
Duan-Yu Chen;Ming-Ho Hsiao;Suh-Yin Lee
Affiliations:
-;-;-
Venue:
VISUAL '02 Proceedings of the 5th International Conference on Recent Advances in Visual Information Systems
Year:
2002

Citing 8
Cited 1

Automatic Caption Localization in Compressed Video

IEEE Transactions on Pattern Analysis and Machine Intelligence
Caption processing for MPEG video in MC-DCT compressed domain

MULTIMEDIA '00 Proceedings of the eighth ACM international conference on Multimedia
On face detection in the compressed domain

MULTIMEDIA '00 Proceedings of the eighth ACM international conference on Multimedia
Detection of text captions in compressed domain video

MULTIMEDIA '00 Proceedings of the 2000 ACM workshops on Multimedia
MPEG Video Compression Standard

MPEG Video Compression Standard
Text Area Detection from Video Frames

PCM '01 Proceedings of the Second IEEE Pacific Rim Conference on Multimedia: Advances in Multimedia Information Processing
Fast scene change detection using direct feature extraction fromMPEG compressed videos

IEEE Transactions on Multimedia
A highly efficient system for automatic face region detection in MPEG video

IEEE Transactions on Circuits and Systems for Video Technology

Fast rotation-invariant video caption detection based on visual rhythm

CIARP'11 Proceedings of the 16th Iberoamerican Congress conference on Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, a novel approach of automatic closed caption detection and font size differentiation among localized text regions in I-frames of MPEG videos is proposed. The approach consists of five modules: video segmentation, shot selection, caption frame detection, caption localization and font size differentiation. Rather than directly examines scene cut frame by frame, the module of video segmentation first verifies video streams GOP by GOP and then finds out the actual scene boundaries in the frame level. Tennis videos are selected as the case study and the module of shot selection is designed to automatically select specific type of shot for further closed caption detection. The noise of potential captions is filtered out based on its long-term consistency over consecutive frames. While the general closed captions are localized, we select the specific caption that is discriminated utilizing the module of font size differentiation. The detected closed captions can support video structuring, video browsing, high-level video indexing and video content description in MPEG-7. Experimental results show the effectiveness and the feasibility of the proposed scheme.