A Novel Video Caption Detection Approach Using Multi-Frame Integration

  • Authors:
  • Rongrong Wang;Wanjun Jin;Lide Wu

  • Affiliations:
  • Fudan University, Shanghai, China;Fudan University, Shanghai, China;Fudan University, Shanghai, China

  • Venue:
  • ICPR '04 Proceedings of the Pattern Recognition, 17th International Conference on (ICPR'04) Volume 1 - Volume 01
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

Captions in videos often play an important role in video information indexing and retrieval. In this paper, we present a novel video caption detection approach. We first apply a new Multiple Frames Integration (MFI) method to minimize the variation of the background of the image. A time-based minimum (or maximum)pixel value search is employed and Sobel edge map is used to determine the mode of search. Then block-based text detection is performed, i.e. a small window is used to scan the image and classified as text or non-text, using Sobel edges as features. We use a two-level pyramid to detect various text sizes. Finally, we present a new iterative text line decomposition method and accurate text bounding boxes are extracted from candidate text areas. Experimental result shows that the proposed approach achieves a high precision and recall.