A robust caption detecting algorithm on MPEG compressed video

  • Authors:
  • Yaowei Wang;Limin Su;Qixiang Ye

  • Affiliations:
  • Department of Electronic Engineering, Beijing Institute of Technology, Beijing, P.R. China;Information School, Beijing Union University, Beijing, P.R. China;Graduate School of Chinese Academy of Science, Beijing, P.R. China

  • Venue:
  • MCAM'07 Proceedings of the 2007 international conference on Multimedia content analysis and mining
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Captions (or overlay texts) play an important role in video content understanding. In this paper, an algorithm is proposed to detect captions in MPEG compressed video. First, energy features, which are used to find candidate caption blocks, are extracted from DCT coefficients. Second, temporal information is employed to verify these candidate blocks. Then, a new region growing method named "density-based region growing" is proposed to connect these blocks into candidate text regions. Finally, the regions are identified as caption or non-caption by structural information of caption regions. Experiments are conducted on news videos and it is shown that the algorithm is feasible and effective in finding captions.