A spatial-temporal approach for video caption detection and recognition

  • Authors:
  • Xiaoou Tang;Xinbo Gao;Jianzhuang Liu;Hongjiang Zhang

  • Affiliations:
  • Dept. of Inf. Eng., Chinese Univ. of Hong Kong;-;-;-

  • Venue:
  • IEEE Transactions on Neural Networks
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present a video caption detection and recognition system based on a fuzzy-clustering neural network (FCNN) classifier. Using a novel caption-transition detection scheme we locate both spatial and temporal positions of video captions with high precision and efficiency. Then employing several new character segmentation and binarization techniques, we improve the Chinese video-caption recognition accuracy from 13% to 86% on a set of news video captions. As the first attempt on Chinese video-caption recognition, our experiment results are very encouraging.