Text detection, localization, and tracking in compressed video

Authors:
Xueming Qian; Guizhong Liu; Huan Wang; Rui Su
Affiliations:
School of Electronics and Information Engineering, Xi'an Jiaotong University, Xi'an 710049, China (all authors)
Venue:
Signal Processing: Image Communication
Year:
2007


Abstract

Video text plays an important role in semantic-based video analysis, indexing, and retrieval because it is closely related to the content of the video. Text-based video analysis, browsing, and retrieval usually consist of five fundamental steps: video text detection, localization, tracking, segmentation, and recognition. Video sequences are commonly stored in compressed formats, often using MPEG coding techniques. This paper proposes a unified framework for text detection, localization, and tracking in compressed video using discrete cosine transform (DCT) coefficients. A coarse-to-fine detection method finds text blocks based on block DCT texture intensity, where the texture intensity of an 8x8 block of an intra-frame is approximately represented by seven AC coefficients. The candidate text block regions are then verified and refined. Text block regions are localized and tracked using the horizontal and vertical projection profiles of the block texture intensity. Text tracking determines the frames in which each text line appears and disappears. Experimental results show the effectiveness of the proposed methods.
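
The detection and localization steps described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The abstract does not say which seven AC coefficients are used, so the positions in AC_POSITIONS are a hypothetical choice, as are the function names, the single fixed threshold, and the synthetic test frame. The sketch also computes the DCT from pixel data for self-containment, whereas a compressed-domain method would read the coefficients from the MPEG bitstream. Verification, refinement, and tracking are omitted.

```python
import numpy as np

BLOCK = 8
# Hypothetical choice of seven low-order AC positions (vertical, horizontal
# frequency). The abstract does not specify which seven coefficients are used.
AC_POSITIONS = np.array(
    [(0, 1), (0, 2), (0, 3), (1, 0), (2, 0), (3, 0), (1, 1)]
)


def dct_matrix(n=BLOCK):
    """Orthonormal DCT-II basis matrix of size n x n."""
    k = np.arange(n).reshape(-1, 1)
    i = np.arange(n).reshape(1, -1)
    m = np.cos(np.pi * (2 * i + 1) * k / (2 * n)) * np.sqrt(2.0 / n)
    m[0, :] = np.sqrt(1.0 / n)
    return m


def block_texture_intensity(frame):
    """Per-block sum of |AC| over AC_POSITIONS for a 2-D grayscale frame."""
    h8, w8 = frame.shape[0] // BLOCK, frame.shape[1] // BLOCK
    d = dct_matrix()
    blocks = frame[: h8 * BLOCK, : w8 * BLOCK].astype(float)
    # Rearrange into an (h8, w8, 8, 8) stack of blocks.
    blocks = blocks.reshape(h8, BLOCK, w8, BLOCK).transpose(0, 2, 1, 3)
    coeffs = d @ blocks @ d.T  # 2-D DCT of every block at once
    rows, cols = AC_POSITIONS.T
    return np.abs(coeffs[:, :, rows, cols]).sum(axis=-1)


def runs(mask):
    """(start, end) pairs, end exclusive, of consecutive True values."""
    padded = np.concatenate(([False], mask, [False])).astype(int)
    edges = np.flatnonzero(np.diff(padded))
    return list(zip(edges[::2].tolist(), edges[1::2].tolist()))


def localize_text_lines(intensity, threshold):
    """Block-aligned boxes (top, bottom, left, right), ends exclusive.

    Assumes threshold >= 0. Rows are split with the horizontal projection
    profile, then each row band is split with the vertical profile.
    """
    masked = np.where(intensity > threshold, intensity, 0.0)
    boxes = []
    for top, bottom in runs(masked.sum(axis=1) > 0):
        col_profile = masked[top:bottom].sum(axis=0)
        for left, right in runs(col_profile > 0):
            boxes.append((top, bottom, left, right))
    return boxes


def make_demo_frame():
    """Synthetic 64x96 frame: flat gray with one striped 'text' band."""
    frame = np.full((64, 96), 128.0)
    stripes = ((np.arange(96) // 2) % 2) * 255.0
    frame[16:32, 24:72] = stripes[24:72]
    return frame


if __name__ == "__main__":
    intensity = block_texture_intensity(make_demo_frame())
    print(localize_text_lines(intensity, threshold=50.0))
```

The boxes are expressed in block units, so multiplying each coordinate by 8 gives pixel coordinates. A real system would need an adaptive threshold and the verification step mentioned in the abstract to reject textured non-text regions.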