A comprehensive method for multilingual video text detection, localization, and extraction

Authors:
M. R. Lyu; Jiqiang Song; Min Cai
Affiliations:
Dept. of Comput. Sci. & Eng., Chinese Univ. of Hong Kong, China;-;-
Venue:
IEEE Transactions on Circuits and Systems for Video Technology
Year:
2005

Abstract

Text in video is a compact and accurate clue for video indexing and summarization. Most video text detection and extraction methods make assumptions about text color, background contrast, and font style. Moreover, few methods handle multilingual text well, since different languages can differ considerably in appearance. This paper presents a detailed analysis of the characteristics of multilingual text, covering English and Chinese. Based on this analysis, we propose a comprehensive, efficient method for video text detection, localization, and extraction that emphasizes multilingual capability throughout the entire processing pipeline. The proposed method is also robust to varying background complexity and text appearance. Text detection is carried out by edge detection, local thresholding, and hysteresis edge recovery. A coarse-to-fine localization scheme is then applied to identify text regions accurately. Text extraction consists of adaptive thresholding, dam point labeling, and inward filling. Experimental results on a large number of video images, together with comparisons with other methods, are reported in detail.
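
The abstract names the three detection steps (edge detection, local thresholding, hysteresis edge recovery) but gives no operators or parameters. The Python sketch below is one plausible reading of that sequence, not the authors' implementation. The Sobel operator, the per-block mean-plus-deviation rule, and the names and defaults `block`, `k`, `floor`, and `low` are all assumptions made for illustration.

```python
import numpy as np

def sobel_magnitude(gray):
    """Gradient magnitude from 3x3 Sobel kernels (assumed edge operator)."""
    p = np.pad(gray.astype(float), 1, mode="edge")
    gx = (p[:-2, 2:] + 2 * p[1:-1, 2:] + p[2:, 2:]
          - p[:-2, :-2] - 2 * p[1:-1, :-2] - p[2:, :-2])
    gy = (p[2:, :-2] + 2 * p[2:, 1:-1] + p[2:, 2:]
          - p[:-2, :-2] - 2 * p[:-2, 1:-1] - p[:-2, 2:])
    return np.hypot(gx, gy)

def local_threshold(mag, block=16, k=1.0, floor=50.0):
    """Mark strong edges per block: magnitude > max(mean + k*std, floor).

    The block size, k, and floor are hypothetical parameters; the floor
    keeps nearly flat blocks from turning noise into edges.
    """
    strong = np.zeros(mag.shape, dtype=bool)
    for y in range(0, mag.shape[0], block):
        for x in range(0, mag.shape[1], block):
            tile = mag[y:y + block, x:x + block]
            t = max(tile.mean() + k * tile.std(), floor)
            strong[y:y + block, x:x + block] = tile > t
    return strong

def hysteresis_recover(strong, mag, low=25.0):
    """Recover weak edges (magnitude >= low) 8-connected to strong edges."""
    weak = mag >= low
    edges = strong.copy()
    h, w = edges.shape
    while True:
        # Dilate the current edge map by one pixel in all 8 directions.
        p = np.pad(edges, 1, mode="constant")
        grown = np.zeros_like(edges)
        for dy in range(3):
            for dx in range(3):
                grown |= p[dy:dy + h, dx:dx + w]
        new = edges | (grown & weak)
        if np.array_equal(new, edges):
            return edges
        edges = new

def detect_text_edges(gray):
    """Chain the three steps on a 2-D grayscale array."""
    mag = sobel_magnitude(gray)
    return hysteresis_recover(local_threshold(mag), mag)
```

Thresholding per block rather than globally lets low-contrast captions survive next to high-contrast scenery, and the hysteresis pass then reconnects stroke edges that the stricter local threshold fragmented. The localization and extraction stages are not sketched here because the abstract does not describe them in enough detail.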