Localizing and segmenting text in images and videos

  • Authors:
  • R. Lienhart; A. Wernicke

  • Affiliations:
  • Intel Corp., Santa Clara, CA

  • Venue:
  • IEEE Transactions on Circuits and Systems for Video Technology
  • Year:
  • 2002

Abstract

Many images, especially those used for page design on Web pages, as well as videos contain visible text. If these text occurrences could be detected, segmented, and recognized automatically, they would be a valuable source of high-level semantics for indexing and retrieval. We propose a novel method for localizing and segmenting text in complex images and videos. Text lines are identified by using a complex-valued multilayer feed-forward network trained to detect text at a fixed scale and position. The network's output at all scales and positions is integrated into a single text-saliency map, serving as a starting point for candidate text lines. In the case of video, these candidate text lines are refined by exploiting the temporal redundancy of text in video. Localized text lines are then scaled to a fixed height of 100 pixels and segmented into a binary image with black characters on a white background. For videos, temporal redundancy is exploited to improve segmentation performance. Input images and videos can be of any size due to a true multiresolution approach. Moreover, the system is not only able to locate and segment text occurrences into large binary images, but is also able to track each text line with sub-pixel accuracy over the entire occurrence in a video, so that one text bitmap is created for all instances of that text line. Therefore, our text segmentation results can also be used for object-based video encoding such as that enabled by MPEG-4.
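The following is a minimal sketch of the pipeline structure the abstract describes: a fixed-scale detector applied over an image pyramid, fusion of the responses into a single text-saliency map, and rescaling plus binarization of localized text lines. It is not the authors' implementation; the complex-valued feed-forward network is replaced by a placeholder `score_window` function, the window size and pyramid scales are assumed values, and Otsu thresholding stands in for the paper's segmentation stage. OpenCV and NumPy are assumed to be available.

```python
import numpy as np
import cv2

WIN_H, WIN_W = 16, 16      # hypothetical fixed detection scale
TARGET_HEIGHT = 100        # text lines are rescaled to a height of 100 px, as in the abstract

def score_window(patch: np.ndarray) -> float:
    """Placeholder for the fixed-scale text detector (a complex-valued network in the paper)."""
    return float(patch.std() > 40)   # crude proxy: text regions tend to have high local contrast

def text_saliency_map(gray: np.ndarray, scales=(1.0, 0.5, 0.25)) -> np.ndarray:
    """Run the fixed-scale detector over an image pyramid and fuse the
    responses into a single full-resolution text-saliency map."""
    h, w = gray.shape
    saliency = np.zeros((h, w), np.float32)
    for s in scales:
        scaled = cv2.resize(gray, (max(WIN_W, int(w * s)), max(WIN_H, int(h * s))))
        resp = np.zeros(scaled.shape, np.float32)
        # Slide the fixed-size window with 50% overlap and accumulate detector responses.
        for y in range(0, scaled.shape[0] - WIN_H + 1, WIN_H // 2):
            for x in range(0, scaled.shape[1] - WIN_W + 1, WIN_W // 2):
                resp[y:y + WIN_H, x:x + WIN_W] += score_window(
                    scaled[y:y + WIN_H, x:x + WIN_W])
        saliency += cv2.resize(resp, (w, h))   # project the response back to input resolution
    return saliency / len(scales)

def segment_text_line(gray_line: np.ndarray) -> np.ndarray:
    """Rescale a localized text line to the fixed target height and binarize it
    to black characters on a white background (Otsu threshold as a stand-in)."""
    h, w = gray_line.shape
    scale = TARGET_HEIGHT / h
    line = cv2.resize(gray_line, (max(1, int(w * scale)), TARGET_HEIGHT))
    _, binary = cv2.threshold(line, 0, 255, cv2.THRESH_BINARY | cv2.THRESH_OTSU)
    # Ensure dark characters on a light background: invert if most pixels came out dark.
    if binary.mean() < 127:
        binary = 255 - binary
    return binary
```

In the paper, video input additionally allows the candidate text lines and the segmentation to be refined across frames; that temporal-redundancy stage is omitted here for brevity.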