Incremental detection of text on road signs from video with application to a driving assistant system

Authors:
Wen Wu;Xilin Chen;Jie Yang
Affiliations:
Carnegie Mellon University, Pittsburgh, PA;Carnegie Mellon University, Pittsburgh, PA;Carnegie Mellon University, Pittsburgh, PA
Venue:
Proceedings of the 12th annual ACM international conference on Multimedia
Year:
2004

Citing 9
Cited 5

Automatic text recognition for video indexing

MULTIMEDIA '96 Proceedings of the fourth ACM international conference on Multimedia
TextFinder: An Automatic System to Detect and Recognize Text In Images

IEEE Transactions on Pattern Analysis and Machine Intelligence
Video OCR for Digital News Archive

CAIVD '98 Proceedings of the 1998 International Workshop on Content-Based Access of Image and Video Databases (CAIVD '98)
Automatic text detection and removal in video sequences

Pattern Recognition Letters
A road sign recognition system based on dynamic visual model

CVPR'03 Proceedings of the 2003 IEEE computer society conference on Computer vision and pattern recognition
Tracking appearances with occlusions

CVPR'03 Proceedings of the 2003 IEEE computer society conference on Computer vision and pattern recognition
Automatic text detection and tracking in digital video

IEEE Transactions on Image Processing
Automatic detection and recognition of signs from natural scenes

IEEE Transactions on Image Processing
Localizing and segmenting text in images and videos

IEEE Transactions on Circuits and Systems for Video Technology

Multimodal fusion using learned text concepts for image categorization

MULTIMEDIA '06 Proceedings of the 14th annual ACM international conference on Multimedia
Portable translator capable of recognizing characters on signboard and menu captured by built-in camera

ACLdemo '05 Proceedings of the ACL 2005 on Interactive poster and demonstration sessions
Object fingerprints for content analysis with applications to street landmark localization

MM '08 Proceedings of the 16th ACM international conference on Multimedia
NEOCR: a configurable dataset for natural image text recognition

CBDAR'11 Proceedings of the 4th international conference on Camera-Based Document Analysis and Recognition
Scale based region growing for scene text detection

Proceedings of the 21st ACM international conference on Multimedia

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper proposes a fast and robust framework for incrementally detecting text on road signs from natural scene video. The new framework makes two main contributions. First, the framework applies a Divide-and-Conquer strategy to decompose the original task into two sub-tasks, that is, localization of road signs and detection of text. The algorithms for the two sub-tasks are smoothly incorporated into a unified framework through a real time tracking algorithm. Second, the framework provides a novel way for text detection from video by integrating 2D features in each video frame (e.g., color, edges, texture) with 3D information available in a video sequence (e.g., object structure). The feasibility of the proposed framework has been evaluated on the video sequences captured from a moving vehicle. The new framework can be applied to a driving assistant system and other tasks of text detection from video.