Automatic text recognition for video indexing
MULTIMEDIA '96 Proceedings of the fourth ACM international conference on Multimedia
TextFinder: An Automatic System to Detect and Recognize Text In Images
IEEE Transactions on Pattern Analysis and Machine Intelligence
Video OCR for Digital News Archive
CAIVD '98 Proceedings of the 1998 International Workshop on Content-Based Access of Image and Video Databases (CAIVD '98)
Automatic text detection and removal in video sequences
Pattern Recognition Letters
A road sign recognition system based on dynamic visual model
CVPR'03 Proceedings of the 2003 IEEE computer society conference on Computer vision and pattern recognition
Tracking appearances with occlusions
CVPR'03 Proceedings of the 2003 IEEE computer society conference on Computer vision and pattern recognition
Automatic text detection and tracking in digital video
IEEE Transactions on Image Processing
Automatic detection and recognition of signs from natural scenes
IEEE Transactions on Image Processing
Localizing and segmenting text in images and videos
IEEE Transactions on Circuits and Systems for Video Technology
Multimodal fusion using learned text concepts for image categorization
MULTIMEDIA '06 Proceedings of the 14th annual ACM international conference on Multimedia
ACLdemo '05 Proceedings of the ACL 2005 on Interactive poster and demonstration sessions
Object fingerprints for content analysis with applications to street landmark localization
MM '08 Proceedings of the 16th ACM international conference on Multimedia
NEOCR: a configurable dataset for natural image text recognition
CBDAR'11 Proceedings of the 4th international conference on Camera-Based Document Analysis and Recognition
Scale based region growing for scene text detection
Proceedings of the 21st ACM international conference on Multimedia
Hi-index | 0.00 |
This paper proposes a fast and robust framework for incrementally detecting text on road signs from natural scene video. The new framework makes two main contributions. First, the framework applies a Divide-and-Conquer strategy to decompose the original task into two sub-tasks, that is, localization of road signs and detection of text. The algorithms for the two sub-tasks are smoothly incorporated into a unified framework through a real time tracking algorithm. Second, the framework provides a novel way for text detection from video by integrating 2D features in each video frame (e.g., color, edges, texture) with 3D information available in a video sequence (e.g., object structure). The feasibility of the proposed framework has been evaluated on the video sequences captured from a moving vehicle. The new framework can be applied to a driving assistant system and other tasks of text detection from video.