Many images, especially those used for page design on Web pages, and many videos contain visible text. If these text occurrences could be detected, segmented, and recognized automatically, they would be a valuable source of high-level semantics for indexing and retrieval. We propose a novel method for localizing and segmenting text in complex images and videos. Text lines are identified with a complex-valued multilayer feed-forward network trained to detect text at a fixed scale and position. The network's output at all scales and positions is integrated into a single text-saliency map, which serves as the starting point for extracting candidate text lines. For video, these candidates are refined by exploiting the temporal redundancy of text. Localized text lines are then scaled to a fixed height of 100 pixels and segmented into a binary image with black characters on a white background; for video, temporal redundancy is again exploited to improve segmentation performance. Because the approach is truly multiresolution, input images and videos can be of any size. Moreover, the system not only locates and segments text occurrences into large binary images, but also tracks each text line with sub-pixel accuracy over its entire occurrence in a video, so that a single text bitmap is created for all instances of that text line. Our text segmentation results can therefore also be used for object-based video encoding such as that enabled by MPEG-4.
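The integration of per-scale detector outputs into a single text-saliency map can be illustrated with a minimal sketch. It is not the authors' implementation. The abstract does not say how scores are combined, so the summation, the nearest-neighbour upsampling, and the row-band thresholding are all assumptions, and the function names (`upsample_nearest`, `integrate_saliency`, `candidate_row_bands`) are hypothetical. The per-scale score maps stand in for the network's output and are supplied by the caller.

```python
from typing import List, Tuple

Map = List[List[float]]


def upsample_nearest(score_map: Map, height: int, width: int) -> Map:
    """Resize a coarse score map to (height, width) by nearest-neighbour lookup."""
    src_h, src_w = len(score_map), len(score_map[0])
    return [
        [score_map[y * src_h // height][x * src_w // width] for x in range(width)]
        for y in range(height)
    ]


def integrate_saliency(score_maps: List[Map], height: int, width: int) -> Map:
    """Sum per-scale detector scores into one full-resolution saliency map.

    Summation is an assumed combination rule, chosen only for illustration.
    """
    saliency = [[0.0] * width for _ in range(height)]
    for score_map in score_maps:
        resized = upsample_nearest(score_map, height, width)
        for y in range(height):
            for x in range(width):
                saliency[y][x] += resized[y][x]
    return saliency


def candidate_row_bands(saliency: Map, threshold: float) -> List[Tuple[int, int]]:
    """Return inclusive (top, bottom) row ranges whose peak saliency reaches the threshold."""
    bands: List[Tuple[int, int]] = []
    start = None
    for y, row in enumerate(saliency):
        active = max(row) >= threshold
        if active and start is None:
            start = y          # a new band of salient rows begins
        elif not active and start is not None:
            bands.append((start, y - 1))  # the band ended on the previous row
            start = None
    if start is not None:
        bands.append((start, len(saliency) - 1))
    return bands
```

Each band returned by `candidate_row_bands` is only a rough vertical extent for a candidate text line. The refinement, scaling to a fixed height, and binarization steps described in the abstract are outside this sketch.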