A Flexible Vision-Based Algorithm for a Book Sorting System
IEEE Transactions on Pattern Analysis and Machine Intelligence - Special Issue on Industrial Machine Vision and Computer Vision Technology:8MPart
A Theory for Multiresolution Signal Decomposition: The Wavelet Representation
IEEE Transactions on Pattern Analysis and Machine Intelligence
Introduction to statistical pattern recognition (2nd ed.)
Introduction to statistical pattern recognition (2nd ed.)
Text segmentation using Gabor filters for automatic document processing
Machine Vision and Applications - Special issue: document image analysis techniques
Automatic text recognition for video indexing
MULTIMEDIA '96 Proceedings of the fourth ACM international conference on Multimedia
Recognizing Characters in Scene Images
IEEE Transactions on Pattern Analysis and Machine Intelligence
Enhancing Degraded Document Images via Bitmap Clustering and Averaging
ICDAR '97 Proceedings of the 4th International Conference on Document Analysis and Recognition
Automatic Text Extraction from Video for Content-Based Annotation and Retrieval
ICPR '98 Proceedings of the 14th International Conference on Pattern Recognition-Volume 1 - Volume 1
Example Based Learning for View-Based Human Face Detection
Example Based Learning for View-Based Human Face Detection
Automatic Text Location in Images and Video Frames
ICPR '98 Proceedings of the 14th International Conference on Pattern Recognition-Volume 2 - Volume 2
An Automatic Video Text Detection, Localization and Extraction Approach
Advanced Internet Based Systems and Applications
Hi-index | 0.00 |
In this paper we address the problem of text extraction, enhancement and recognition in digital video. Compared with optical character recognition (OCR) from document images, text extraction and recognition in digital video presents several new challenges. First, the text in video is often embedded in complex backgrounds, making text extraction and separation difficult. Second, image data contained in video frames is often digitized and/or subsampled at a much lower resolution than is typical for document images. As a result, most commercial OCR software can not recognize text extracted from video. We have implemented a hybrid wavelet/neural network segmenter to extract text regions and use a two stage enhancement scheme prior to recognition. First, we use Shannon interpolation to raise the image resolution, and second we postprocess the block with normal/inverse text classification and adaptive thresholding. Experimental results show that our text extraction scheme can extract both scene text and graphical text robustly and reasonable OCR results are achieved after enhancement.