Text Recognition of Low-resolution Document Images

Authors:
Charles Jacobs;Patrice Y. Simard;Paul Viola;James Rinker
Affiliations:
Microsoft Research;Microsoft Research;Microsoft Research;Microsoft Research
Venue:
ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition
Year:
2005

Citing 8
Cited 2

Universal approximation of an unknown mapping and its derivatives using multilayer feedforward networks

Neural Networks
Evaluation of Binarization Methods for Document Images

IEEE Transactions on Pattern Analysis and Machine Intelligence
LeRec: a NN/HMM hybrid for on-line handwriting recognition

Neural Computation
Neural Networks for Pattern Recognition

Neural Networks for Pattern Recognition
Progress in Camera-Based Document Image Analysis

ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 1
Fast Lexicon-Based Word Recognition in Noisy Index Card Images

ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 1
Best Practices for Convolutional Neural Networks Applied to Visual Document Analysis

ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 2
CamWorks: A Video-Based Tool for Efficient Capture from Paper Source Documents

ICMCS '99 Proceedings of the 1999 IEEE International Conference on Multimedia Computing and Systems - Volume 02

Recognizing natural scene characters by convolutional neural network and bimodal image enhancement

CBDAR'11 Proceedings of the 4th international conference on Camera-Based Document Analysis and Recognition
Large-lexicon attribute-consistent text recognition in natural images

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part VI

Quantified Score

Hi-index	0.00

Visualization

Abstract

Cheap and versatile cameras make it possible to easily and quickly capture a wide variety of documents. However, low resolution cameras present a challenge to OCR because it is virtually impossible to do character segmentation independently from recognition. In this paper we solve these problems simultaneously by applying methods borrowed from cursive handwriting recognition. To achieve maximum robustness, we use a machine learning approach based on a convolutional neural network. When our system is combined with a language model using dynamic programming, the overall performance is in the vicinity of 80-95% word accuracy on pages captured with a 1024x768 webcam and 10-point text.