High Accuracy Optical Character Recognition Using Neural Networks with Centroid Dithering

Authors:
Hadar I. Avi-Itzhak;Thanh A. Diep;Harry Garland
Affiliations:
-;-;-
Venue:
IEEE Transactions on Pattern Analysis and Machine Intelligence
Year:
1995

Citing 4
Cited 10

On the Recognition of Printed Characters of Any Font and Size

IEEE Transactions on Pattern Analysis and Machine Intelligence
Neural Nets for Adaptive Filtering and Adaptive Pattern Recognition

Computer
Learning internal representations by error propagation

Parallel distributed processing: explorations in the microstructure of cognition, vol. 1
Profiles in document managing

BYTE

Twenty Years of Document Image Analysis in PAMI

IEEE Transactions on Pattern Analysis and Machine Intelligence
A Contour Code Feature Based Segmentation For Handwriting Recognition

ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 2
Hybrid generative/discriminative classifier for unconstrained character recognition

Pattern Recognition Letters - Special issue: Artificial neural networks in pattern recognition
Ottoman archives explorer: A retrieval system for digital Ottoman archives

Journal on Computing and Cultural Heritage (JOCCH)
Dynamic structure-based neural network determination using orthogonal genetic algorithm with quantization

LSMS'07 Proceedings of the Life system modeling and simulation 2007 international conference on Bio-Inspired computational intelligence and applications
Degraded dot matrix character recognition using CSM-based feature extraction

Proceedings of the 10th ACM symposium on Document engineering
A predication survival model for colorectal cancer

AMERICAN-MATH'11/CEA'11 Proceedings of the 2011 American conference on applied mathematics and the 5th WSEAS international conference on Computer engineering and applications
An imaging system for monitoring the in-and-out activity of honey bees

Computers and Electronics in Agriculture
A data acquisition and analysis system for palm leaf documents in Telugu

Proceeding of the workshop on Document Analysis and Recognition
Optical character recognition: A comprehensive study of hybrid methods

International Journal of Knowledge-based and Intelligent Engineering Systems

Quantified Score

Hi-index	0.14

Visualization

Abstract

Optical character recognition (OCR) refers to a process whereby printed documents are transformed into ASCII files for the purpose of compact storage, editing, fast retrieval, and other file manipulations through the use of a computer. The recognition stage of an OCR process is made difficult by added noise, image distortion, and the various character typefaces, sizes, and fonts that a document may have. In this study a neural network approach is introduced to perform high accuracy recognition on multi-size and multi-font characters; a novel centroid-dithering training process with a low noise-sensitivity normalization procedure is used to achieve high accuracy results. The study consists of two parts. The first part focuses on single size and single font characters, and a two-layered neural network is trained to recognize the full set of 94 ASCII character images in 12-pt Courier font. The second part trades accuracy for additional font and size capability, and a larger two-layered neural network is trained to recognize the full set of 94 ASCII character images for all point sizes from 8 to 32 and for 12 commonly used fonts. The performance of these two networks is evaluated based on a database of more than one million character images from the testing data set.