On the Recognition of Printed Characters of Any Font and Size
IEEE Transactions on Pattern Analysis and Machine Intelligence
Optical character recognition by the method of moments
Computer Vision, Graphics, and Image Processing
Visual text recognition through contextual processing
Pattern Recognition
Hybrid Contextural Text Recognition with String Matching
IEEE Transactions on Pattern Analysis and Machine Intelligence
Shape quantization and recognition with randomized trees
Neural Computation
Quantitative modeling of segmental duration
HLT '93 Proceedings of the workshop on Human Language Technology
Hi-index | 0.00 |
An optical character recognition (OCR) system, which uses a multilayer perceptron (MLP) neural network classifier, is described. The neural network classifier has the advantage of being fast (highly parallel), easily trainable, and capable of creating arbitrary partitions of the input feature space. Issues in design of the neural network that we examine include the selection of input features, the choice of network learning and momentum parameters, and the selection of training patterns. We also provide a detailed analysis of the learning parameters to provide insight into the MLP, and to suggest a mechanism to automatically tune these parameters. An OCR neural network classifier was trained to recognize characters from a large number of fonts, thereby approaching an omnifont environment. Samples were selected from over 200 fonts and 50 typical office documents, for a total of 110,000 training patterns. In order to evaluate the performance of the MLP classifier, a comparison is made with a high performance dynamic contour warping (DCW) classifier. The base recognition rate on the test set is 96.7% for the neural network classifier, compared to 95.9% for the DCW classifier.