An OCR based on character shape codes and lexical information

Authors:
A. L. Spitz
Affiliations:
-
Venue:
ICDAR '95 Proceedings of the Third International Conference on Document Analysis and Recognition (Volume 2) - Volume 2
Year:
1995

Citing 0
Cited 5

How to read less and know more: approximate OCR for Thai

Proceedings of the 20th annual international ACM SIGIR conference on Research and development in information retrieval
Document Image Coding for Processing and Retrieval

Journal of VLSI Signal Processing Systems - special issue on multimedia signal processing
Prototype Extraction and Adaptive OCR

IEEE Transactions on Pattern Analysis and Machine Intelligence
Feature string-based intelligent information retrieval from Tamil document images

International Journal of Computer Applications in Technology
An indexed full-text search method of printed document images with an M-tree

RIAO '10 Adaptivity, Personalization and Fusion of Heterogeneous Information

Quantified Score

Hi-index	0.00

Visualization

Abstract

We describe an OCR process which has as its principal attributes high speed of operation and tunability to the lexical content of the documents to which it is applied. This process relies on the transformation of the text image into character shape codes, a rapid and robust process, and on special lexica which contain information on the "shape" of words and the character ambiguities present within particular word shape classifications. We rely on the structure of English (in the current case) and the high percentage of singleton mappings between the shape codes and the characters in the words. Considerable ambiguity is removed by simple lookup in the specially tuned and structured lexicon and substitution on a character-by-character basis. Ambiguity is further reduced by template matching using exemplars derived from surrounding text, taking advantage of the local consistency of font, face and size as well as image quality.