A Robust Algorithm for Text String Separation from Mixed Text/Graphics Images
IEEE Transactions on Pattern Analysis and Machine Intelligence
Digital Image Processing
Extracting Text from WWW Images
ICDAR '97 Proceedings of the 4th International Conference on Document Analysis and Recognition
Hi-index | 0.00 |
The accuracy of the Optical Character Recognition (OCR) systems is highly dependent upon the quality of the image. In this paper, we investigate and propose solutions to several issues that can arise in the processing of binary images of scanned, typeset text. The issues of concern are Image Residues from Adjacent Lines, Character Touching, Boldface Character Recognition, and Text Repairing.