Morphological preprocessing method to thresholding degraded word images

Authors:
Shigueo Nomura;Keiji Yamanaka;Takayuki Shiose;Hiroshi Kawakami;Osamu Katai
Affiliations:
Department of Systems Science, Graduate School of Informatics, Kyoto University, Yoshida Honmachi, Sakyo-ku, Kyoto 606-8501, Japan;Faculty of Electrical Engineering, Federal University of Uberlândia, Uberlândia 38400-902, Brazil;Department of Systems Science, Graduate School of Informatics, Kyoto University, Yoshida Honmachi, Sakyo-ku, Kyoto 606-8501, Japan;Department of Systems Science, Graduate School of Informatics, Kyoto University, Yoshida Honmachi, Sakyo-ku, Kyoto 606-8501, Japan;Department of Systems Science, Graduate School of Informatics, Kyoto University, Yoshida Honmachi, Sakyo-ku, Kyoto 606-8501, Japan
Venue:
Pattern Recognition Letters
Year:
2009

Abstract

This paper presents a novel preprocessing method based on mathematical morphology that improves the subsequent thresholding of raw degraded word images. These images contain undesirable background shapes, called critical shadows, that produce noise in the binary images, and this noise obstructs the later segmentation of characters. Applying a thresholding method directly therefore yields inadequate binary versions of the degraded word images. Our preprocessing method, called Shadow Location and Lightening (SL*L), locates these critical shadows on grayscale degraded images using morphological operations, adaptively, accurately, and without manual fine-tuning of parameters, and lightens them before thresholding is applied. In this way, enhanced binary images free of unpredictable and inappropriate noise are passed to the subsequent character segmentation step. Adequate binary characters can then be segmented and extracted as input to optical character recognition (OCR) applications, saving computational effort and increasing the recognition rate. The proposed method is tested experimentally on a set of raw degraded images extracted from real photos acquired by unsophisticated imaging systems. A qualitative analysis of the experimental results shows that the proposed preprocessing significantly improves thresholding quality. In addition, a quantitative evaluation on a test set of 1194 degraded word images shows that the proposed preprocessing is both necessary and effective for increasing the segmentation and recognition rates of their characters. A further advantage is that Otsu's method, a simple and easily implemented global thresholding technique, is then sufficient, which reduces the computational load.
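
The abstract does not spell out the SL*L steps, so the following is only a generic sketch of the pipeline it describes (locate background shadows with a morphological operation, lighten them, then apply Otsu's global threshold), not the authors' algorithm. It assumes 8-bit grayscale input with dark text on a lighter background. The function names, the use of a grayscale closing as the shadow locator, the additive lightening rule, and the 15-pixel window are all illustrative assumptions.

```python
import numpy as np


def _window_filter(img, size, reducer):
    """Max or min filter over a square size x size window (size must be odd)."""
    pad = size // 2
    padded = np.pad(img, pad, mode="edge")
    windows = np.lib.stride_tricks.sliding_window_view(padded, (size, size))
    return reducer(windows, axis=(-2, -1))


def grey_closing(img, size):
    """Grayscale closing: dilation (max) followed by erosion (min)."""
    return _window_filter(_window_filter(img, size, np.max), size, np.min)


def lighten_shadows(img, size=15):
    """Hypothetical shadow lightening, NOT the published SL*L procedure.

    A closing with a window wider than the text strokes erases the dark
    strokes and leaves a background estimate in which shadows stay dark.
    Each pixel is then raised by how far its local background falls
    below the brightest background level.
    """
    background = grey_closing(img, size)
    deficit = background.max() - background  # zero outside shadows
    lifted = img.astype(np.int32) + deficit
    return np.clip(lifted, 0, 255).astype(np.uint8)


def otsu_threshold(img):
    """Otsu's method: the level t that maximizes between-class variance."""
    hist = np.bincount(img.ravel(), minlength=256).astype(np.float64)
    w0 = np.cumsum(hist)                      # pixels with value <= t
    w1 = w0[-1] - w0                          # pixels with value > t
    s0 = np.cumsum(hist * np.arange(256))     # intensity sum for values <= t
    valid = (w0 > 0) & (w1 > 0)               # both classes non-empty
    sigma_b = np.zeros(256)
    mean0 = s0[valid] / w0[valid]
    mean1 = (s0[-1] - s0[valid]) / w1[valid]
    sigma_b[valid] = w0[valid] * w1[valid] * (mean0 - mean1) ** 2
    return int(np.argmax(sigma_b))


def binarize(img):
    """Boolean mask that is True for dark (foreground) pixels."""
    return img <= otsu_threshold(img)


if __name__ == "__main__":
    # Synthetic example: light background, a shadow band, one text stroke.
    image = np.full((40, 60), 200, dtype=np.uint8)
    image[:, 40:] = 100    # background shadow
    image[:, 10:13] = 30   # dark stroke, 3 pixels wide
    direct = binarize(image)
    cleaned = binarize(lighten_shadows(image, size=15))
    print("foreground pixels without preprocessing:", int(direct.sum()))
    print("foreground pixels with preprocessing:", int(cleaned.sum()))
```

On the synthetic image, thresholding the raw input labels the whole shadow band as foreground, while the lightened input leaves only the stroke. The additive rule reduces the contrast of text lying inside a shadow; dividing by the background estimate is a common alternative in that case.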