In this paper, a new technique is proposed for global and local skew detection in complex color documents. The proposed technique, which can also be applied to grayscale and binary documents, consists of four main stages: color reduction, text localization, document binarization, and skew correction. Color reduction limits the initial number of colors to a small number, usually fewer than 10, so that the original document is decomposed into homogeneous regions. Text localization first divides the document into a number of binary planes (color planes) equal to the number of reduced colors. Connected component analysis is then performed, and text is extracted according to similarity features between adjacent connected components. In the third stage, the binary document is composed from the processed binary planes. Finally, skew correction is achieved by detecting the direction of connection of the connected components in the binary images. The proposed technique has been extensively tested on two databases, one of scanned complex color book covers and another of scanned grayscale newspapers taken from the University of Oulu database. Experimental results are provided and demonstrate the effectiveness of the proposed technique.
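
To make the plane decomposition and the final skew step more concrete, the following is a minimal Python sketch and not the paper's implementation. It assumes the color-reduced image is already available as a 2D list of color indices. The function names (`split_into_planes`, `component_centroids`, `estimate_skew_degrees`) are hypothetical, and the median nearest-neighbor angle is a simplified stand-in for the paper's "direction of connection" criterion, whose details the abstract does not give.

```python
import math
from collections import deque
from statistics import median


def split_into_planes(indexed, n_colors):
    """Return one binary plane per reduced color (1 where the pixel has that color)."""
    return [[[1 if v == c else 0 for v in row] for row in indexed]
            for c in range(n_colors)]


def component_centroids(plane):
    """Centroids (x, y) of 8-connected foreground components, found by BFS."""
    h, w = len(plane), len(plane[0])
    seen = [[False] * w for _ in range(h)]
    centroids = []
    for y0 in range(h):
        for x0 in range(w):
            if plane[y0][x0] != 1 or seen[y0][x0]:
                continue
            seen[y0][x0] = True
            queue, pixels = deque([(x0, y0)]), []
            while queue:
                x, y = queue.popleft()
                pixels.append((x, y))
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        nx, ny = x + dx, y + dy
                        if (0 <= nx < w and 0 <= ny < h
                                and plane[ny][nx] == 1 and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((nx, ny))
            cx = sum(p[0] for p in pixels) / len(pixels)
            cy = sum(p[1] for p in pixels) / len(pixels)
            centroids.append((cx, cy))
    return centroids


def estimate_skew_degrees(centroids):
    """Median angle from each centroid to its nearest neighbor, folded into (-90, 90].

    Assumption: this simplified rule stands in for the paper's criterion.
    Image coordinates are used, so y grows downward.
    """
    angles = []
    for i, (x, y) in enumerate(centroids):
        others = [c for j, c in enumerate(centroids) if j != i]
        if not others:
            continue
        nx, ny = min(others, key=lambda c: (c[0] - x) ** 2 + (c[1] - y) ** 2)
        angle = math.degrees(math.atan2(ny - y, nx - x))
        # Opposite directions along the same text line map to one angle.
        if angle > 90:
            angle -= 180
        elif angle <= -90:
            angle += 180
        angles.append(angle)
    return median(angles) if angles else 0.0
```

In this sketch each binary plane would be processed independently, for example `estimate_skew_degrees(component_centroids(plane))`, which yields one estimate per plane. A real system would first filter components by text-likeness, as the text localization stage describes, before estimating any angle.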