Skew angle detection is considered for scanned documents containing the most popular Indian scripts (Devnagari and Bangla). Most characters in these scripts have a horizontal line at the top, called the head line. Within a word, the head lines of adjacent characters mostly join one another, so the word appears as a single connected component. In the proposed method, the components are first labeled. The upper envelope of each component is then found by scanning column by column from an imaginary line above the component. Portions of the upper envelope that satisfy the properties of a digital straight line are detected and clustered so that each cluster belongs to a single text line. The estimates from the individual clusters are combined to obtain the skew angle. Apart from accuracy and efficiency, an advantage of the method is that character segmentation and zone detection can be done readily from the head line information, which is useful in Optical Character Recognition approaches for these scripts.
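The envelope-and-line idea described above can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a single component given as a binary image (a list of rows of 0/1 values), and it skips component labeling and text-line clustering. It also swaps the digital straight line test for a simpler least-squares residual check. All function names and the `tol` and `min_len` parameters are hypothetical. Angles are reported in image coordinates (rows grow downward), so a positive angle means the head line descends from left to right.

```python
import math


def upper_envelope(image):
    """For each column, return the row of the topmost foreground pixel, or None."""
    height, width = len(image), len(image[0])
    envelope = []
    for col in range(width):
        top = None
        for row in range(height):
            if image[row][col]:
                top = row
                break
        envelope.append(top)
    return envelope


def fit_line(points):
    """Least-squares fit of row = slope * col + intercept over (col, row) points."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    slope = sxy / sxx if sxx else 0.0
    return slope, mean_y - slope * mean_x


def max_residual(points):
    """Largest vertical distance between the points and their fitted line."""
    slope, intercept = fit_line(points)
    return max(abs(y - (slope * x + intercept)) for x, y in points)


def straight_runs(envelope, tol=1.0, min_len=10):
    """Greedily split the envelope into runs that stay within tol of a fitted line.

    Assumption: this residual test stands in for the digital straight line
    properties mentioned in the abstract.
    """
    runs, current = [], []
    for col, row in enumerate(envelope):
        if row is not None:
            candidate = current + [(col, row)]
            if max_residual(candidate) <= tol:
                current = candidate
                continue
        # The run is broken by a gap or by a point that bends the line.
        if len(current) >= min_len:
            runs.append(current)
        current = [(col, row)] if row is not None else []
    if len(current) >= min_len:
        runs.append(current)
    return runs


def estimate_skew(image, tol=1.0, min_len=10):
    """Length-weighted mean of the run angles in degrees, or None if no run qualifies."""
    runs = straight_runs(upper_envelope(image), tol, min_len)
    if not runs:
        return None
    total = sum(len(run) for run in runs)
    weighted = sum(math.degrees(math.atan(fit_line(run)[0])) * len(run) for run in runs)
    return weighted / total
```

Weighting each run by its length is one plausible way to combine estimates; a median over runs would be more robust when a few runs come from character strokes rather than head lines.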