Page segmentation and classification utilising a bottom-up approach

Authors:
D. Drivas;A. Amin
Affiliations:
-;-
Venue:
ICDAR '95 Proceedings of the Third International Conference on Document Analysis and Recognition (Volume 2)
Year:
1995

Citing 0
Cited 10

Parameter-Free Geometric Document Layout Analysis
IEEE Transactions on Pattern Analysis and Machine Intelligence

Newspaper document analysis featuring connected line segmentation
VIP '01 Proceedings of the Pan-Sydney area workshop on Visual information processing - Volume 11

Visual signature based identification of Low-resolution document images
Proceedings of the 2004 ACM symposium on Document engineering

Visual similarity based document layout analysis
Journal of Computer Science and Technology - Special section on China AVS standard

The fast scheme for document page segmentation in OCR using window and optimum image
CIMMACS'06 Proceedings of the 5th WSEAS International Conference on Computational Intelligence, Man-Machine Systems and Cybernetics

Decomposing document images by heuristic search
EMMCVPR'07 Proceedings of the 6th international conference on Energy minimization methods in computer vision and pattern recognition

Region analysis of business card images acquired in PDA using DCT and information pixel density
ACIVS'05 Proceedings of the 7th international conference on Advanced Concepts for Intelligent Vision Systems

A bottom-up OCR system for mathematical formulas recognition
ICIC'06 Proceedings of the 2006 international conference on Intelligent Computing - Volume Part I

Applying preattentive visual guidance in document image analysis
IWICPAS'06 Proceedings of the 2006 Advances in Machine Vision, Image Processing, and Pattern Analysis international conference on Intelligent Computing in Pattern Analysis/Synthesis

A new pyramidal approach for the address block location based on hierarchical graph coloring
ICIAR'07 Proceedings of the 4th international conference on Image Analysis and Recognition

Abstract

This paper presents an approach based on analysing the connected components extracted from the binary image of a document page. This analysis yields the information needed to perform skew correction, segmentation and classification of the document. We present a new algorithm for determining the skew angle of lines of text in a document image, with the advantage that it needs only one iteration to determine the angle. Experiments on over 30 pages show that the method works well on a wide variety of layouts, including sparse textual regions, mixed fonts, multiple columns, and even documents with a high graphical content.
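
The abstract does not give the algorithm itself, so the following is only an illustrative sketch of the general idea: extract connected components from a binary image, then estimate a skew angle from them in a single pass. It is not the authors' method. The function names `connected_components` and `estimate_skew_degrees` are hypothetical, and the least-squares fit through component centres, the 8-connectivity, and the single-text-line input are all assumptions made for illustration.

```python
import math
from collections import deque


def connected_components(image):
    """Find 8-connected foreground (value 1) regions in a binary image.

    `image` is a list of equal-length rows of 0/1 values. Returns one
    bounding box per component as (min_row, min_col, max_row, max_col).
    """
    rows, cols = len(image), len(image[0])
    seen = [[False] * cols for _ in range(rows)]
    boxes = []
    for r in range(rows):
        for c in range(cols):
            if image[r][c] != 1 or seen[r][c]:
                continue
            # Breadth-first flood fill from this unvisited foreground pixel.
            seen[r][c] = True
            queue = deque([(r, c)])
            r0, c0, r1, c1 = r, c, r, c
            while queue:
                y, x = queue.popleft()
                r0, c0 = min(r0, y), min(c0, x)
                r1, c1 = max(r1, y), max(c1, x)
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < rows and 0 <= nx < cols
                                and image[ny][nx] == 1 and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            boxes.append((r0, c0, r1, c1))
    return boxes


def estimate_skew_degrees(boxes):
    """Estimate skew from one least-squares line through the box centres.

    Assumption (not from the paper): the boxes belong to a single text line.
    Rows grow downwards, so a positive angle means the line falls to the right.
    """
    xs = [(c0 + c1) / 2 for _, c0, _, c1 in boxes]
    ys = [(r0 + r1) / 2 for r0, _, r1, _ in boxes]
    n = len(boxes)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    if sxx == 0:
        return 0.0  # all centres share one column; no horizontal extent to fit
    return math.degrees(math.atan2(sxy, sxx))
```

In use, the bounding boxes from `connected_components` would be passed straight to `estimate_skew_degrees`. A real page would first need its components grouped into text lines, which this sketch leaves out.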