A Theory for Multiresolution Signal Decomposition: The Wavelet Representation
IEEE Transactions on Pattern Analysis and Machine Intelligence
Text segmentation using Gabor filters for automatic document processing
Machine Vision and Applications - Special issue: document image analysis techniques
A Fast Algorithm for Bottom-Up Document Layout Analysis
IEEE Transactions on Pattern Analysis and Machine Intelligence
Multiresolution Analysis in Extraction of Reference Lines from Documents with Gray Level Background
IEEE Transactions on Pattern Analysis and Machine Intelligence
Document Representation and Its Application to Page Decomposition
IEEE Transactions on Pattern Analysis and Machine Intelligence
Page segmentation using the description of the background
Computer Vision and Image Understanding - Special issue on document image understanding and retrieval
Multiscale Segmentation of Unstructured Document Pages Using Soft Decision Integration
IEEE Transactions on Pattern Analysis and Machine Intelligence
Document page decomposition by the bounding-box project
ICDAR '95 Proceedings of the Third International Conference on Document Analysis and Recognition (Volume 2) - Volume 2
Page segmentation and classification utilising a bottom-up approach
ICDAR '95 Proceedings of the Third International Conference on Document Analysis and Recognition (Volume 2) - Volume 2
Recursive X-Y cut using bounding boxes of connected components
ICDAR '95 Proceedings of the Third International Conference on Document Analysis and Recognition (Volume 2) - Volume 2
Page segmentation using texture analysis
Pattern Recognition
Neural-Based Classification of Blocks from Documents
ICANN '02 Proceedings of the International Conference on Artificial Neural Networks
Machine Printed Text and Handwriting Identification in Noisy Document Images
IEEE Transactions on Pattern Analysis and Machine Intelligence
Structuralizing digital ink for efficient selection
Proceedings of the 11th international conference on Intelligent user interfaces
Visual similarity based document layout analysis
Journal of Computer Science and Technology - Special section on China AVS standard
Text block geometric shape analysis
Proceedings of the 2006 ACM symposium on Document engineering
Device parts retrieval from assembly drawings with SVM based active relevance feedback
Proceedings of the 6th ACM international conference on Image and video retrieval
Retrieval of document images based on page layout similarity
AMR'06 Proceedings of the 4th international conference on Adaptive multimedia retrieval: user, context, and feedback
An intelligent method to extract characters in color document with highlight regions
IEA/AIE'11 Proceedings of the 24th international conference on Industrial engineering and other applications of applied intelligent systems conference on Modern approaches in applied intelligence - Volume Part II
Region analysis of business card images acquired in PDA using DCT and information pixel density
ACIVS'05 Proceedings of the 7th international conference on Advanced Concepts for Intelligent Vision Systems
SmartDCap: semi-automatic capture of higher quality document images from a smartphone
Proceedings of the 2013 international conference on Intelligent user interfaces
Hi-index | 0.14 |
Automatic transformation of paper documents into electronic documents requires geometric document layout analysis at the first stage. However, variations in character font sizes, text line spacing, and document layout structures have made it difficult to design a general-purpose document layout analysis algorithm for many years. The use of some parameters has therefore been unavoidable in previous methods. In this paper, we propose a parameter-free method for segmenting the document images into maximal homogeneous regions and identifying them as texts, images, tables, and ruling lines. A pyramidal quadtree structure is constructed for multiscale analysis and a periodicity measure is suggested to find a periodical attribute of text regions for page segmentation. To obtain robust page segmentation results, a confirmation procedure using texture analysis is applied to only ambiguous regions. Based on the proposed periodicity measure, multiscale analysis, and confirmation procedure, we could develop a robust method for geometric document layout analysis independent of character font sizes, text line spacing, and document layout structures. The proposed method was experimented with the document database from the University of Washington and the MediaTeam Document Database. The results of these tests have shown that the proposed method provides more accurate results than the previous ones.