A Robust Algorithm for Text String Separation from Mixed Text/Graphics Images
IEEE Transactions on Pattern Analysis and Machine Intelligence
Two complementary techniques for digitized document analysis
DOCPROCS '88 Proceedings of the ACM conference on Document processing systems
Classification of newspaper image blocks using texture analysis
Computer Vision, Graphics, and Image Processing
Text segmentation using Gabor filters for automatic document processing
Machine Vision and Applications - Special issue: document image analysis techniques
Page segmentation and classification
CVGIP: Graphical Models and Image Processing
Automated document segmentation
Pattern Recognition Letters
Document image analysis
Segmentation and classification of mixed text/graphics/image documents
Pattern Recognition Letters
A document recognition system and its applications
IBM Journal of Research and Development
Document Representation and Its Application to Page Decomposition
IEEE Transactions on Pattern Analysis and Machine Intelligence
Page segmentation using the description of the background
Computer Vision and Image Understanding - Special issue on document image understanding and retrieval
Twenty Years of Document Image Analysis in PAMI
IEEE Transactions on Pattern Analysis and Machine Intelligence
Syntactic Segmentation and Labeling of Digitized Pages from Technical Journals
IEEE Transactions on Pattern Analysis and Machine Intelligence
The Document Spectrum for Page Layout Analysis
IEEE Transactions on Pattern Analysis and Machine Intelligence
Predictive Coding for Document Layout Characterization
DIA '97 Proceedings of the 1997 Workshop on Document Image Analysis
Document page decomposition by the bounding-box project
ICDAR '95 Proceedings of the Third International Conference on Document Analysis and Recognition (Volume 2) - Volume 2
Analysis of Synthetic Document Images
ICDAR '99 Proceedings of the Fifth International Conference on Document Analysis and Recognition
Structured Document Segmentation and Representation by the Modified X-Y tree
ICDAR '99 Proceedings of the Fifth International Conference on Document Analysis and Recognition
A knowledge-based approach to the layout analysis
ICDAR '95 Proceedings of the Third International Conference on Document Analysis and Recognition (Volume 1) - Volume 1
ODIL: an SGML description language of the layout structure of documents
ICDAR '95 Proceedings of the Third International Conference on Document Analysis and Recognition (Volume 1) - Volume 1
Extraction of text areas in printed document images
DocEng '01 Proceedings of the 2001 ACM Symposium on Document engineering
Making Documents Work: Challenges for Document Understanding
ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 2
Mathematical Formulas Extraction
ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 2
Text - Image Separation in Devanagari Documents
ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 2
Structure analysis and generation for internet documents
Intelligent exploration of the web
Logical Structure Analysis and Generation for Structured Documents: A Syntactic Approach
IEEE Transactions on Knowledge and Data Engineering
Thick 2D relations for document understanding
Information Sciences—Informatics and Computer Science: An International Journal
Document digitization lifecycle for complex magazine collection
Proceedings of the 2005 ACM symposium on Document engineering
Robust and Accurate Vectorization of Line Drawings
IEEE Transactions on Pattern Analysis and Machine Intelligence
Document zone content classification and its performance evaluation
Pattern Recognition
ICAISC'10 Proceedings of the 10th international conference on Artifical intelligence and soft computing: Part II
An intelligent method to extract characters in color document with highlight regions
IEA/AIE'11 Proceedings of the 24th international conference on Industrial engineering and other applications of applied intelligent systems conference on Modern approaches in applied intelligence - Volume Part II
Expert Systems with Applications: An International Journal
Hi-index | 0.15 |
Geometric structure analysis is a prerequisite to create electronic documents from logical components extracted from document images. This paper presents a knowledge-based method for sophisticated geometric structure analysis of technical journal pages. The proposed knowledge base encodes geometric characteristics that are not only common in technical journals but also publication-specific in the form of rules. The method takes the hybrid of top-down and bottom-up techniques and consists of two phases: region segmentation and identification. Generally, the result of the segmentation process does not have a one-to-one matching with composite layout components. Therefore, the proposed method identifies nontext objects, such as images, drawings, and tables, as well as text objects, such as text lines and equations, by splitting or grouping segmented regions into composite layout components. Experimental results with 372 images scanned from the IEEE Transactions on Pattern Analysis and Machine Intelligence show that the proposed method has performed geometric structure analysis successfully on more than 99 percent of the test images, resulting in impressive performance compared with previous works.