When a graphic artist designs a page, they envision a set of text blocks of arbitrary shapes, constrained by the page size and by image and graphics blocks with wrap-around properties. We call this the intended shape. What is seen on an actual page depends on the particular text content and on typographical constraints such as natural line breaking and justification. We call this the apparent shape. Our goal is to create document templates by extracting the text blocks' intended shapes from their apparent shapes. The main difficulty is that when line endings are ragged, the intended block shape is obscured. We solve this problem by analyzing the layout relations among all blocks on a page and applying an iterative process to find the maximum-likelihood estimate of the intended shapes.
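To make the distinction between apparent and intended shape concrete, here is a minimal Python sketch. It is not the iterative maximum-likelihood procedure described above. It only illustrates the simplest possible hypothesis for a single rectangular, left-aligned block: that the intended edges are the extreme edges reached by any line. The `(left, right)` line representation, the function names, and the sample coordinates are all assumptions made for illustration.

```python
from typing import List, Tuple

# Hypothetical representation: each text line is its (left, right) x-extent in points.
Line = Tuple[float, float]


def intended_extent(lines: List[Line]) -> Tuple[float, float]:
    """Naive estimate of a rectangular block's intended left and right edges.

    Assumes the longest line reaches the intended edge, which is only a
    baseline hypothesis and ignores neighboring blocks entirely.
    """
    if not lines:
        raise ValueError("need at least one line")
    left = min(l for l, _ in lines)
    right = max(r for _, r in lines)
    return left, right


def right_gaps(lines: List[Line], right_edge: float) -> List[float]:
    """Distance from each line's end to the estimated right edge (the 'rag')."""
    return [right_edge - r for _, r in lines]


# Apparent shape of a ragged-right paragraph; the last line is short.
apparent = [(72.0, 290.5), (72.0, 301.25), (72.0, 284.0), (72.0, 150.5)]
left, right = intended_extent(apparent)
gaps = right_gaps(apparent, right)
print(left, right)
print(gaps)
```

With ragged text, no line may actually reach the intended edge, so this baseline can underestimate the block width. That is the gap the abstract's approach addresses by also using the layout relations among all blocks on the page.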