Simple fast algorithms for the editing distance between trees and related problems
SIAM Journal on Computing
Error tolerant document structure analysis
IEEE ADL '97 Proceedings of the IEEE international forum on Research and technology advances in digital libraries
Twenty Years of Document Image Analysis in PAMI
IEEE Transactions on Pattern Analysis and Machine Intelligence
Geometric Structure Analysis of Document Images: A Knowledge-Based Approach
IEEE Transactions on Pattern Analysis and Machine Intelligence
Syntactic Segmentation and Labeling of Digitized Pages from Technical Journals
IEEE Transactions on Pattern Analysis and Machine Intelligence
Logical Structure Analysis of Book Document Images Using Contents Information
ICDAR '97 Proceedings of the 4th International Conference on Document Analysis and Recognition
Distributed Knowledge-Based Parsing for Document Analysis and Understanding
ADL '99 Proceedings of the IEEE Forum on Research and Technology Advances in Digital Libraries
Analysis of Synthetic Document Images
ICDAR '99 Proceedings of the Fifth International Conference on Document Analysis and Recognition
ICDAR '95 Proceedings of the Third International Conference on Document Analysis and Recognition (Volume 1) - Volume 1
Automatic discovery of logical document structure
Automatic discovery of logical document structure
Geometric algorithms and experiments for automated document structuring
Mathematical and Computer Modelling: An International Journal
Hi-index | 0.00 |
This paper presents a syntactic method for logical structure analysis and generation for creation of Web documents. The method transforms document images with multiple pages and hierarchical structure into an XML document. To produce a logical structure more accurately and quickly than previous works of which the basic units are text lines, the proposed method takes text regions with hierarchical structure as input. Furthermore, we define a document model that is able to describe geometric characteristics and logical structure information of document class efficiently. Experimental results with 372 images scanned from the technical journal show that the method has performed logical structure analysis successfully. Particularly, the method generates XML documents as the result of structural analysis, so that it enhances the reusability of documents and independence of platform.