In this paper, we present a hybrid approach to document analysis: the document image is first segmented by means of a top-down technique, and the resulting basic blocks are then grouped bottom-up to form complex layout components. This latter process, called layout analysis, exploits only generic knowledge of typesetting conventions. Such knowledge is independent of the particular class of documents being processed and proves valuable for a wide range of documents. Preliminary results of the layout analysis system LEX (Layout EXpert) show the methodological validity of this approach.
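The bottom-up grouping step can be illustrated with a minimal sketch. It is not the LEX implementation: the `Block` type, the `max_gap` threshold, and the single merging rule (blocks that overlap horizontally and are separated by a small vertical gap belong to the same component) are assumptions chosen only to show how a class-independent typesetting convention might drive the grouping of basic blocks produced by a prior top-down segmentation.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Block:
    """Axis-aligned bounding box of a basic block (hypothetical type)."""
    x0: int
    y0: int
    x1: int
    y1: int


def h_overlap(a: Block, b: Block) -> int:
    # Width of the shared horizontal extent; <= 0 means no overlap.
    return min(a.x1, b.x1) - max(a.x0, b.x0)


def v_gap(a: Block, b: Block) -> int:
    # Vertical white space between the two boxes; negative if they overlap.
    return max(a.y0, b.y0) - min(a.y1, b.y1)


def should_merge(a: Block, b: Block, max_gap: int) -> bool:
    # Assumed generic rule: vertically stacked, closely spaced blocks
    # form one layout component (for example, lines of a paragraph).
    return h_overlap(a, b) > 0 and v_gap(a, b) <= max_gap


def merge(a: Block, b: Block) -> Block:
    # Bounding box that encloses both blocks.
    return Block(min(a.x0, b.x0), min(a.y0, b.y0),
                 max(a.x1, b.x1), max(a.y1, b.y1))


def group_blocks(blocks: List[Block], max_gap: int = 10) -> List[Block]:
    """Merge blocks pairwise until no pair satisfies the rule."""
    groups = list(blocks)
    changed = True
    while changed:
        changed = False
        for i in range(len(groups)):
            for j in range(i + 1, len(groups)):
                if should_merge(groups[i], groups[j], max_gap):
                    merged = merge(groups[i], groups[j])
                    groups = [g for k, g in enumerate(groups)
                              if k not in (i, j)]
                    groups.append(merged)
                    changed = True
                    break
            if changed:
                break
    # Return components in reading order: top to bottom, then left to right.
    return sorted(groups, key=lambda g: (g.y0, g.x0))


if __name__ == "__main__":
    basic_blocks = [
        Block(0, 0, 100, 20),     # three closely spaced lines
        Block(0, 25, 100, 45),
        Block(0, 50, 90, 70),
        Block(0, 200, 100, 220),  # distant block in the same column
        Block(150, 0, 250, 20),   # block in a second column
    ]
    for component in group_blocks(basic_blocks):
        print(component)
```

In this sketch the three closely spaced blocks collapse into one component, while the distant block and the block in the second column stay separate. A real system would combine several such conventions (alignment, spacing, font size) instead of relying on a single threshold.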