Digital image processing (2nd ed.)
Digital image processing (2nd ed.)
The Document Spectrum for Page Layout Analysis
IEEE Transactions on Pattern Analysis and Machine Intelligence
Page Segmentation of Chinese Newspaper
PRIS '01 Proceedings of the 1st International Workshop on Pattern Recognition in Information Systems: In conjunction with ICEIS 2001
Adaptive Layout Analysis of Document Images
ISMIS '02 Proceedings of the 13th International Symposium on Foundations of Intelligent Systems
A Complete Pyramidal Geometrical Scheme for Text Based Image Description and Retrieval
ICISP '08 Proceedings of the 3rd international conference on Image and Signal Processing
Hi-index | 0.00 |
Based on the study of the specificity of historical printed books and on the main error sources of classical methods of page layout analysis, this paper presents a new way to achieve an indexation of ancient printed documents. We have developed an approach based on the extraction and the quantification of the various orientations that are present in printed document images. The documents are initially splitted into homogenous areas in which we analyze significant orientations with a directional rose. Each kind of information (textual or graphical) is typically identified and labelled according to its orientation distribution. This choice of characterization allows us to separate textual regions from graphical ones by minimizing the a priori knowledge. The evaluation of our proposition lies on a document image retrieval using layout extraction criteria and can also be used to precisely localize graphical parts in various types of documents. The system has been tested with success over several ancient printed books of the Renaissance.