The sciences of the artificial (3rd ed.)
The sciences of the artificial (3rd ed.)
Automatic Extraction of Reference Linking Information from Online Documents
Automatic Extraction of Reference Linking Information from Online Documents
Difficult and Urgent Open Problems in Document Image Analysis for Libraries
DIAL '04 Proceedings of the First International Workshop on Document Image Analysis for Libraries (DIAL'04)
Robust document image understanding technologies
Proceedings of the 1st ACM workshop on Hardcopy document processing
Structuring documents according to their table of contents
Proceedings of the 2005 ACM symposium on Document engineering
Document digitization lifecycle for complex magazine collection
Proceedings of the 2005 ACM symposium on Document engineering
Detection and Segmentation of Table of Contents and Index Pages from Document Images
DIAL '06 Proceedings of the Second International Conference on Document Image Analysis for Libraries
Quality Assurance in High Volume Document Digitization: A Survey
DIAL '06 Proceedings of the Second International Conference on Document Image Analysis for Libraries
Performance comparison of six algorithms for page segmentation
DAS'06 Proceedings of the 7th international conference on Document Analysis Systems
Multi-page document analysis based on format consistency and clustering
International Journal of Computer Applications in Technology
Document: a useful level for facing noisy data
AND '10 Proceedings of the fourth workshop on Analytics for noisy unstructured text data
Xeproc©: a model-based approach towards document process preservation
ECDL'10 Proceedings of the 14th European conference on Research and advanced technology for digital libraries
Document resizing for visually impaired students
Proceedings of the 22nd Conference of the Computer-Human Interaction Special Interest Group of Australia on Computer-Human Interaction
Identifying claimed knowledge updates in biomedical research articles
ACL '12 Proceedings of the Workshop on Detecting Structure in Scholarly Discourse
Hi-index | 0.00 |
We present in this paper a method for document layout analysis based on identifying the function of document elements (what they do). This approach is orthogonal and complementary to the traditional view based on the form of document elements (how they are constructed). One key advantage of such functional knowledge is that the functions of some document elements are very stable from document to document and over time. Relying on the stability of such functions, the method is not impacted by layout variability, a key issue in logical document analysis and is thus very robust and versatile. The method starts the recognition process by using functional knowledge and uses in a second step formal knowledge as a source of feedback in order to correct some errors. This allows the method to adapt to specific documents by using formal specificities.