Machine Learning of Generalized Document Templates for Data Extraction
DAS '02 Proceedings of the 5th International Workshop on Document Analysis Systems V
Thick 2D relations for document understanding
Information Sciences—Informatics and Computer Science: An International Journal
Hi-index | 0.00 |
In this paper an architecture for understanding documents of a domain that can be grouped into classes is shown. Documents are grouped with respect to the physical structure. The architecture is based on two knowledge descriptions of the domain: one is independent from the classes and one related to the classes. Such knowledge levels are used to understand the documents of the domain. The understanding phase is described in relation with the phases of analysis and classification of such documents.1