Rectangle Labelling for an Invoice Understanding System
ICDAR '97 Proceedings of the 4th International Conference on Document Analysis and Recognition
Automatic Acquisition of Layout Knowledge for Understanding Business Cards
ICDAR '97 Proceedings of the 4th International Conference on Document Analysis and Recognition
A Generic System for Processing Invoices
ICDAR '97 Proceedings of the 4th International Conference on Document Analysis and Recognition
User-Defined Template for Identifying Document Type and Extracting Information from Documents
ICDAR '99 Proceedings of the Fifth International Conference on Document Analysis and Recognition
Fast Lexicon-Based Word Recognition in Noisy Index Card Images
ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 1
A template-based method for identifying input regions in survey forms
Pattern Recognition and Image Analysis
Hi-index | 0.00 |
This paper describes a universal technology for automated data capture from documents with similar data but different layouts, such as invoices, claim forms, résumés, contracts, loan documents, etc. Prior to data capture, the relevant data are detected on the document image. A formalization of top-down document analysis is suggested and a language for describing document structures is presented. Formalized descriptions in this language can be compiled into executable code. The process of matching such formalized descriptions with actual semi-structured documents in order to find the relevant data is described.