Evaluating text categorization
HLT '91 Proceedings of the workshop on Speech and Natural Language
An evaluation of text analysis technologies
AI Magazine
Automated Evaluation of OCR Zoning
IEEE Transactions on Pattern Analysis and Machine Intelligence
Document image analysis
Evaluating and optimizing autonomous text classification systems
SIGIR '95 Proceedings of the 18th annual international ACM SIGIR conference on Research and development in information retrieval
A General Approach to Quality Evaluation of Document Segmentation Results
DAS '98 Selected Papers from the Third IAPR Workshop on Document Analysis Systems: Theory and Practice
Ground-truthing and benchmarking document page segmentation
ICDAR '95 Proceedings of the Third International Conference on Document Analysis and Recognition (Volume 2) - Volume 2
Representations and Metrics for Off-Line Handwriting Segmentation
IWFHR '02 Proceedings of the Eighth International Workshop on Frontiers in Handwriting Recognition (IWFHR'02)
Three Approaches to "Industrial" Table Spotting
ICDAR '01 Proceedings of the Sixth International Conference on Document Analysis and Recognition
An Automatic Performance Evaluation Method for Document Page Segmentation
ICDAR '01 Proceedings of the Sixth International Conference on Document Analysis and Recognition
Automatic discovery of logical document structure
Automatic discovery of logical document structure
Evaluating SEE - A Benchmarking System for Document Page Segmentation
ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 1
Problem-adaptable document analysis and understanding for high-volume applications
International Journal on Document Analysis and Recognition
MUC5 '93 Proceedings of the 5th conference on Message understanding
MUC4 '92 Proceedings of the 4th conference on Message understanding
smartFIX statistics: towards systematic document analysis performance evaluation and optimization
DAS '10 Proceedings of the 9th IAPR International Workshop on Document Analysis Systems
Hi-index | 0.00 |
An approach is presented to guide the benchmarking of invoice analysis systems, a specific, applied subclass of document analysis systems. The state of the art of benchmarking of document analysis systems is presented, based on the processing levels: Document Page Segmentation, Text Recognition, Document Classification, and Information Extraction. The restriction to invoices enables and requires a more purposeful, i.e. detailed, targetting of the benchmarking procedures (acquisition of ground truth data, system runs, comparison of data, condensation into meaningful numbers). Therefore the processing of invoices is dissected. The involved data structures are elicited and presented. These are provided, being the building blocks of the actual benchmarking of invoice analysis systems.