On the Evaluation of Document Analysis Components by Recall, Precision, and Accuracy

Authors:
Markus Junker;Andreas Dengel;Rainer Hoch
Affiliations:
-;-;-
Venue:
ICDAR '99 Proceedings of the Fifth International Conference on Document Analysis and Recognition
Year:
1999

Citing 0
Cited 20

Restoration of Archival Documents Using a Wavelet Technique

IEEE Transactions on Pattern Analysis and Machine Intelligence
StrCombo: combination of string recognizers

Pattern Recognition Letters - In memory of Professor E.S. Gelsema
Wavelet Applications in Segmentation of Handwriting in Archival Documents

WAA '01 Proceedings of the Second International Conference on Wavelet Analysis and Its Applications
Correcting Document Image Warping Based on Regression of Curved Text Lines

ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 1
Comparison of Some Thresholding Algorithms for Text/Background Segmentation in Difficult Document Images

ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 2
Directional Wavelet Approach to Remove Document Image Interference

ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 2
Restoring Warped Document Images through 3D Shape Modeling

IEEE Transactions on Pattern Analysis and Machine Intelligence
Independent Component Analysis Segmentation Algorithm

ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition
Semantic information integration and question answering based on pervasive agent ontology

Expert Systems with Applications: An International Journal
A comparison of text-classification techniques applied to Arabic text

Journal of the American Society for Information Science and Technology
Transition pixel: A concept for binarization based on edge detection and gray-intensity histograms

Pattern Recognition
Restoring Chinese documents images based on text boundary lines

SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
Feature selection algorithms to improve documents' classification performance

AWIC'03 Proceedings of the 1st international Atlantic web intelligence conference on Advances in web intelligence
Restoration of double-sided ancient music documents with bleed-through

CIARP'07 Proceedings of the Congress on pattern recognition 12th Iberoamerican conference on Progress in pattern recognition, image analysis and applications
An intelligent agent-based system for multilingual financial news digest

KES-AMSTA'08 Proceedings of the 2nd KES International conference on Agent and multi-agent systems: technologies and applications
smartFIX statistics: towards systematic document analysis performance evaluation and optimization

DAS '10 Proceedings of the 9th IAPR International Workshop on Document Analysis Systems
An intelligent agent-based system for multilingual financial news digest

International Journal of Intelligent Information and Database Systems
Identification of fraudulent financial statements using linguistic credibility analysis

Decision Support Systems
Methods for written ancient music restoration

ICIAR'07 Proceedings of the 4th international conference on Image Analysis and Recognition
IODetector: a generic service for indoor outdoor detection

Proceedings of the 10th ACM Conference on Embedded Network Sensor Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

In document analysis, it is common to prove the usefulness of a component by an experimental evaluation. By applying the respective algorithms to a test sample, effectiveness measures such as recall, precision, and accuracy are computed.The goal of such an evaluation is two-fold: on the one hand it shows that the absolute effectiveness of the algorithm is acceptable for practical use. On the other hand, the evaluation can prove that the algorithm has a better or worse effectiveness than another algorithm.In this paper we argue that the experimental evaluation on relative small test sets - as is very common in document analysis - has to be taken with extreme care from a statistical point of view. In fact, it is surprising how weak statements derived from such evaluations are.