Computing precision and recall with missing or uncertain ground truth

Authors:
Bart Lamiroy;Tao Sun
Affiliations:
Université de Lorraine, LORIA, UMR 7503, Nancy, France;Computer Science and Engineering, Lehigh University, Bethlehem, PA
Venue:
GREC'11 Proceedings of the 9th international conference on Graphics Recognition: new trends and challenges
Year:
2011

Citing 13
Cited 0

Information Retrieval

Information Retrieval
An Empirical Comparison of Voting Classification Algorithms: Bagging, Boosting, and Variants

Machine Learning
Binarization of Low Quality Text Using a Markov Random Field Model

ICPR '02 Proceedings of the 16 th International Conference on Pattern Recognition (ICPR'02) Volume 3 - Volume 3
Incorporating non-local information into information extraction systems by Gibbs sampling

ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
A general framework for the evaluation of symbol recognition methods

International Journal on Document Analysis and Recognition
An Overview of the Tesseract OCR Engine

ICDAR '07 Proceedings of the Ninth International Conference on Document Analysis and Recognition - Volume 02
Pattern Recognition Methods for Querying and Browsing Technical Documentation

CIARP '08 Proceedings of the 13th Iberoamerican congress on Pattern Recognition: Progress in Pattern Recognition, Image Analysis and Applications
Results of the RIMES Evaluation Campaign for Handwritten Mail Processing

ICDAR '09 Proceedings of the 2009 10th International Conference on Document Analysis and Recognition
Relying on topic subsets for system ranking estimation

Proceedings of the 18th ACM conference on Information and knowledge management
Document Analysis Algorithm Contributions in End-to-End Applications: Report on the ICDAR 2011 Contest

ICDAR '11 Proceedings of the 2011 International Conference on Document Analysis and Recognition
A probabilistic interpretation of precision, recall and F-score, with implication for evaluation

ECIR'05 Proceedings of the 27th European conference on Advances in Information Retrieval Research
Ground truth for layout analysis performance evaluation

DAS'06 Proceedings of the 7th international conference on Document Analysis Systems
Spatio-structural symbol description with statistical feature add-on

GREC'11 Proceedings of the 9th international conference on Graphics Recognition: new trends and challenges

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper we present a way to use precision and recall measures in total absence of ground truth. We develop a probabilistic interpretation of both measures and show that, provided a sufficient number of data sources are available, it offers a viable performance measure to compare methods if no ground truth is available. This paper also shows the limitations of the approach, in case a systematic bias is present in all compared methods, but shows that it maintains a very high level of overall coherence and stability. It opens broader perspectives and can be extended to handling partial or unreliable ground truth, as well as levels of prior confidence in the methods it aims to compare.