A General Approach to Quality Evaluation of Document Segmentation Results

  • Authors:
  • Michael Thulke; Volker Märgner; Andreas Dengel

  • Venue:
  • DAS '98 Selected Papers from the Third IAPR Workshop on Document Analysis Systems: Theory and Practice
  • Year:
  • 1998

Abstract

Improving the performance of document analysis systems requires a detailed quality evaluation of the results they achieve. Focussing on segmentation algorithms, we argue that the results produced by the module under consideration should be evaluated directly, and we show that the text-based evaluation method often used in the document analysis domain does not provide such a detailed quality evaluation. We therefore propose a general approach for comparing segmentation results that is based directly on the segments. The approach handles both algorithms that produce complete segmentations (partition) and algorithms that only extract objects of interest (extraction). Classes of errors are defined systematically, and frequencies can be computed for each class. The approach is applicable to a wide range of segmentation and extraction algorithms. We use character segmentation as an example to demonstrate its applicability, and we suggest applying it to other segmentation tasks.
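
To make the idea of segment-based error classes concrete, the following is a minimal Python sketch and not the authors' method. The abstract does not name the error classes, so the labels used here (correct, merge, split, split_and_merge, miss, false) and the overlap rule (two segments correspond if they share at least one element) are assumptions chosen for illustration. The function name classify_segments and the input layout (a dict mapping a segment id to a set of element ids, such as pixel indices) are likewise hypothetical.

```python
from collections import Counter


def classify_segments(truth, result):
    """Count illustrative error classes for a segmentation result.

    truth, result: dict mapping segment id -> set of element ids.
    Class names and the overlap rule are assumptions, not taken from the paper.
    """
    # Which result segments overlap each ground-truth segment, and vice versa.
    t_to_r = {t: {r for r, rs in result.items() if ts & rs}
              for t, ts in truth.items()}
    r_to_t = {r: {t for t, ts in truth.items() if ts & rs}
              for r, rs in result.items()}

    counts = Counter()
    for t, matches in t_to_r.items():
        if not matches:
            counts["miss"] += 1              # ground-truth segment not found
        elif len(matches) == 1:
            (r,) = matches
            if len(r_to_t[r]) == 1:
                counts["correct"] += 1       # one-to-one correspondence
            else:
                counts["merge"] += 1         # shares a result segment with others
        elif all(len(r_to_t[r]) == 1 for r in matches):
            counts["split"] += 1             # broken into several result segments
        else:
            counts["split_and_merge"] += 1   # many-to-many correspondence

    # Result segments that overlap no ground-truth segment at all.
    counts["false"] += sum(1 for ts in r_to_t.values() if not ts)
    return counts


# Hypothetical toy data: five ground-truth segments, five result segments.
truth = {"a": {1, 2}, "b": {3, 4}, "c": {5, 6}, "d": {7, 8}, "e": {9}}
result = {"r1": {1, 2}, "r2": {3, 4, 5, 6}, "r3": {7}, "r4": {8}, "r5": {10}}
frequencies = classify_segments(truth, result)
print(dict(frequencies))
```

Counts are taken per ground-truth segment, so a merge of two segments contributes two to the merge class. An extraction-style algorithm, which only reports objects of interest, can be scored with the same function by listing only those objects in the ground truth; whether unmatched result segments should then count as errors is a design decision this sketch leaves open.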