User-Assisted Archive Document Image Analysis for Digital Library Construction

  • Authors:
  • J. He; A. C. Downton

  • Venue:
  • ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 1
  • Year:
  • 2003

Abstract

A configurable archive document image analysis system for digital library construction has been designed using rapid prototyping and top-down iterative development methods. This approach has been found to be essential in order to capture the curators' expertise about existing card archive structures, content and databases. The design currently achieves about 93% correct segmentation of the required archive card fields overall, with 81.3% of all archive cards in a test set of 2000 images having all fields correctly segmented and labelled. Analysis of errors in the test set indicates that heavily-annotated cards and non-standard card formats comprise 5-10% of the overall archive, and a significant proportion of these are unlikely to be resolvable without curatorial intervention.