Towards Whole-Book Recognition

  • Authors:
  • Pingping Xiu;Henry S. Baird

  • Affiliations:
  • -;-

  • Venue:
  • DAS '08 Proceedings of the 2008 The Eighth IAPR International Workshop on Document Analysis Systems
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

We describe experimental results for unsupervised recognition of the textual contents of book-images using fully automatic mutual-entropy-based model adaptation. Each experiment starts with approximate {\it iconic} and{\it linguistic} models---derived from (generally errorful) OCR results and (generally incomplete) dictionaries---and then runs a fully automatic adaptation algorithm which, guided entirely by evidence internal to the test set, attempts to correct the models for improved accuracy. The iconic model describes image formation and determines the behavior of a character-image classifier. The linguistic model describes word-occurrence probabilities. Our adaptation algorithm detects disagreements between the models by analyzing mutual entropy between (1) the {\em a posteriori} probability distribution of character classes (the recognition results from image classification alone), and (2) the {\em a posteriori} probability distribution of word classes (the recognition results from image classification combined with linguistic constraints). Disagreements identify candidates for automatic model corrections. We report experiments on 40 textlines in which word error rates fall monotonicaly with passage lengths. We also report experiments on an enhanced algorithm which can cope with character-segmentation errors (a single split, or a single merge, per word). In order to scale up experiments, soon, to whole book images, we have revised data structures and implemented speed enhancements. For this algorithm, we report results on three increasingly long passage lengths: (a) one full page, (b) five pages, and (b) ten pages. We observe that error rates on long words fall monotonically with passage lengths.