Error handling approach using characterization and correction steps for handwritten document analysis

  • Authors:
  • Solen Quiniou;Mohamed Cheriet;Eric Anquetil

  • Affiliations:
  • Ecole de Technologie Superieure, Synchromedia Laboratory, 1100 rue Notre Dame Ouest, H3C 1K3, Montreal, QC, Canada;Ecole de Technologie Superieure, Synchromedia Laboratory, 1100 rue Notre Dame Ouest, H3C 1K3, Montreal, QC, Canada;IRISA—INSA, Campus de Beaulieu, 35042, Rennes Cedex, France

  • Venue:
  • International Journal on Document Analysis and Recognition
  • Year:
  • 2012

Abstract

In this paper, we present a framework to handle recognition errors in an N-best list of output phrases given by a handwriting recognition system, so that the resulting phrases can be used as inputs to a higher-level application. The framework can be decomposed into four main steps: phrase alignment, detection, characterization, and correction of word error hypotheses. First, the N-best phrases are aligned to the top-list phrase, and word posterior probabilities are computed and used as confidence indices: word error hypotheses are detected on the top-list phrase by comparing these probabilities with a learned threshold. Then, the errors are characterized into predefined types by a trained SVM that takes as input the word posterior probabilities of the top-list phrase and other features. Finally, the output phrase is retrieved by a correction step that uses the characterized error hypotheses and a purpose-designed word-to-class backoff language model. Initial experiments were conducted on the ImadocSen-OnDB handwritten sentence database and on the IAM-OnDB handwritten text database, using two recognizers. We report first results for an implementation of the proposed framework on transcripts of handwritten phrases provided by these recognition systems.
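The detection step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation and rests on several assumptions: the N-best phrases are already aligned position by position to the top-list phrase, each phrase comes with a log-score from the recognizer, and the posterior of a word is taken as the normalized score mass of the hypotheses that agree with the top-list phrase at that position. The function names `word_posteriors` and `detect_errors`, the example phrases, and the threshold value are all hypothetical; in the paper the threshold is learned.

```python
import math


def word_posteriors(aligned_nbest, log_scores):
    """Estimate a posterior for each word of the top-list phrase.

    aligned_nbest: list of phrases (lists of words), all of the same length,
                   assumed already aligned to aligned_nbest[0] (the top-list phrase).
    log_scores:    one recognizer log-score per phrase (an assumption of this sketch).
    """
    # Turn log-scores into normalized weights (softmax, shifted for stability).
    shift = max(log_scores)
    weights = [math.exp(s - shift) for s in log_scores]
    total = sum(weights)

    top = aligned_nbest[0]
    posteriors = []
    for i, word in enumerate(top):
        # Sum the weight of every hypothesis that has the same word at position i.
        mass = sum(w for hyp, w in zip(aligned_nbest, weights) if hyp[i] == word)
        posteriors.append(mass / total)
    return posteriors


def detect_errors(posteriors, threshold):
    """Return the positions whose posterior falls below the threshold."""
    return [i for i, p in enumerate(posteriors) if p < threshold]


# Hypothetical 3-best list, already aligned to the top-list phrase.
nbest = [
    ["the", "cat", "sat"],
    ["the", "cut", "sat"],
    ["the", "cat", "set"],
]
scores = [math.log(0.5), math.log(0.3), math.log(0.2)]

posteriors = word_posteriors(nbest, scores)
flagged = detect_errors(posteriors, threshold=0.75)  # threshold chosen by hand here
print(flagged)  # positions of word error hypotheses on the top-list phrase
```

In this toy example only the second word of the top-list phrase is flagged, because one competing hypothesis disagrees with it and leaves it with a posterior of about 0.7. The characterization (SVM) and correction (language model) steps would then operate on the flagged positions and are not sketched here.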