Alpha-Numerical Sequences Extraction in Handwritten Documents

  • Authors:
  • Simon Thomas; Clement Chatelain; Laurent Heutte; Thierry Paquet

  • Venue:
  • ICFHR '10 Proceedings of the 2010 12th International Conference on Frontiers in Handwriting Recognition
  • Year:
  • 2010

Quantified Score

Hi-index 0.02

Abstract

In this paper, we introduce a system for extracting alpha-numerical sequences (keywords, numerical fields, or alpha-numerical sequences) from unconstrained handwritten documents. Unlike most approaches presented in the literature, our system relies on a global handwriting line model describing two kinds of information: i) the relevant information and ii) the irrelevant information, represented by a shallow parsing model. Shallow parsing of isolated text lines allows quick information extraction from any document while rejecting irrelevant information at the same time. Results on a public database of French incoming mail show the efficiency of the approach.
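
The idea of a line model that puts a "relevant" model in competition with an "irrelevant" (filler) model can be illustrated with a toy sketch. This is not the authors' system: it works on already-transcribed tokens rather than handwriting images, and the lexicon `KEYWORDS`, the pattern `NUMERIC_FIELD`, the constant `FILLER_SCORE`, and the functions `relevant_score` and `extract` are all hypothetical names and values chosen for illustration. A token is kept only when the relevant model scores higher than the filler model.

```python
import re
from difflib import SequenceMatcher

# Hypothetical keyword lexicon and numeric-field pattern (illustration only).
KEYWORDS = ["facture", "contrat", "client"]
NUMERIC_FIELD = re.compile(r"^\d{5,}$")  # e.g. a code of five or more digits

# Assumed constant score of the "irrelevant" (filler) model.
FILLER_SCORE = 0.6


def relevant_score(token):
    """Score in [0, 1] under the 'relevant' model.

    Returns 1.0 for a token that looks like a numeric field, otherwise the
    best string similarity between the token and any keyword.
    """
    if NUMERIC_FIELD.match(token):
        return 1.0
    return max(SequenceMatcher(None, token.lower(), k).ratio() for k in KEYWORDS)


def extract(line):
    """Return the tokens of one text line that beat the filler model."""
    return [t for t in line.split() if relevant_score(t) > FILLER_SCORE]


if __name__ == "__main__":
    print(extract("merci de verifier la facture du client 0012345"))
```

Because the similarity is approximate, a slightly misrecognized token such as `factvre` would still be kept, while short function words fall to the filler model. Raising `FILLER_SCORE` makes the sketch reject more aggressively, which loosely mirrors the trade-off between extraction and rejection described in the abstract.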