Segmentation-Driven recognition applied to numerical field extraction from handwritten incoming mail documents

  • Authors:
  • Clément Chatelain;Laurent Heutte;Thierry Paquet

  • Affiliations:
  • Laboratoire PSI, CNRS FRE 2645, Université de Rouen, Saint Etienne du Rouvray, France;Laboratoire PSI, CNRS FRE 2645, Université de Rouen, Saint Etienne du Rouvray, France;Laboratoire PSI, CNRS FRE 2645, Université de Rouen, Saint Etienne du Rouvray, France

  • Venue:
  • DAS'06 Proceedings of the 7th international conference on Document Analysis Systems
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we present a method for the automatic extraction of numerical fields (ZIP codes, phone numbers, etc.) from incoming mail documents. The approach is based on a segmentation-driven recognition that aims at locating isolated and touching digits among the textual information. A syntactical analysis is then performed on each line of text in order to filter the sequences that respect a particular syntax (number of digits, presence of separators) known by the system. We evaluate the performance of our system by means of the recall precision trade-off on a real incoming mail document database.