Access by content to handwritten archive documents: generic document recognition method and platform for annotations

  • Authors:
  • Bertrand Coüasnon;Jean Camillerapp;Ivan Leplumey

  • Affiliations:
  • Campus universitaire de Beaulieu, IRISA/INRIA, 35042, Rennes Cedex, France;Campus universitaire de Beaulieu, IRISA/INSA, 35042, Rennes Cedex, France;Campus universitaire de Beaulieu, IRISA/INSA, 35042, Rennes Cedex, France

  • Venue:
  • International Journal on Document Analysis and Recognition
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents annotations needed for handwritten archive document retrieval by content. We propose two complementary ways of producing these annotations: automatically by using document image analysis and collectively by using the Internet and manual input by users. A platform for managing these annotations is presented as well as examples of automatic annotations on civil status registers, military forms (tested on 165,000 pages) and naturalization decrees, using a generic method for structured document recognition and handwriting recognition on names. Examples of collective annotations built on automatic annotations are also given. This platform is already open to the public in the reading room of the new building of the Archives départementales des Yvelines and on the Internet. About 1,450,000 images of civil status registers are available for collective annotation as well as 105,000 pages of military forms with automatic annotation of handwritten names.