Automatic indexing of French handwritten census registers for probate genealogy

  • Authors:
  • Cédric Sibade;Thomas Retornaz;Thibauld Nion;Romain Lerallut;Christopher Kermorvant

  • Affiliations:
  • A2iA, Artificial Intelligence and Image Analysis, Paris, France;A2iA, Artificial Intelligence and Image Analysis, Paris, France;A2iA, Artificial Intelligence and Image Analysis, Paris, France;A2iA, Artificial Intelligence and Image Analysis, Paris, France;A2iA, Artificial Intelligence and Image Analysis, Paris, France

  • Venue:
  • Proceedings of the 2011 Workshop on Historical Document Imaging and Processing
  • Year:
  • 2011

Abstract

This paper describes the complete indexing process for the registers of a French census dating back more than a hundred years, from image analysis to integration into the information system, in the context of probate genealogy. The documents of interest consist of a table of personal information from which the cells containing the first name, the surname and the relation to the head of household must be extracted and recognized. More than 30 million cells were processed, and their content was either integrated directly into the information system or sent to keyers for manual validation, achieving an automation rate of 80% while keeping the error rate below 15% on average. Based on this project, we have started developing a generic platform for processing table-based historical documents, which includes new functionalities and a more generic and user-friendly interface for defining table models.