Document image characterization using a multiresolution analysis of the texture: application to old documents

  • Authors:
  • Nicholas Journet;Jean-Yves Ramel;Rémy Mullot;Véronique Eglin

  • Affiliations:
  • LI, 64 Avenue Jean Portalis, 37200, Tours, France;LI, 64 Avenue Jean Portalis, 37200, Tours, France;L3I, 64 Avenue Jean Portalis, 17042, La Rochelle Cedex 1, France;LIRIS INSA de Lyon, 64 Avenue Jean Portalis, 17042, Villeurbanne Cedex, France

  • Venue:
  • International Journal on Document Analysis and Recognition
  • Year:
  • 2008

Quantified Score

Hi-index 0.01

Visualization

Abstract

In this article, we propose a method of characterization of images of old documents based on a texture approach. This characterization is carried out with the help of a multi-resolution study of the textures contained in the images of the document. Thus, by extracting five features linked to the frequencies and to the orientations in the different areas of a page, it is possible to extract and compare elements of high semantic level without expressing any hypothesis about the physical or logical structure of the analyzed documents. Experimentation based on segmentation, data analysis and document image retrieval tools demonstrate the performance of our propositions and the advances that they represent in terms of characterization of content of a deeply heterogeneous corpus.