Towards historical document indexing: extraction of drop cap letters

  • Authors:
  • Mickael Coustaty;Rudolf Pareti;Nicole Vincent;Jean-Marc Ogier

  • Affiliations:
  • Avenue Michel Crepeau, Imedoc Team - L3i Laboratory, 17042, La Rochelle, France;SIP Team, LIPADE Laboratory, 45, rue des Saints-Peres, 75270, Paris Cedex 06, France;SIP Team, LIPADE Laboratory, 45, rue des Saints-Peres, 75270, Paris Cedex 06, France;Avenue Michel Crepeau, Imedoc Team - L3i Laboratory, 17042, La Rochelle, France

  • Venue:
  • International Journal on Document Analysis and Recognition - Special issue - Selected and extended papers from ICDAR2009
  • Year:
  • 2011

Abstract

This paper addresses the difficult problem of indexing ancient graphic images. It tackles the particular case of drop caps (also called lettrines) and, specifically, the problem of extracting the letter from these complex graphic images. Based on an analysis of the features of the images to be indexed, an original strategy is proposed. The approach first filters the relevant information by means of a Meyer decomposition. Then, to accommodate the variability in how the information is represented, a model based on Zipf’s law detects the regions belonging to the letter, which allows the letter to be segmented. The overall process is evaluated on a relevant set of images, and the results show the relevance of the approach.
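To make the Zipf’s law step more concrete, the sketch below shows one common way to build a rank-frequency model of small image patterns and to estimate the slope of the resulting log-log curve. It is only an illustration and not the authors' implementation: the 3x3 window, the quantization to 4 gray levels, and the function names `pattern_counts` and `zipf_slope` are all assumptions introduced here, and the abstract does not specify how the letter regions are derived from the fitted model.

```python
import math
from collections import Counter


def pattern_counts(image, size=3, levels=4, max_value=255):
    """Count size x size patterns in a grayscale image (list of rows).

    Gray values are first quantized to `levels` classes so that the number
    of distinct patterns stays manageable. All parameters are illustrative
    assumptions, not values taken from the paper.
    """
    height, width = len(image), len(image[0])
    quantized = [
        [min(v * levels // (max_value + 1), levels - 1) for v in row]
        for row in image
    ]
    counts = Counter()
    for y in range(height - size + 1):
        for x in range(width - size + 1):
            pattern = tuple(
                quantized[y + dy][x + dx]
                for dy in range(size)
                for dx in range(size)
            )
            counts[pattern] += 1
    return counts


def zipf_slope(counts):
    """Least-squares slope of log(frequency) against log(rank).

    A pure Zipf distribution (frequency proportional to 1 / rank)
    gives a slope of -1.
    """
    freqs = sorted(counts.values(), reverse=True)
    if len(freqs) < 2:
        raise ValueError("need at least two distinct patterns to fit a slope")
    xs = [math.log(rank) for rank in range(1, len(freqs) + 1)]
    ys = [math.log(freq) for freq in freqs]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx
```

In a pipeline of this kind, the counts would typically be computed on the filtered image, and the rank-frequency curve would then be used to separate groups of patterns. How that separation maps to letter regions is left open here because the abstract does not describe it.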