Towards an omnilingual word retrieval system for ancient manuscripts

  • Authors:
  • Yann Leydier;Asma Ouji;Frank LeBourgeois;Hubert Emptoz

  • Affiliations:
  • Université de Lyon, CNRS, INSA-Lyon, LIRIS, UMR5205, 20 av. Albert Einstein, Villeurbanne F-69621, France;Université de Lyon, CNRS, INSA-Lyon, LIRIS, UMR5205, 20 av. Albert Einstein, Villeurbanne F-69621, France and Spigraph, 860 Rue René Descartes, Les Pléiades1-Bítiment A, Aix-en ...;Université de Lyon, CNRS, INSA-Lyon, LIRIS, UMR5205, 20 av. Albert Einstein, Villeurbanne F-69621, France;Université de Lyon, CNRS, INSA-Lyon, LIRIS, UMR5205, 20 av. Albert Einstein, Villeurbanne F-69621, France

  • Venue:
  • Pattern Recognition
  • Year:
  • 2009

Quantified Score

Hi-index 0.01

Visualization

Abstract

In this article, we introduce the first method that allows the indexation of ancient manuscripts of any language and alphabet. We describe a word retrieval engine inspired by recent word-spotting advances on ancient manuscripts. Our approach does not need any layout segmentation and makes use of features fitted to any type of alphabet (Latin, Arabic, Chinese, etc.) and writing. The engine is tested on numerous documents and in several use-cases.