Comparison of global and cascading recognition systems applied to multi-font Arabic text

  • Authors:
  • Fouad Slimane; Slim Kanoun; Adel M. Alimi; Jean Hennebert; Rolf Ingold

  • Affiliations:
  • University of Fribourg, Fribourg, Switzerland; REGIM, University of Sfax, National School of Engineers (ENIS), Sfax, Tunisia; REGIM, University of Sfax, National School of Engineers (ENIS), Sfax, Tunisia; Business Information System Institute, HES-SO // Wallis, Sierre, Switzerland; DIVA: Document, Image and Voice Analysis research group, Fribourg, Switzerland

  • Venue:
  • Proceedings of the 10th ACM symposium on Document engineering
  • Year:
  • 2010

Abstract

A known difficulty of Arabic text recognition is the large variability of the printed representation from one font to another. In this paper, we present a comparative study of two strategies for recognizing multi-font Arabic text. The first strategy uses a global recognition system that works independently of the font, with a single system handling all fonts. The second strategy uses a so-called cascade built from a font identification system followed by font-dependent recognition systems. To ensure a fair comparison, the feature extraction and the HMM-based modeling algorithms are kept as similar as possible between the two approaches. The evaluation is carried out on the large, publicly available APTI (Arabic Printed Text Image) database with 10 different fonts. The results show a clear performance advantage for the cascading approach. However, the cascading system is more costly in terms of CPU and memory.
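
The difference in control flow between the two strategies can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names (`recognize_global`, `recognize_cascade`), the font labels (`font_a`, `font_b`), and the toy stand-in models are all hypothetical, and simple callables are used in place of the HMM-based recognizers and the font identification system described in the abstract.

```python
from typing import Callable, Dict, Sequence

# Hypothetical type aliases: a feature vector, a text recognizer, a font identifier.
Features = Sequence[float]
Recognizer = Callable[[Features], str]      # features -> recognized text
FontIdentifier = Callable[[Features], str]  # features -> font label


def recognize_global(features: Features, global_model: Recognizer) -> str:
    """Global strategy: a single model handles input from every font."""
    return global_model(features)


def recognize_cascade(
    features: Features,
    identify_font: FontIdentifier,
    font_models: Dict[str, Recognizer],
) -> str:
    """Cascade strategy: identify the font, then use that font's own model."""
    font = identify_font(features)
    return font_models[font](features)


# Toy stand-ins (assumptions for illustration only, not real HMMs).
def toy_identify_font(features: Features) -> str:
    # Pretend the first feature value separates the two hypothetical fonts.
    return "font_a" if features[0] < 0.5 else "font_b"


def toy_global_model(features: Features) -> str:
    return "decoded by global model"


toy_font_models: Dict[str, Recognizer] = {
    "font_a": lambda features: "decoded by font_a model",
    "font_b": lambda features: "decoded by font_b model",
}

print(recognize_global([0.2, 0.9], toy_global_model))
print(recognize_cascade([0.2, 0.9], toy_identify_font, toy_font_models))
```

In this sketch the cascade needs a font identifier plus one recognizer per font, and it runs two stages per input, which is consistent with the abstract's remark that the cascading system costs more CPU and memory than the global one.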