Low resolution Arabic recognition with multidimensional recurrent neural networks

  • Authors:
  • Sheikh Faisal Rashid;Marc-Peter Schambach;Jörg Rottland;Stephan von der Nüll

  • Affiliations:
  • Image Understanding and Pattern Recognition (IUPR), Technical University, Kaiserslautern, Germany;Siemens AG, Konstanz, Germany;Siemens AG, Konstanz, Germany;Siemens AG, Konstanz, Germany

  • Venue:
  • Proceedings of the 4th International Workshop on Multilingual OCR
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

OCR of multi-font Arabic text is difficult due to large variations in character shapes from one font to another. It becomes even more challenging if the text is rendered at very low resolution. This paper describes a multi-font, low resolution, and open vocabulary OCR system based on a multidimensional recurrent neural network architecture. For this work, we have developed various systems, trained for single-font/single-size, single-font/multi-size, and multi-font/multi-size data of the well known Arabic printed text image database (APTI). The evaluation tasks from the second Arabic text recognition competition, organized in conjunction with ICDAR 2013, have been adopted. Ten Arabic fonts in six font size categories are used for evaluation. Results show that the proposed method performs very well on the task of printed Arabic text recognition even for very low resolution and small font size images. Overall, the system yields above 99% recognition accuracy at character and word level for most of the printed Arabic fonts.