Recognition of Printed Urdu Script

  • Authors:
  • U. Pal; Anirban Sarkar

  • Venue:
  • ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 2
  • Year:
  • 2003

Abstract

This paper deals with an Optical Character Recognition system for printed Urdu, a popular Indian script. The development of OCR for this script is difficult because (i) a large number of characters have to be recognized (ii) there are many similar shaped characters. In the proposed system individual characters are recognized using a combination of topological, contour and water reservoir concept based features. The feature detection methods are simple and robust. A prototype of the system has been tested on printed Urdu characters and currently achieves 97.8% character level accuracy on average.
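
The abstract names water reservoir features but does not define them. As a rough illustration only, and not the authors' implementation, the sketch below assumes the usual intuition behind the term: if water is poured onto a character from above, the cavities that hold water are "reservoirs", and their size can serve as a feature. The function name `top_reservoir_area`, the 0/1 list-of-rows image format, and the simplification to a top profile (everything below the topmost ink pixel in a column is treated as solid) are all assumptions made for this example. The input is assumed to be a single connected component cropped to its bounding box, so every column contains ink.

```python
from itertools import accumulate
from typing import List


def top_reservoir_area(image: List[List[int]]) -> int:
    """Count pixels that would hold water poured onto the shape from above.

    `image` is a list of rows, with 1 for ink and 0 for background.
    Hypothetical helper for illustration; not taken from the paper.
    """
    if not image or not image[0]:
        return 0
    rows, cols = len(image), len(image[0])

    # Top profile: ink height of each column measured from the bottom
    # of the box, based on the topmost ink pixel (0 if the column is empty).
    heights = []
    for c in range(cols):
        top = next((r for r in range(rows) if image[r][c]), rows)
        heights.append(rows - top)

    # Water level in a column is bounded by the tallest ink to its left
    # and to its right; anything above the lower of the two spills out.
    left_max = list(accumulate(heights, max))
    right_max = list(accumulate(reversed(heights), max))[::-1]
    return sum(min(l, r) - h for l, r, h in zip(left_max, right_max, heights))


# A cup-like shape holds water from the top; an arch-like shape does not.
cup = [
    [1, 0, 0, 1],
    [1, 0, 0, 1],
    [1, 1, 1, 1],
]
arch = [
    [1, 1, 1, 1],
    [1, 0, 0, 1],
    [1, 0, 0, 1],
]
print(top_reservoir_area(cup), top_reservoir_area(arch))
```

Flipping or rotating the image before calling the function gives the analogous bottom, left, and right reservoirs, which is one plausible way such features could help separate similarly shaped characters that differ in where their cavities open.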