Structural Decomposition and Statistical Description of Farsi/Arabic Handwritten Numeric Characters

  • Authors:
  • Saeed Mozaffari;Karim Faez;Majid Ziaratban

  • Affiliations:
  • University of Technology, Tehran, Iran;University of Technology, Tehran, Iran;University of Technology, Tehran, Iran

  • Venue:
  • ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

A Statistical method embedded with statistical features is proposed for Farsi/Arabic handwritten zip code recognition in this paper. The numeral is first smoothed and the skeleton is obtained. A set of feature points are then detected and the skeleton is decomposed into primitives. A primitive code includes the information of each primitive and a global code is derived from the primitive codes to describe the topological structure of the skeleton. By using the average and variance of X and Y changes in each primitive, the Direction and curvature of the skeleton can be statistically described. Since the global codes have different lengths, we applied PCA algorithm to normalize their lengths. Thanks to statistically description of the skeleton, we can use the nearest neighbor classifier for recognition. According to experimental results, classification rate of 94.44% is obtained for numerals on the test sets gathered from various people with different educational background and different ages. Our database includes 480 samples per digit. We used 280 samples of each digit for training and the rest (200) for test.