Introducing a very large dataset of handwritten Farsi digits and a study on their varieties

  • Authors: Hossein Khosravi; Ehsanollah Kabir
  • Affiliations: Department of Electrical Engineering, Tarbiat Modarres University, Tehran, Iran and Research and Development Unit, HODA System Co., Tehran, Iran; Department of Electrical Engineering, Tarbiat Modarres University, Tehran, Iran
  • Venue: Pattern Recognition Letters
  • Year: 2007

Abstract

A very large dataset of handwritten Farsi digits is introduced. Binary images of 102,352 digits were extracted from about 12,000 registration forms of two types, filled in by B.Sc. and senior high school students. The forms were scanned at 200 dpi with a high-speed scanner. A method for measuring the variety of handwritten digits in a typical dataset is proposed. Based on this method, training and test subsets are provided to make it easier for researchers to share results and compare performance.