Databases for Research on Recognition of Handwritten Characters of Indian Scripts

Authors:
U. Bhattacharya;B. B. Chaudhuri
Affiliations:
CVPR Unit, Indian Statistical Institute, Kolkata, India;CVPR Unit, Indian Statistical Institute, Kolkata, India
Venue:
ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition
Year:
2005

Abstract

Three image databases of handwritten isolated numerals in three Indian scripts, namely Devnagari, Bangla and Oriya, are described in this paper. The databases consist of grayscale images of 22556 Devnagari numerals written by 1049 persons, 12938 Bangla numerals written by 556 persons and 5970 Oriya numerals written by 356 persons. The images were scanned from three kinds of handwritten documents: postal mail, job application forms and a set of forms specially designed by the collectors for this purpose. The only restriction imposed on the writers is to write each numeral within a rectangular box. These databases avoid two common limitations: they were not developed in laboratory environments, and their samples are not non-uniformly distributed over the classes. Also, for comparison purposes, each database has been divided into training and test sets.
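
To put the stated sizes in perspective, the figures from the abstract can be tabulated and the average number of numerals contributed per writer computed. This is only an illustrative sketch: the dictionary layout and the function names `total_samples` and `mean_samples_per_writer` are assumptions made for this example and are not part of the databases or of any tooling described by the authors. Only the sample and writer counts come from the abstract.

```python
# Sample and writer counts as stated in the abstract.
# The structure and names below are hypothetical, chosen for illustration.
DATABASES = {
    "Devnagari": {"samples": 22556, "writers": 1049},
    "Bangla": {"samples": 12938, "writers": 556},
    "Oriya": {"samples": 5970, "writers": 356},
}


def total_samples(databases):
    """Sum the numeral images over all scripts."""
    return sum(entry["samples"] for entry in databases.values())


def mean_samples_per_writer(databases):
    """Average number of numeral images per writer, for each script."""
    return {
        script: entry["samples"] / entry["writers"]
        for script, entry in databases.items()
    }


if __name__ == "__main__":
    print("Total numeral images:", total_samples(DATABASES))
    for script, mean in mean_samples_per_writer(DATABASES).items():
        print(f"{script}: about {mean:.1f} numerals per writer")
```

The averages come out at roughly two ten-digit sets per writer for each script, although the abstract does not say how samples were actually distributed across individual writers.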