Unconstrained handwritten Devanagari character recognition using convolutional neural networks

Authors:
Kapil Mehrotra;Saumya Jetley;Akash Deshmukh;Swapnil Belhe
Affiliations:
Centre for Development of Advanced Computing (C-DAC), Pune, India;Centre for Development of Advanced Computing (C-DAC), Pune, India;Centre for Development of Advanced Computing (C-DAC), Pune, India;Centre for Development of Advanced Computing (C-DAC), Pune, India
Venue:
Proceedings of the 4th International Workshop on Multilingual OCR
Year:
2013

Citing 8
Cited 0

Best Practices for Convolutional Neural Networks Applied to Visual Document Analysis

ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 2
Digital Image Processing (3rd Edition)

Digital Image Processing (3rd Edition)
Machine Recognition of Online Handwritten Devanagari Characters

ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition
Comparative Study of Devnagari Handwritten Character Recognition Using Different Feature and Classifiers

ICDAR '09 Proceedings of the 2009 10th International Conference on Document Analysis and Recognition
A Novel Multi-feature Multi-classifier Scheme for Unconstrained Handwritten Devanagari Character Recognition

ICFHR '10 Proceedings of the 2010 12th International Conference on Frontiers in Handwriting Recognition
Annotation Tool and XML Representation for Online Indic Data

ICFHR '10 Proceedings of the 2010 12th International Conference on Frontiers in Handwriting Recognition
Convolutional Neural Network Committees for Handwritten Character Classification

ICDAR '11 Proceedings of the 2011 International Conference on Document Analysis and Recognition
Hindi handwritten word recognition using HMM and symbol tree

Proceeding of the workshop on Document Analysis and Recognition

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, we introduce a novel offline strategy for recognition of online handwritten Devanagari characters entered in an unconstrained manner. Unlike the previous approaches based on standard classifiers - SVM, HMM, ANN and trained on statistical, structural or spectral features, our method, based on CNN, allows writers to enter characters in any number or order of strokes and is also robust to certain amount of overwriting. The CNN architecture supports an increased set of 42 Devanagari character classes. Experiments with 10 different configurations of CNN and for both Exponential Decay and Inverse Scale Annealing approaches to convergence, show highly promising results. In a further improvement, the final layer neuron outputs of top 3 configurations are averaged and used to make the classification decision, achieving an accuracy of 99.82% on the train data and 98.19% on the test data. This marks an improvement of 0.2% and 5.81%, for the train and test set respectively, over the existing state-of-the-art in unconstrained input. The data used for building the system is obtained from different parts of Devanagari writing states in India, in the form of isolated words. Character level data is extracted from the collected words using a hybrid approach and covers all possible variations owing to the different writing styles and varied parent word structures.