Best Practices for Convolutional Neural Networks Applied to Visual Document Analysis
ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 2
Digital Image Processing (3rd Edition)
Digital Image Processing (3rd Edition)
Machine Recognition of Online Handwritten Devanagari Characters
ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition
ICDAR '09 Proceedings of the 2009 10th International Conference on Document Analysis and Recognition
ICFHR '10 Proceedings of the 2010 12th International Conference on Frontiers in Handwriting Recognition
Annotation Tool and XML Representation for Online Indic Data
ICFHR '10 Proceedings of the 2010 12th International Conference on Frontiers in Handwriting Recognition
Convolutional Neural Network Committees for Handwritten Character Classification
ICDAR '11 Proceedings of the 2011 International Conference on Document Analysis and Recognition
Hindi handwritten word recognition using HMM and symbol tree
Proceeding of the workshop on Document Analysis and Recognition
Hi-index | 0.00 |
In this paper, we introduce a novel offline strategy for recognition of online handwritten Devanagari characters entered in an unconstrained manner. Unlike the previous approaches based on standard classifiers - SVM, HMM, ANN and trained on statistical, structural or spectral features, our method, based on CNN, allows writers to enter characters in any number or order of strokes and is also robust to certain amount of overwriting. The CNN architecture supports an increased set of 42 Devanagari character classes. Experiments with 10 different configurations of CNN and for both Exponential Decay and Inverse Scale Annealing approaches to convergence, show highly promising results. In a further improvement, the final layer neuron outputs of top 3 configurations are averaged and used to make the classification decision, achieving an accuracy of 99.82% on the train data and 98.19% on the test data. This marks an improvement of 0.2% and 5.81%, for the train and test set respectively, over the existing state-of-the-art in unconstrained input. The data used for building the system is obtained from different parts of Devanagari writing states in India, in the form of isolated words. Character level data is extracted from the collected words using a hybrid approach and covers all possible variations owing to the different writing styles and varied parent word structures.