Hindi handwritten word recognition using HMM and symbol tree

Authors:
Swapnil Belhe;Chetan Paulzagade;Akash Deshmukh;Saumya Jetley;Kapil Mehrotra
Affiliations:
Center for Development of Advanced Computing (C-DAC), Pune, India;Center for Development of Advanced Computing (C-DAC), Pune, India;Center for Development of Advanced Computing (C-DAC), Pune, India;Center for Development of Advanced Computing (C-DAC), Pune, India;Center for Development of Advanced Computing (C-DAC), Pune, India
Venue:
Proceeding of the workshop on Document Analysis and Recognition
Year:
2012

Citing 19
Cited 1

Fast Discrete HMM Algorithm for On-line Handwriting Recognition

ICPR '00 Proceedings of the International Conference on Pattern Recognition - Volume 4
Comparison of Elastic Matching Algorithms for Online Tamil Handwritten Character Recognition

IWFHR '04 Proceedings of the Ninth International Workshop on Frontiers in Handwriting Recognition
Histograms of Oriented Gradients for Human Detection

CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 1 - Volume 01
Hybrid Recognition for One Stroke Style Cursive Handwriting Characters

ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition
Machine Recognition of Online Handwritten Devanagari Characters

ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition
Hidden Markov Models for Online Handwritten Tamil Word Recognition

ICDAR '07 Proceedings of the Ninth International Conference on Document Analysis and Recognition - Volume 01
Offline Handwritten Devanagari Word Recognition: A Holistic Approach Based on Directional Chain Code Feature and HMM

ICIT '08 Proceedings of the 2008 International Conference on Information Technology
Off-line Cursive Handwritten Tamil Character Recognition

SECTECH '08 Proceedings of the 2008 International Conference on Security Technology
Divide and conquer technique in online handwritten Kannada character recognition

Proceedings of the International Workshop on Multilingual OCR
XML standard for Indic online handwritten database

Proceedings of the International Workshop on Multilingual OCR
A robust model for on-line handwritten japanese text recognition

International Journal on Document Analysis and Recognition - Special Issue DRR09
Orthogonal LDA in PCA Transformed Subspace

ICFHR '10 Proceedings of the 2010 12th International Conference on Frontiers in Handwriting Recognition
Creation of a Huge Annotated Database for Tamil and Kannada OHR

ICFHR '10 Proceedings of the 2010 12th International Conference on Frontiers in Handwriting Recognition
A Hybrid Model for Recognition of Online Handwriting in Indian Scripts

ICFHR '10 Proceedings of the 2010 12th International Conference on Frontiers in Handwriting Recognition
Online Handwritten Kannada Word Recognizer with Unrestricted Vocabulary

ICFHR '10 Proceedings of the 2010 12th International Conference on Frontiers in Handwriting Recognition
Annotation Tool and XML Representation for Online Indic Data

ICFHR '10 Proceedings of the 2010 12th International Conference on Frontiers in Handwriting Recognition
Online Handwriting Recognition of Tamil Script Using Fractal Geometry

ICDAR '11 Proceedings of the 2011 International Conference on Document Analysis and Recognition
Lexicon-Free, Novel Segmentation of Online Handwritten Indic Words

ICDAR '11 Proceedings of the 2011 International Conference on Document Analysis and Recognition
HMM-Based Lexicon-Driven and Lexicon-Free Word Recognition for Online Handwritten Indic Scripts

IEEE Transactions on Pattern Analysis and Machine Intelligence

Unconstrained handwritten Devanagari character recognition using convolutional neural networks

Proceedings of the 4th International Workshop on Multilingual OCR

Quantified Score

Hi-index	0.00

Visualization

Abstract

The proposed approach performs recognition of online handwritten isolated Hindi words using a combination of HMMs trained on Devanagari symbols and a tree formed by the multiple, possible sequences of recognized symbols. In general, words in Indic languages are composed of a number of aksharas or syllables, which in turn are formed by groups of consonants and vowel modifiers. Segmentation of aksharas is critical to accurate recognition of both recognition primitives as well as the complete word. Also, recognition in itself is an intricate job. This holistic task of akshara segmentation, symbol identification and subsequent word recognition is targeted in our work. It is handled in an integrated segmentation-recognition framework. By making use of online stroke information for postulating symbol candidates and deriving HOG feature set from their image counterparts, the recognition becomes independent of stroke order and stroke shape variations. Thus, the system is well suited to unconstrained handwriting. Data for this work is collected from different parts of India where Hindi language is predominantly in use. Symbols extracted from 60,000 words are used to train and test 140 symbol-HMM models. The system is designed to output one or more candidate words to the user, by tracing multiple tree paths (up to leaf nodes) under the condition that the symbol likelihood (confidence score) at every node is above threshold. Tests performed on 10,000 words yield an accuracy of 89%.