Handwriting Recognition in Indian Regional Scripts: A Survey of Offline Techniques

Authors:
Umapada Pal;Ramachandran Jayadevan;Nabin Sharma
Affiliations:
Indian Statistical Institute;Pune Institute of Computer Technology;Indian Statistical Institute
Venue:
ACM Transactions on Asian Language Information Processing (TALIP)
Year:
2012

Citing 53
Cited 3

A connectionist expert system model for conflict resolution in unconstrained handwritten numeral recognition

Pattern Recognition Letters
On-Line and Off-Line Handwriting Recognition: A Comprehensive Survey

IEEE Transactions on Pattern Analysis and Machine Intelligence
Segmentation of Bangla Handwritten Text into Characters by Recursive Contour Following

ICDAR '99 Proceedings of the Fifth International Conference on Document Analysis and Recognition
Fuzzy Approach to Recognize Handwritten Tamil Characters

ICCIMA '99 Proceedings of the 3rd International Conference on Computational Intelligence and Multimedia Applications
A Majority Voting Scheme for Multiresolution Recognition of Handprinted Numerals

ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 1
Recognition of Printed Urdu Script

ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 2
Segmentation of Bangla Unconstrained Handwritten Text

ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 2
Handwriting Segmentation of Unconstrained Oriya Text

IWFHR '04 Proceedings of the Ninth International Workshop on Frontiers in Handwriting Recognition
On the Choice of Training Set, Architecture and Combination Rule of Multiple MLP Classifiers for Multiresolution Recognition of Handwritten Characters

IWFHR '04 Proceedings of the Ninth International Workshop on Frontiers in Handwriting Recognition
A System towards Indian Postal Automation

IWFHR '04 Proceedings of the Ninth International Workshop on Frontiers in Handwriting Recognition
Oriya Handwritten Numeral Recognition Syste

ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition
Structural Information Implant in a Context Based Segmentation-Free HMM Handwritten Word Recognition System for Latin and Bangla Script

ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition
A Novel Approach to Skew Detection and Character Segmentation for Handwritten Bangla Words

DICTA '05 Proceedings of the Digital Image Computing on Techniques and Applications
An Iterative Algorithm for Segmentation of Isolated Handwritten Words in Gurmukhi Script

ICPR '06 Proceedings of the 18th International Conference on Pattern Recognition - Volume 02
An HMM Based Recognition Scheme for Handwritten Oriya Numerals

ICIT '06 Proceedings of the 9th International Conference on Information Technology
Recognition of Handwritten Kannada Numerals

ICIT '06 Proceedings of the 9th International Conference on Information Technology
Handwritten Bangla numeral recognition system and its application to postal automation

Pattern Recognition
Text line extraction from multi-skewed handwritten documents

Pattern Recognition
A Fuzzy Technique for Segmentation of Handwritten Bangla Word Images

ICCTA '07 Proceedings of the International Conference on Computing: Theory and Applications
Curvelet-Based Multi SVM Recognizer for Offline Handwritten Bangla: A Major Indian Script

ICDAR '07 Proceedings of the Ninth International Conference on Document Analysis and Recognition - Volume 01
A Two Stage Recognition Scheme for Handwritten Tamil Characters

ICDAR '07 Proceedings of the Ninth International Conference on Document Analysis and Recognition - Volume 01
Handwritten Numeral Recognition of Six Popular Indian Scripts

ICDAR '07 Proceedings of the Ninth International Conference on Document Analysis and Recognition - Volume 02
Handwritten Kannada Numeral Recognition Based on Structural Features

ICCIMA '07 Proceedings of the International Conference on Computational Intelligence and Multimedia Applications (ICCIMA 2007) - Volume 02
Neural Network Based Offline Tamil Handwritten Character Recognition System

ICCIMA '07 Proceedings of the International Conference on Computational Intelligence and Multimedia Applications (ICCIMA 2007) - Volume 02
1D Wavelet Transform of Projection Profiles for Isolated Handwritten Malayalam Character Recognition

ICCIMA '07 Proceedings of the International Conference on Computational Intelligence and Multimedia Applications (ICCIMA 2007) - Volume 02
Handwritten Bangla Compound Character Recognition Using Gradient Feature

ICIT '07 Proceedings of the 10th International Conference on Information Technology
A System for Off-Line Oriya Handwritten Character Recognition Using Curvature Feature

ICIT '07 Proceedings of the 10th International Conference on Information Technology
Isolated Handwritten Kannada and Tamil Numeral Recognition: A Novel Approach

ICETET '08 Proceedings of the 2008 First International Conference on Emerging Trends in Engineering and Technology
Trilingual Script Separation of Handwritten Postal Document

ICVGIP '08 Proceedings of the 2008 Sixth Indian Conference on Computer Vision, Graphics & Image Processing
Off-line Cursive Handwritten Tamil Character Recognition

SECTECH '08 Proceedings of the 2008 International Conference on Security Technology
Discrete Curve Evolution Based Skeleton Pruning for Character Recognition

ICAPR '09 Proceedings of the 2009 Seventh International Conference on Advances in Pattern Recognition
A hierarchical approach to recognition of handwritten Bangla characters

Pattern Recognition
FLD Based Unconstrained Handwritten Kannada Character Recognition

FGCNS '08 Proceedings of the 2008 Second International Conference on Future Generation Communication and Networking Symposia - Volume 03
SVM-based hierarchical architectures for handwritten Bangla character recognition

International Journal on Document Analysis and Recognition
A new benchmark on the recognition of handwritten Bangla and Farsi numeral characters

Pattern Recognition
A New Large Urdu Database for Off-Line Handwriting Recognition

ICIAP '09 Proceedings of the 15th International Conference on Image Analysis and Processing
Handwritten Text Line Identification in Indian Scripts

ICDAR '09 Proceedings of the 2009 10th International Conference on Document Analysis and Recognition
Template Matching Algorithm for Gujrati Character Recognition

ICETET '09 Proceedings of the 2009 Second International Conference on Emerging Trends in Engineering & Technology
Gujarati handwritten numeral optical character reorganization through neural network

Pattern Recognition
2 directional 2 dimensional pairwise FLD for handwritten Kannada numeral recognition

ICADL'07 Proceedings of the 10th international conference on Asian digital libraries: looking back 10 years and forging new frontiers
Recognition of isolated handwritten Kannada numerals based on image fusion method

PReMI'07 Proceedings of the 2nd international conference on Pattern recognition and machine intelligence
Handwritten character recognition of popular south Indian scripts

SACH'06 Proceedings of the 2006 conference on Arabic and Chinese handwriting recognition
Adapting Moments for Handwritten Kannada Kagunita Recognition

ICMLC '10 Proceedings of the 2010 Second International Conference on Machine Learning and Computing
Isolated Handwritten Malayalam Character Recognition Using HLH Intensity Patterns

ICMLC '10 Proceedings of the 2010 Second International Conference on Machine Learning and Computing
A novel framework for automatic sorting of postal documents with multi-script address blocks

Pattern Recognition
Holistic Urdu Handwritten Word Recognition Using Support Vector Machine

ICPR '10 Proceedings of the 2010 20th International Conference on Pattern Recognition
Script Recognition—A Review

IEEE Transactions on Pattern Analysis and Machine Intelligence
A new scheme for unconstrained handwritten text-line segmentation

Pattern Recognition
Off-line Recognition of Hand-Written Bengali Numerals Using Morphological Features

ICFHR '10 Proceedings of the 2010 12th International Conference on Frontiers in Handwriting Recognition
Word Segmentation and Baseline Detection in Handwritten Documents Using Isothetic Covers

ICFHR '10 Proceedings of the 2010 12th International Conference on Frontiers in Handwriting Recognition
Pre and Post Processing Approaches in Edge Detection for Character Recognition

ICFHR '10 Proceedings of the 2010 12th International Conference on Frontiers in Handwriting Recognition
On recognition of handwritten bangla characters

ICVGIP'06 Proceedings of the 5th Indian conference on Computer Vision, Graphics and Image Processing
Offline Recognition of Devanagari Script: A Survey

IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews

A system for offline character recognition using auto-encoder networks

ICONIP'12 Proceedings of the 19th international conference on Neural Information Processing - Volume Part IV
A data acquisition and analysis system for palm leaf documents in Telugu

Proceeding of the workshop on Document Analysis and Recognition
The optical character recognition of Urdu-like cursive scripts

Pattern Recognition

Quantified Score

Hi-index	0.00

Visualization

Abstract

Offline handwriting recognition in Indian regional scripts is an interesting area of research as almost 460 million people in India use regional scripts. The nine major Indian regional scripts are Bangla (for Bengali and Assamese languages), Gujarati, Kannada, Malayalam, Oriya, Gurumukhi (for Punjabi language), Tamil, Telugu, and Nastaliq (for Urdu language). A state-of-the-art survey about the techniques available in the area of offline handwriting recognition (OHR) in Indian regional scripts will be of a great aid to the researchers in the subcontinent and hence a sincere attempt is made in this article to discuss the advancements reported in this regard during the last few decades. The survey is organized into different sections. A brief introduction is given initially about automatic recognition of handwriting and official regional scripts in India. The nine regional scripts are then categorized into four subgroups based on their similarity and evolution information. The first group contains Bangla, Oriya, Gujarati and Gurumukhi scripts. The second group contains Kannada and Telugu scripts and the third group contains Tamil and Malayalam scripts. The fourth group contains only Nastaliq script (Perso-Arabic script for Urdu), which is not an Indo-Aryan script. Various feature extraction and classification techniques associated with the offline handwriting recognition of the regional scripts are discussed in this survey. As it is important to identify the script before the recognition step, a section is dedicated to handwritten script identification techniques. A benchmarking database is very important for any pattern recognition related research. The details of the datasets available in different Indian regional scripts are also mentioned in the article. A separate section is dedicated to the observations made, future scope, and existing difficulties related to handwriting recognition in Indian regional scripts. We hope that this survey will serve as a compendium not only for researchers in India, but also for policymakers and practitioners in India. It will also help to accomplish a target of bringing the researchers working on different Indian scripts together. Looking at the recent developments in OHR of Indian regional scripts, this article will provide a better platform for future research activities.