Language identification for interactive handwriting transcription of multilingual documents
IbPRIA'11 Proceedings of the 5th Iberian conference on Pattern recognition and image analysis
Lampung - a new handwritten character benchmark: database, labeling and recognition
Proceedings of the 2011 Joint Workshop on Multilingual OCR and Analytics for Noisy Unstructured Text Data
Handwriting Recognition in Indian Regional Scripts: A Survey of Offline Techniques
ACM Transactions on Asian Language Information Processing (TALIP)
Proceeding of the workshop on Document Analysis and Recognition
An empirical intrinsic mode based characterization of Indian scripts
Proceeding of the workshop on Document Analysis and Recognition
A bilingual Gurmukhi-English OCR based on multiple script identifiers and language models
Proceedings of the 4th International Workshop on Multilingual OCR
Hi-index | 0.14 |
A variety of different scripts are used in writing languages throughout the world. In a multiscript, multilingual environment, it is essential to know the script used in writing a document before an appropriate character recognition and document analysis algorithm can be chosen. In view of this, several methods for automatic script identification have been developed so far. They mainly belong to two broad categories—structure-based and visual-appearance-based techniques. This survey report gives an overview of the different script identification methodologies under each of these categories. Methods for script identification in online data and video-texts are also presented. It is noted that the research in this field is relatively thin and still more research is to be done, particularly in the case of handwritten documents.