A statistical global feature extraction method for optical font recognition
ACIIDS'11 Proceedings of the Third international conference on Intelligent information and database systems - Volume Part I
A novel statistical feature extraction method for textual images: Optical font recognition
Expert Systems with Applications: An International Journal
An empirical intrinsic mode based characterization of Indian scripts
Proceeding of the workshop on Document Analysis and Recognition
Multilingual OCR research and applications: an overview
Proceedings of the 4th International Workshop on Multilingual OCR
Arabic font recognition based on diacritics features
Pattern Recognition
Hi-index | 0.00 |
Automatic identification of a script in a given document image facilitates many important applications such as automatic archiving of multilingual documents, searching online archives of document images and for the selection of script-specific OCR in a multi-lingual environment. In this paper, we model script identification as a texture classification problem and examine a global approach inspired by human visual perception. A generalised, hierarchical framework is proposed for script identification. A set of energy and intensity space features for this task is also presented. The framework serves to establish the utility of a global approach to the classification of scripts. The framework has been tested on two datasets: 10 Indian and 13 world scripts. The obtained accuracy of identification across the two datasets is above 94%. The results demonstrate that the framework can be used to develop solutions for script identification from document images across a large set of script classes.