Abstract: A single document page may contain text in several scripts. For Optical Character Recognition (OCR) of such a page, the scripts must be separated before each is passed to its own OCR system. This paper proposes an automatic technique for identifying printed Roman, Chinese, Arabic, Devnagari and Bangla text lines within a single document. Script identification uses shape-based features, statistical features, and features derived from the water reservoir concept. The proposed scheme achieves an accuracy of about 97.33%.
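The abstract names the water reservoir concept without defining it. As the term is commonly described, the cavities of a component are treated as reservoirs that would hold water poured onto it from one side. The Python sketch below is a minimal illustration under that assumption and is not the authors' implementation. The function name `top_reservoir_area`, the list-of-lists binary image format (1 = ink, 0 = background), and the simplification of using only the upper profile of the component (everything below the topmost ink pixel in a column is treated as solid) are all assumptions made for this example.

```python
from typing import List


def top_reservoir_area(image: List[List[int]]) -> int:
    """Count background pixels that would hold water poured from the top.

    Hypothetical helper: `image` is a binary grid (1 = ink, 0 = background).
    Simplification: only the upper profile of the component is used, so
    enclosed holes below the first ink pixel of a column are ignored.
    """
    rows = len(image)
    cols = len(image[0]) if rows else 0
    if cols == 0:
        return 0

    # Height of the ink column, measured from the image bottom up to the
    # topmost ink pixel (0 if the column contains no ink).
    heights = []
    for c in range(cols):
        top = next((r for r in range(rows) if image[r][c]), rows)
        heights.append(rows - top)

    # Highest wall seen so far when scanning from the left and from the right.
    left_max = [0] * cols
    right_max = [0] * cols
    running = 0
    for c in range(cols):
        running = max(running, heights[c])
        left_max[c] = running
    running = 0
    for c in range(cols - 1, -1, -1):
        running = max(running, heights[c])
        right_max[c] = running

    # Water above a column is limited by the lower of the two bounding walls.
    return sum(min(left_max[c], right_max[c]) - heights[c] for c in range(cols))


# A cup shape open at the top, and the same shape turned upside down.
cup = [
    [1, 0, 0, 1],
    [1, 0, 0, 1],
    [1, 1, 1, 1],
]
arch = cup[::-1]

top_of_cup = top_reservoir_area(cup)
top_of_arch = top_reservoir_area(arch)
# Flipping the image vertically measures the reservoir seen from the bottom.
bottom_of_arch = top_reservoir_area(arch[::-1])
```

In this sketch the cup holds water from the top while the arch holds none, and flipping the arch vertically measures its bottom reservoir instead. Differences of this kind between top and bottom reservoirs are the sort of cue such a feature could contribute to script identification; the actual features used in the paper are not specified in the abstract.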