This paper describes the classification of a subset of printed or digitized Gujarati characters. Gujarati belongs to the family of Devanagari scripts from the Indian subcontinent. Very little work on the recognition of Indian language scripts is found in the literature. For this paper, a subset of similar-looking Gujarati characters was chosen and classified using several different classifiers. The sample and test images for the characters were obtained from digital images available on the Internet and from scanned images of printed Gujarati text. The Euclidean Minimum Distance and k-Nearest Neighbor classifiers were applied to features computed from regular and invariant moments. The characters were also classified in the binary feature space using a Hamming Distance classifier. The paper presents the recognition rates for these classifiers. A recognition rate of 67% is achieved. The work described in this paper is preliminary; however, since ICDAR'99 is being held in India, we hope that it will be of interest to the participants.
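The three classifier types named in the abstract can be illustrated with a minimal sketch. This is not the paper's implementation: the function names, the two-dimensional toy feature vectors (standing in for moment features), the four-bit templates, and the choice of k = 3 are all assumptions made for illustration, since the abstract does not give these details.

```python
from collections import Counter
from math import dist


def min_distance_classify(x, class_means):
    """Return the label whose mean feature vector is nearest to x (Euclidean)."""
    return min(class_means, key=lambda label: dist(x, class_means[label]))


def knn_classify(x, samples, k=3):
    """Majority vote among the k labeled samples nearest to x (Euclidean)."""
    nearest = sorted(samples, key=lambda s: dist(x, s[0]))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


def hamming_classify(bits, templates):
    """Return the label whose binary template differs from bits in the fewest positions."""
    def hamming(a, b):
        return sum(p != q for p, q in zip(a, b))
    return min(templates, key=lambda label: hamming(bits, templates[label]))


# Hypothetical toy data: two classes, "A" and "B".
means = {"A": (0.0, 0.0), "B": (4.0, 4.0)}
samples = [
    ((0.1, 0.2), "A"),
    ((0.3, -0.1), "A"),
    ((3.9, 4.2), "B"),
    ((4.1, 3.8), "B"),
    ((3.7, 4.0), "B"),
]
templates = {"A": (1, 1, 0, 0), "B": (0, 0, 1, 1)}

print(min_distance_classify((1.0, 0.5), means))
print(knn_classify((3.5, 3.5), samples, k=3))
print(hamming_classify((1, 0, 0, 0), templates))
```

In a real system the feature vectors would be moments computed from each character image, and the binary templates would be thresholded pixel grids rather than four-bit tuples.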