Performance evaluation of end-to-end OCR systems for Indic scripts requires matching the Unicode sequences of the OCR output against the ground truth. In the literature, Levenshtein edit distance has been used to compute the error rates of OCR systems, but accuracies are not explicitly reported. In the present work, we propose an accuracy measure based on edit distance and use it in conjunction with the error rate to report the performance of an OCR system. We analyze the relationship between accuracy and error rate quantitatively. Our analysis shows that accuracy and error rate are independent of each other, so both are needed to report the complete performance of an OCR system. The proposed approach is applicable to all Indic scripts, and experimental results are shown for several of them, including Devanagari, Telugu, and Kannada.
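
The abstract does not spell out the formulas, so the following is only a minimal sketch under stated assumptions, not the authors' definition. It aligns the ground truth and OCR output as Unicode code point sequences with Levenshtein edit distance, then assumes accuracy = matched characters / ground-truth length and error rate = (substitutions + insertions + deletions) / ground-truth length. The function names `edit_counts` and `accuracy_and_error` are hypothetical.

```python
def edit_counts(truth: str, output: str) -> dict:
    """Count matches, substitutions, insertions and deletions along one
    minimum-cost Levenshtein alignment (Python str = code point sequence)."""
    n, m = len(truth), len(output)
    # dist[i][j] = edit distance between truth[:i] and output[:j]
    dist = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        dist[i][0] = i
    for j in range(1, m + 1):
        dist[0][j] = j
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = 0 if truth[i - 1] == output[j - 1] else 1
            dist[i][j] = min(dist[i - 1][j - 1] + sub,  # match / substitute
                             dist[i - 1][j] + 1,        # delete from truth
                             dist[i][j - 1] + 1)        # insert into output

    # Backtrace, preferring diagonal moves. Ties between optimal alignments
    # are broken this way by assumption; other choices can change the counts.
    counts = {"match": 0, "sub": 0, "ins": 0, "del": 0}
    i, j = n, m
    while i > 0 or j > 0:
        same = i > 0 and j > 0 and truth[i - 1] == output[j - 1]
        if i > 0 and j > 0 and dist[i][j] == dist[i - 1][j - 1] + (0 if same else 1):
            counts["match" if same else "sub"] += 1
            i, j = i - 1, j - 1
        elif i > 0 and dist[i][j] == dist[i - 1][j] + 1:
            counts["del"] += 1
            i -= 1
        else:
            counts["ins"] += 1
            j -= 1
    return counts


def accuracy_and_error(truth: str, output: str) -> tuple:
    """Return (accuracy, error_rate), both normalized by ground-truth length.
    These normalizations are assumptions made for illustration."""
    if not truth:
        raise ValueError("ground truth must be non-empty")
    c = edit_counts(truth, output)
    accuracy = c["match"] / len(truth)
    error_rate = (c["sub"] + c["ins"] + c["del"]) / len(truth)
    return accuracy, error_rate
```

Under these assumed definitions the two numbers do not sum to one: a spurious inserted character raises the error rate without lowering accuracy, so `accuracy_and_error("abc", "abxc")` gives an accuracy of 1.0 together with a non-zero error rate. This is one way to see why reporting both figures can be informative.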