On the Recognition of Printed Characters of Any Font and Size
IEEE Transactions on Pattern Analysis and Machine Intelligence
The indexing and retrieval of document images: a survey
Computer Vision and Image Understanding - Special issue on document image understanding and retrieval
Quality of OCR for degraded text images
Proceedings of the fourth ACM conference on Digital libraries
Twenty Years of Document Image Analysis in PAMI
IEEE Transactions on Pattern Analysis and Machine Intelligence
A Statistical, Nonparametric Methodology for Document Degradation Model Validation
IEEE Transactions on Pattern Analysis and Machine Intelligence
Digital Image Processing
A New Textual/Non-Textual Classifier for Document Skew Correction
ICPR '02 Proceedings of the 16 th International Conference on Pattern Recognition (ICPR'02) Volume 1 - Volume 1
A Bilingual OCR for Hindi-Telugu Documents and its Applications
ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 1
Recognition of Printed Amharic Documents
ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition
Learning to segment document images
PReMI'05 Proceedings of the First international conference on Pattern Recognition and Machine Intelligence
A new generation of textual corpora: mining corpora from very large collections
Proceedings of the 7th ACM/IEEE-CS joint conference on Digital libraries
Self adaptable recognizer for document image collections
PReMI'07 Proceedings of the 2nd international conference on Pattern recognition and machine intelligence
Hi-index | 0.00 |
This paper presents a novel approach for designing a semi-automatic adaptive OCR for large document image collections in digital libraries. We describe an interactive system for continuous improvement of the results of the OCR. In this paper a semi-automatic and adaptive system is implemented. Applicability of our design for the recognition of Indian Languages is demonstrated. Recognition errors are used to train the OCR again so that it adapts and learns for improving its accuracy. Limited human intervention is allowed for evaluating the output of the system and take corrective actions during the recognition process.