Effects of OCR errors on ranking and feedback using the vector space model
Information Processing and Management: an International Journal
Evaluation of Interest Point Detectors
International Journal of Computer Vision - Special issue on a special section on visual surveillance
Devising Interactive Access Techniques for Indian Language Document Images
ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 2
Video Google: A Text Retrieval Approach to Object Matching in Videos
ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
Distinctive Image Features from Scale-Invariant Keypoints
International Journal of Computer Vision
Scalable Recognition with a Vocabulary Tree
CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
Word spotting for historical documents
International Journal on Document Analysis and Recognition
Context-Sensitive Error Correction: Using Topic Models to Improve OCR
ICDAR '07 Proceedings of the Ninth International Conference on Document Analysis and Recognition - Volume 02
Content-level Annotation of Large Collection of Printed Document Images
ICDAR '07 Proceedings of the Ninth International Conference on Document Analysis and Recognition - Volume 02
Introduction to Information Retrieval
Introduction to Information Retrieval
Matching word images for content-based retrieval from printed document images
International Journal on Document Analysis and Recognition
Using topic models for OCR correction
International Journal on Document Analysis and Recognition - Special Issue NOISY
Efficient search in document image collections
ACCV'07 Proceedings of the 8th Asian conference on Computer vision - Volume Part I
Evaluating models of latent document semantics in the presence of OCR errors
EMNLP '10 Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
Experiences of integration and performance testing of multilingual OCR for printed Indian scripts
Proceedings of the 2011 Joint Workshop on Multilingual OCR and Analytics for Noisy Unstructured Text Data
Searching OCR'ed Text: An LDA Based Approach
ICDAR '11 Proceedings of the 2011 International Conference on Document Analysis and Recognition
Word Image Retrieval Using Bag of Visual Words
DAS '12 Proceedings of the 2012 10th IAPR International Workshop on Document Analysis Systems
An Efficient Framework for Searching Text in Noisy Document Images
DAS '12 Proceedings of the 2012 10th IAPR International Workshop on Document Analysis Systems
Hi-index | 0.00 |
In this paper, we propose a framework for content level access to the scanned pages of Digital Library of India (DLI). The current Optical Character Recognition (OCR) systems are not robust and reliable enough for generating accurate text from DLI pages. We propose a search scheme which fuses noisy OCR output and holistic visual features for content level access to the DLI pages. Visual content is captured using Bag of Visual Words (BoVW) approach. We show that our fusion scheme improves over the individual methods in terms of mean Average Precision (mAP) and mean precision at 10 (mPrec@10). We exploit the fact that OCR has a high precision while BoVW has a high recall. We use a modified edit distance to improve the order of results ranked by BoVW. Experiments are carried out on large datasets of DLI pages in Hindi and Telugu languages. We validate our method on more than 10,000 pages and 4 Million words, and report a mAP of around 0.8 and mPrec@10 of more than 0.9. We show improvements over BoVW by introducing query expansion. We also demonstrate a textual query interface for the search system.