Abstract: Making use of retrospective documents is important. OCR is the most widely applied technology for this purpose; however, because OCR output contains recognition errors, error-tolerant methods are essential for working with OCR-processed documents. This paper addresses a filtering problem for OCR-processed documents, so that large numbers of such documents can be handled in an error-tolerant way. We propose a systematic index design method for filtering and show that it speeds up filtering by a factor of about 360 on a database of about two million records, with little decrease in accuracy.
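The abstract does not describe the index design itself, so the sketch below is only a generic illustration of what index-based, error-tolerant filtering can look like. It uses a q-gram count filter, a standard approximate-matching technique that is swapped in here for illustration and is not necessarily the authors' method. All names (`qgrams`, `build_index`, `filter_candidates`) and the sample records are hypothetical.

```python
from collections import defaultdict


def qgrams(text, q=2):
    """Return the overlapping q-grams of text, in order."""
    return [text[i:i + q] for i in range(len(text) - q + 1)]


def build_index(records, q=2):
    """Map each q-gram to the set of record ids that contain it."""
    index = defaultdict(set)
    for rec_id, text in enumerate(records):
        for gram in qgrams(text, q):
            index[gram].add(rec_id)
    return index


def filter_candidates(index, num_records, query, q=2, max_errors=1):
    """Return ids of records that may contain query within max_errors edits.

    Count-filter bound: one edit destroys at most q of the query's q-grams,
    so a true match must still share at least len(grams) - q * max_errors.
    Records below that bound are discarded without being read.
    """
    grams = qgrams(query, q)
    threshold = len(grams) - q * max_errors
    if threshold <= 0:
        # The bound prunes nothing; every record remains a candidate.
        return set(range(num_records))
    counts = defaultdict(int)
    for gram in grams:
        for rec_id in index.get(gram, ()):
            counts[rec_id] += 1
    return {rec_id for rec_id, n in counts.items() if n >= threshold}


# Hypothetical records; the second simulates the OCR confusion "m" -> "rn".
records = [
    "document image retrieval",
    "docurnent irnage retrieval",
    "query driven word retrieval",
    "density distribution",
]
index = build_index(records)
print(sorted(filter_candidates(index, len(records), "document", max_errors=2)))
# prints [0, 1]
```

The filter only narrows the search: the surviving candidates would still need verification with an exact error-tolerant comparison such as edit distance. The speedup comes from skipping most records, and the choice of `q` and `max_errors` trades pruning power against the risk of passing too many candidates through.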