The effects of noisy data on text retrieval
Journal of the American Society for Information Science
Content-based retrieval for music collections
Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Automatic cataloguing and searching for retrospective data by use of OCR text
Journal of the American Society for Information Science and Technology
Automatic thesaurus generation for Chinese documents
Journal of the American Society for Information Science and Technology
An unsupervised and data-driven approach for spell checking in Vietnamese OCR-scanned texts
HYBRID '12 Proceedings of the Workshop on Innovative Hybrid Approaches to the Processing of Textual Data
Hi-index | 0.00 |
This article proposes a technique for correcting Chinese OCR errors to support retrieval of scanned documents. The technique uses a completely automatic technique (no manually constructed lexicons or confusion resources) to identify both keywords and confusable terms. Improved retrieval effectiveness on a single term query experiment is demonstrated.