Information Retrieval in Document Image Databases
IEEE Transactions on Knowledge and Data Engineering
Quantifying information leakage in document redaction
Proceedings of the 1st ACM workshop on Hardcopy document processing
Hi-index | 0.00 |
We combine information from a language model and character image pattern matching to iteratively reduce ambiguity in document images. Combining word shape information and lists of similar bitmap patterns in a document at least partially resolves the character content without optical character recognition. We present the output in various ways. suitable for human readers or for differing downstream processes.