This paper presents a document retrieval technique that searches document images without OCR (optical character recognition). The proposed technique retrieves document images using a new word shape coding scheme, which captures document content by annotating each word image with a word shape code. In particular, we annotate word images with a set of topological shape features: character ascenders and descenders, character holes, and character water reservoirs. With the annotated word shape codes, document images can be retrieved by either query keywords or a query document image. Experimental results show that the proposed document image retrieval technique is fast, efficient, and tolerant of various types of document degradation.
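The keyword-query path can be illustrated with a minimal sketch. This is not the paper's actual coding scheme: the letter-to-feature tables, the code symbols (`A`, `D`, `H`, `U`, `N`, `x`), and the function names `word_shape_code`, `build_index`, and `search` are all assumptions made for illustration. In a real system the codes would be extracted from word images; here they are simulated from text so that the example is self-contained.

```python
from collections import defaultdict

# Hypothetical feature tables for lowercase Latin letters (illustrative only;
# the actual features would be measured on word images and depend on the font).
ASCENDERS = set("bdfhklt")       # strokes rising above the x-height
DESCENDERS = set("gjpqy")        # strokes dropping below the baseline
HOLES = set("abdegopq")          # closed loops
RESERVOIR_UP = set("uvwy")       # cavity open at the top
RESERVOIR_DOWN = set("hmn")      # cavity open at the bottom


def char_code(ch: str) -> str:
    """Return a feature string for one character, or 'x' if it has none."""
    parts = []
    if ch in ASCENDERS:
        parts.append("A")
    if ch in DESCENDERS:
        parts.append("D")
    if ch in HOLES:
        parts.append("H")
    if ch in RESERVOIR_UP:
        parts.append("U")
    if ch in RESERVOIR_DOWN:
        parts.append("N")
    return "".join(parts) or "x"


def word_shape_code(word: str) -> str:
    """Encode a word as a sequence of per-character feature strings."""
    return "-".join(char_code(c) for c in word.lower() if c.isalpha())


def build_index(docs: dict) -> dict:
    """Map each word shape code to the set of document ids containing it."""
    index = defaultdict(set)
    for doc_id, words in docs.items():
        for w in words:
            index[word_shape_code(w)].add(doc_id)
    return index


def search(index: dict, keyword: str) -> list:
    """Return the sorted ids of documents whose codes match the keyword's code."""
    return sorted(index.get(word_shape_code(keyword), set()))


if __name__ == "__main__":
    docs = {"d1": ["dog", "runs"], "d2": ["cat"]}
    index = build_index(docs)
    print(word_shape_code("dog"))
    print(search(index, "dog"))
```

Under these assumed tables, distinct words such as "dog" and "bog" receive the same code. A coarse shape code trades this kind of ambiguity for robustness, since matching never requires recognizing individual characters.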