We propose an approach for efficient word retrieval from printed documents belonging to Digital Libraries. The approach combines word image clustering (based on Self-Organizing Maps, SOM) with Principal Component Analysis (PCA). Combining these methods allows us to efficiently retrieve matching words from large document collections without directly comparing the query word with each indexed word.
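
The combination of clustering and projection described above can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes each word image has already been converted to a fixed-length feature vector, and the function names (train_som, best_unit, build_index, query), the grid size, and the learning schedule are all hypothetical choices made for illustration. The SOM groups indexed words into clusters, a PCA is fitted per cluster, and a query is compared only with the members of its best-matching cluster.

```python
import numpy as np


def train_som(data, rows=3, cols=3, epochs=20, lr0=0.5, sigma0=1.5, seed=0):
    """Train a small rectangular SOM; returns weights of shape (rows*cols, dim)."""
    rng = np.random.default_rng(seed)
    # Grid coordinates of each unit, used by the neighbourhood function.
    grid = np.array([(r, c) for r in range(rows) for c in range(cols)], dtype=float)
    # Initialise units from randomly chosen training vectors (assumed scheme).
    weights = data[rng.integers(0, len(data), size=rows * cols)].astype(float)
    total_steps = epochs * len(data)
    step = 0
    for _ in range(epochs):
        for idx in rng.permutation(len(data)):
            frac = step / total_steps
            lr = lr0 * (1.0 - frac)                 # linearly decaying learning rate
            sigma = sigma0 * (1.0 - frac) + 1e-3    # shrinking neighbourhood radius
            x = data[idx]
            bmu = np.argmin(np.sum((weights - x) ** 2, axis=1))
            grid_dist2 = np.sum((grid - grid[bmu]) ** 2, axis=1)
            h = np.exp(-grid_dist2 / (2.0 * sigma ** 2))
            weights += lr * h[:, None] * (x - weights)
            step += 1
    return weights


def best_unit(weights, x):
    """Index of the SOM unit closest to x (the best-matching unit)."""
    return int(np.argmin(np.sum((weights - x) ** 2, axis=1)))


def build_index(data, weights, n_components=2):
    """Group vectors by best-matching unit and fit one PCA per non-empty cluster."""
    assignments = np.array([best_unit(weights, x) for x in data])
    index = {}
    for unit in np.unique(assignments):
        ids = np.flatnonzero(assignments == unit)
        members = data[ids]
        mean = members.mean(axis=0)
        # PCA via SVD of the centred cluster; rows of vt are principal axes.
        _, _, vt = np.linalg.svd(members - mean, full_matrices=False)
        axes = vt[:n_components]
        index[int(unit)] = (ids, mean, axes, (members - mean) @ axes.T)
    return index


def query(index, weights, x, top_k=3):
    """Rank only the members of the query's cluster, in that cluster's PCA space."""
    unit = best_unit(weights, x)
    if unit not in index:
        return []  # no indexed word fell into this unit
    ids, mean, axes, projected = index[unit]
    q = (x - mean) @ axes.T
    order = np.argsort(np.sum((projected - q) ** 2, axis=1))[:top_k]
    return [int(ids[i]) for i in order]


if __name__ == "__main__":
    rng = np.random.default_rng(42)
    # Synthetic stand-in for word-image feature vectors (not real data).
    vectors = rng.normal(size=(200, 16))
    som = train_som(vectors)
    idx = build_index(vectors, som)
    print(query(idx, som, vectors[0]))
```

In this sketch a query costs one comparison per SOM unit plus one per member of the selected cluster, rather than one per indexed word. A fuller version would probably also search neighbouring units when the best-matching cluster is empty or very small.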