Information Processing and Management: an International Journal
Computer Evaluation of Indexing and Text Processing
Journal of the ACM (JACM)
An infrastructure for open latent semantic linking
Proceedings of the thirteenth ACM conference on Hypertext and hypermedia
Modern Information Retrieval
A look at some issues during textual linking of homogeneous web repositories
Proceedings of the 2004 ACM symposium on Document engineering
ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition
Prototyping Applications to Document Human Experiences
IEEE Pervasive Computing
Automatically linking live experiences captured with a ubiquitous infrastructure
Multimedia Tools and Applications
Effect of OCR error correction on Arabic retrieval
Information Retrieval
Hi-index | 0.00 |
Robust Information Retrieval (IR) systems have been demanded due to the widespread and multipurpose use of document images, and the high number of document images repositories available nowadays. This paper presents a novel approach to support the automatic generation of relationships among document images by exploiting Latent Semantic Indexing (LSI) and Optical Character Recognition (OCR). The LinkDI service extracts and indexes document images content, obtains its latent semantics, and defines relationships among images as hyperlinks. LinkDI was experimented with document images repositories, and its performance was evaluated by comparing the quality of the relationships created among textual documents and among their respective document images. Results show the feasibility of LinkDI relating OCR output with high degradation.