A search engine for imaged documents in PDF files

  • Authors:
  • Yue Lu;Li Zhang;Chew Lim Tan

  • Affiliations:
  • National University of Singapore, Kent Ridge, Singapore;National University of Singapore, Kent Ridge, Singapore;National University of Singapore, Kent Ridge, Singapore

  • Venue:
  • Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

Large quantities of documents in the Internet and digital libraries are simply scanned and archived in image format, many of which are packed in PDF files. The word search tool provided by Adobe Reader/Acrobat does not work for these imaged documents. In this paper, we present a search engine to deal with this issue for imaged documents in PDF files. The experimental results show an encouraging performance.