Keyword-guided word spotting in historical printed documents using synthetic data and user feedback

  • Authors:
  • T. Konidaris; B. Gatos; K. Ntzios; I. Pratikakis; S. Theodoridis; S. J. Perantonis

  • Affiliations:
  • National Center for Scientific Research “Demokritos”, Computational Intelligence Laboratory, Institute of Informatics and Telecommunications, Athens, Greece; National Center for Scientific Research “Demokritos”, Computational Intelligence Laboratory, Institute of Informatics and Telecommunications, Athens, Greece; Natnl. Ctr. for Sci. Res. “Demokritos”, Computnl. Intell. Lab., Inst. of Inform. and Telecomm., Athens, Greece and Natnl. & Kapodistrian Univ. of Athens, Dept. of Inform. and Telecom ...; National Center for Scientific Research “Demokritos”, Computational Intelligence Laboratory, Institute of Informatics and Telecommunications, Athens, Greece; National & Kapodistrian University of Athens, Department of Informatics and Telecommunications, Athens, Greece; National Center for Scientific Research “Demokritos”, Computational Intelligence Laboratory, Institute of Informatics and Telecommunications, Athens, Greece

  • Venue:
  • International Journal on Document Analysis and Recognition
  • Year:
  • 2007

Abstract

In this paper, we propose a novel technique for word spotting in historical printed documents that combines synthetic data with user feedback. Our aim is to search for keywords typed by the user in a large collection of digitized historical printed documents. The proposed method consists of the following stages: (1) creation of synthetic word images; (2) word segmentation using dynamic parameters; (3) efficient feature extraction for each word image; and (4) a retrieval procedure that is optimized by user feedback. Experimental results demonstrate the efficiency of the proposed approach.
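
The feedback-driven retrieval of stage (4) can be illustrated with a minimal sketch. The abstract does not specify the features, the distance measure, or the feedback rule, so everything below is an assumption made for illustration: word images are taken to be already reduced to fixed-length feature vectors, ranking uses Euclidean distance, and feedback simply averages the query vector with the vectors of the words the user marked as correct. The names `rank`, `refine`, and the toy data are hypothetical and are not taken from the paper.

```python
from math import sqrt


def l2(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def rank(query, features):
    """Return word ids ordered from most to least similar to the query."""
    return sorted(features, key=lambda word_id: l2(query, features[word_id]))


def refine(query, features, relevant_ids):
    """Assumed feedback rule: average the query with user-approved words."""
    vectors = [query] + [features[word_id] for word_id in relevant_ids]
    return [sum(column) / len(vectors) for column in zip(*vectors)]


# Toy data (hypothetical): the query vector stands in for the features of a
# synthetic word image; the dictionary stands in for segmented document words.
query = [0.0, 0.0]
features = {
    "w1": [1.0, 0.0],
    "w2": [0.0, 1.2],
    "w3": [3.0, 3.0],
    "w4": [2.0, 0.1],
}

first_pass = rank(query, features)
# Suppose the user confirms w1 and w4 as true occurrences of the keyword.
refined_query = refine(query, features, ["w1", "w4"])
second_pass = rank(refined_query, features)
```

In this toy run the confirmed word `w4` moves ahead of the unconfirmed `w2` in the second pass, which is the intended effect of letting real document instances, rather than the synthetic query alone, drive the search.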