Adapting BLSTM Neural Network Based Keyword Spotting Trained on Modern Data to Historical Documents

  • Authors:
  • Volkmar Frinken; Andreas Fischer; Horst Bunke; R. Manmatha

  • Venue:
  • ICFHR '10 Proceedings of the 2010 12th International Conference on Frontiers in Handwriting Recognition
  • Year:
  • 2010

Abstract

Being able to search for words or phrases in historic handwritten documents is of paramount importance when preserving cultural heritage. Storing scanned pages of written text can save the information from degradation, but it does not make the textual information readily available. Automatic keyword spotting systems for handwritten historic documents can fill this gap. However, most such systems have trouble with the great variety of writing styles, and it is not uncommon for a handwriting processing system to be built for just a single book. In this paper we show that neural network based keyword spotting systems are flexible enough to be used successfully on historic data, even when they are trained on a modern handwriting database. We also demonstrate that adding a small amount of transcribed historic text to the training set further improves performance.