Comparing Natural and Synthetic Training Data for Off-Line Cursive Handwriting Recognition

  • Authors:
  • Tamas Varga;Horst Bunke

  • Affiliations:
  • Universität Bern;Universität Bern

  • Venue:
  • IWFHR '04 Proceedings of the Ninth International Workshop on Frontiers in Handwriting Recognition
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, a perturbation model for the generation of synthetic textlines from existing cursively handwritten lines of text, produced by human writers, is presented. The goal of synthetic textline generation is to improve the performance of an off-line cursive handwriting recognition system by providing it with additional, synthetic training data. In earlier papers, it has been shown that it is possible to improve the recognition performance by using such synthetically expanded training sets. In this paper, we investigate the suitability of synthetically generated handwriting when enlarging the training set of a handwriting recognition system in a more rigorous way. In particular, the improvements achieved with synthetic training data are compared to those achieved by expanding the training set using natural, i.e. human written, textlines.