Comparing Natural and Synthetic Training Data for Off-Line Cursive Handwriting Recognition

Authors:
Tamas Varga;Horst Bunke
Affiliations:
Universität Bern;Universität Bern
Venue:
IWFHR '04 Proceedings of the Ninth International Workshop on Frontiers in Handwriting Recognition
Year:
2004

Citing 0
Cited 2

Technical Section: Neural network-based symbol recognition using a few labeled samples

Computers and Graphics
Improving classification for microarray data sets by constructing synthetic data

CIS'05 Proceedings of the 2005 international conference on Computational Intelligence and Security - Volume Part I

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, a perturbation model for the generation of synthetic textlines from existing cursively handwritten lines of text, produced by human writers, is presented. The goal of synthetic textline generation is to improve the performance of an off-line cursive handwriting recognition system by providing it with additional, synthetic training data. In earlier papers, it has been shown that it is possible to improve the recognition performance by using such synthetically expanded training sets. In this paper, we investigate the suitability of synthetically generated handwriting when enlarging the training set of a handwriting recognition system in a more rigorous way. In particular, the improvements achieved with synthetic training data are compared to those achieved by expanding the training set using natural, i.e. human written, textlines.