N-Gram Language Models for Offline Handwritten Text Recognition

  • Authors:
  • Matthias Zimmermann;Horst Bunke

  • Affiliations:
  • University of Bern;University of Bern

  • Venue:
  • IWFHR '04 Proceedings of the Ninth International Workshop on Frontiers in Handwriting Recognition
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper investigates the impact of bigram and trigram language models on the performance of a Hidden Markov Model (HMM) based offline recognition system for handwritten sentences. The language models are trained on the LOB corpus which is supplemented by various additional sources of text, including sentences from additional corpora and random sentences produced by a stochastic context-free grammar (SCFG). Experimental results are provided in terms of test set perplexity and performance of the corresponding recognition systems. For the text recognition experiments handwritten material from the IAM database has been used.