Testing the correlation of word error rate and perplexity

  • Authors:
  • Dietrich Klakow;Jochen Peters

  • Affiliations:
  • Philips GmbH Forschungslaboratorien, Weisshausstr.2, D-52066 Aachen, Germany;Philips GmbH Forschungslaboratorien, Weisshausstr.2, D-52066 Aachen, Germany

  • Venue:
  • Speech Communication
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

Many groups have investigated the relationship of word error rate and perplexity of language models. This issue is of central interest because perplexity optimization can be done independent of a recognizer and in most cases it is possible to find simple perplexity optimization procedures. Moreover, many tasks in language model training such as the optimization of word classes may use perplexity as target function resulting in explicit optimization formulas which are not available if error rates are used as target. This paper first presents some theoretical arguments for a close relationship between perplexity and word error rate. Thereafter the notion of uncertainty of a measurement is introduced and is then used to test the hypothesis that word error rate and perplexity are correlated by a power law. There is no evidence to reject this hypothesis.