The use of error tags in ARTFL's Encyclopédie: does good error identification lead to good error correction?

  • Authors:
  • Derrick Higgins

  • Affiliations:
  • University of Chicago

  • Venue:
  • Proceedings of the workshop on Student research
  • Year:
  • 2000

Quantified Score

Hi-index 0.00

Visualization

Abstract

Many corpora which are prime candidates for automatic error correction, such as the output of OCR software, and electronic texts incorporating markup tags, include information on which portions of the text are most likely to contain errors.This paper describes how the error markup tag is being incorporated in the spell-checking of an electronic version of Diderot's Encyclopédie, and evaluates whether the presence of this tag has significantly aided in correcting the errors which it marks. Although the usefulness of error tagging may vary from project to project, even as the precise way in which the tagging is done varies, error tagging does not necessarily confer any benefit in attempting to correct a given word. It may, of course, nevertheless be useful in marking errors to be fixed manually at a later stage of processing the text.