Comparing canonicalizations of historical German text

  • Authors: Bryan Jurish
  • Affiliations: Berlin-Brandenburg Academy of Sciences, Berlin, Germany
  • Venue: SIGMORPHON '10: Proceedings of the 11th Meeting of the ACL Special Interest Group on Computational Morphology and Phonology
  • Year: 2010

Abstract

Historical text presents numerous challenges for contemporary natural language processing techniques. In particular, the absence of consistent orthographic conventions in historical text presents difficulties for any system requiring reference to a static lexicon accessed by orthographic form. In this paper, we present three methods for associating unknown historical word forms with synchronically active canonical cognates and evaluate their performance on an information retrieval task over a manually annotated corpus of historical German verse.