Searching for historical word-forms in a database of 17th-century English text using spelling-correction methods

  • Authors:
  • Alexander M. Robertson;Peter Willett

  • Affiliations:
  • Department of Information Studies, University of Sheffield, Western Bank, Sheffield, UK, S10 2TN;Department of Information Studies, University of Sheffield, Western Bank, Sheffield, UK, S10 2TN

  • Venue:
  • SIGIR '92 Proceedings of the 15th annual international ACM SIGIR conference on Research and development in information retrieval
  • Year:
  • 1992

Abstract

This paper discusses the application of algorithmic spelling-correction techniques to the identification of those words in a database of 17th-century English text that are most similar to a query word in modern English. The experiments have used n-gram matching, non-phonetic coding and dynamic programming methods for spelling correction, and have demonstrated that high-recall searches can be carried out, although some of the searches are very demanding of computational resources. The methods are, in principle, applicable to historical texts in many languages and from many different periods.
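
As an illustration of two of the technique families named in the abstract, the following minimal sketch ranks candidate historical word-forms against a modern query word. It is not the authors' implementation: the padded-bigram Dice coefficient stands in for n-gram matching, a plain Levenshtein edit distance stands in for the dynamic programming method, and the function names, the padding character and the sample vocabulary are all assumptions made for this example.

```python
from collections import Counter


def ngrams(word, n=2):
    """Return the character n-grams of a word, padded with '#' at both ends."""
    pad = "#" * (n - 1)
    padded = f"{pad}{word.lower()}{pad}"
    return [padded[i:i + n] for i in range(len(padded) - n + 1)]


def dice_similarity(a, b, n=2):
    """Dice coefficient over n-gram multisets (assumed similarity measure)."""
    grams_a, grams_b = Counter(ngrams(a, n)), Counter(ngrams(b, n))
    shared = sum((grams_a & grams_b).values())
    total = sum(grams_a.values()) + sum(grams_b.values())
    return 2 * shared / total if total else 0.0


def edit_distance(a, b):
    """Levenshtein distance computed row by row with dynamic programming."""
    prev = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, 1):
        curr = [i]
        for j, ch_b in enumerate(b, 1):
            curr.append(min(prev[j] + 1,                    # deletion
                            curr[j - 1] + 1,                # insertion
                            prev[j - 1] + (ch_a != ch_b)))  # substitution
        prev = curr
    return prev[-1]


def rank_candidates(query, vocabulary, top_k=5):
    """Rank vocabulary words by n-gram similarity, then by edit distance."""
    scored = [(dice_similarity(query, w), -edit_distance(query, w), w)
              for w in vocabulary]
    scored.sort(reverse=True)
    return [w for _, _, w in scored[:top_k]]


# Hypothetical vocabulary of historical spellings, for illustration only.
print(rank_candidates("music", ["musick", "magick", "water"], top_k=2))
```

Comparing the query against every word in the vocabulary, as this sketch does, hints at why the abstract describes some searches as very demanding of computational resources: the edit-distance step alone costs time proportional to the product of the two word lengths for each comparison.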