Heuristic compression of an English word list: Research Articles

  • Authors:
  • Robert P. Cook

  • Affiliations:
  • Department of Computer Sciences, Georgia Southern University, P.O. Box 7997, Statesboro, GA 30460, U.S.A.

  • Venue:
  • Software—Practice & Experience
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

The goal of the project was to design and implement an English word-list representation suitable for spell-checking in space-constrained environments. The compression algorithm was derived by statistically analyzing the word list. A compression ratio of 18% was achieved through a combination of prefix and suffix encoding. The compressed file can be randomly accessed by prefix marker positions. A simple spell-checker based on the encoding was implemented and tested in Java. Copyright © 2005 John Wiley & Sons, Ltd.