Text Compression via Alphabet Re-Representation

  • Authors:
  • Philip M. Long;Apostol I. Natsev;Jeffrey Scott Vitter

  • Affiliations:
  • -;-;-

  • Venue:
  • DCC '97 Proceedings of the Conference on Data Compression
  • Year:
  • 1997

Quantified Score

Hi-index 0.00

Visualization

Abstract

We consider re-representing the alphabet so that a representation of a character reflects its properties as a predictor of future text. This enables us to use an estimator from a restricted class to map contexts to predictions of upcoming characters. We describe an algorithm that uses this idea in conjunction with neural networks. The performance of this implementation is compared to other compression methods, such as UNIX compress, gzip, PPMC, and an alternative neural network approach.