An Overhead Reduction Technique for Mega-State Compression Schemes

  • Authors:
  • A. Bookstein;S. T. Klein;T. Raita

  • Affiliations:
  • -;-;-

  • Venue:
  • DCC '97 Proceedings of the Conference on Data Compression
  • Year:
  • 1997

Quantified Score

Hi-index 0.00

Visualization

Abstract

Many of the most effective compression methods involve complicated models. Unfortunately, as model complexity increases, so does the cost of storing the model itself. This paper examines a method to reduce the amount of storage needed to represent a Markov model with an extended alphabet, by applying a clustering scheme that brings together similar states. Experiments run on a variety of large natural language texts show that much of the overhead of storing the model can be saved at the cost of a very small loss of compression efficiency.