Fast Compression with a Static Model in High-Order Entropy

  • Authors:
  • Luca Foschini;Roberto Grossiy;Ankur Guptaz;Jeffrey Scott Vitter

  • Affiliations:
  • -;-;-;-

  • Venue:
  • DCC '04 Proceedings of the Conference on Data Compression
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

We report on a simple encoding format called wzip for decompressing block-sorting transforms, such as the Burrows-Wheeler Transform (BWT). Our compressor uses the simple notions of gamma encoding and RLE, organized witha wavelet tree, to achieve a slightly better compression ratio than bzip2 inless time. In fact, our compression/decompression time is dependent on Hh,the hth order empirical entropy. This relationship of performance to the compressibility of data is a key new idea among compression algorithms. Anotherkey contribution of our compressor is its simplicity. Our compressor can alsooperate as a full-text index with a small amount of data, while still preservingbackward compatibility with just the compressor.