Block Merging for Off-Line Compression

  • Authors:
  • Raymond Wan;Alistair Moffat

  • Affiliations:
  • -;-

  • Venue:
  • CPM '02 Proceedings of the 13th Annual Symposium on Combinatorial Pattern Matching
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

To bound memory consumption, most compression systems provide a facility that controls the amount of data that may be processed at once. In this work we consider the RE-PAIR mechanism of Larsson and Moffat [2000], which processes large messages as disjoint blocks. We show that the blocks emitted by RE-PAIR can be post-processed to yield further savings, and describe techniques that allow files of 500 MB or more to be compressed in a holistic manner using less than that much main memory. The block merging process we describe has the additional advantage of allowing new text to be appended to the end of the compressed file.