Compressed by the Suffix Tree

  • Authors: Martin Senft
  • Affiliations: Charles University, Czech Republic
  • Venue: DCC '06 Proceedings of the Data Compression Conference
  • Year: 2006

Abstract

A close inspection of Fiala and Greene's implementation of the Ziv-Lempel '77 dictionary compression method reveals a surprising inefficiency: the match they search for from the root down the suffix tree can be obtained for free from the suffix tree construction algorithm! This observation suggests that the output of this compression method is, in a way, just a description of the suffix tree construction for the input string. Taken one step further, this leads to the idea that there exists a whole family of compression methods that replace the input string with a description of the suffix tree construction for this string. This family contains some (multi-)dictionary methods not dissimilar to those of Bloom or Hoang et al., as well as some Prediction by Partial Matching variants. We give a general description of this family along with details about some of its members, discuss implementation issues, and show some early experimental results. A possible application to other suffix structures like the Compact Directed Acyclic