Truncated suffix trees and their application to data compression

  • Authors:
  • Joong Chae Na;Alberto Apostolico;Costas S. Iliopoulos;Kunsoo Park

  • Affiliations:
  • School of Computer Science & Engineering, Seoul National University, South Korea;Dipartimento di Elettronica e Informatica, Università di Padova, Italy and Department of Computer Sciences, Purdue University;Department of Computer Science, King's College London, UK and School of Computing, Curtin University of Technology, Australia;School of Computer Science & Engineering, Seoul National University, South Korea

  • Venue:
  • Theoretical Computer Science
  • Year:
  • 2003

Quantified Score

Hi-index 5.23

Visualization

Abstract

The suffix tree is a fundamental data structure in the area of string algorithms and it has been used in many applications including data compression. In this paper we propose a data structure called the truncated suffix tree, which is a truncated version of the suffix tree. We also present two linear-time construction algorithms for truncated suffix trees and two algorithms that delete suffixes from truncated suffix trees.The truncated suffix tree is particularly a useful data structure for LZ77 that compresses using a sliding window of a fixed size. Our algorithms lead to two implementations of LZ77 that maintain sliding windows by truncated suffix trees. We also present a technique of finding the longest match in a sliding window, which is a crucial step in LZ77.