Time/Space Efficient Compressed Pattern Matching

  • Authors:
  • Leszek Gasieniec;Igor Potapov

  • Affiliations:
  • -;-

  • Venue:
  • FCT '01 Proceedings of the 13th International Symposium on Fundamentals of Computation Theory
  • Year:
  • 2001

Quantified Score

Hi-index 0.00

Visualization

Abstract

An exact pattern matching problem is to find all occurrences of a pattern p in a text t. We say that the pattern matching algorithm is optimal if its running time is linear in the sizes of t and p, i.e. O(t + p). Perhaps one of the most interesting settings of the pattern matching problem is when one has to design an efficient algorithm with a help of small extra space. In this paper we explore this setting to the extreme.We use an additional assumption that the text t is available only in a compressed form, represented by a straight-line program. The compression methods based on efficient construction of straight-line programs are as competitive as the compression standards, including Lempel-Ziv's compression scheme and recently intensively studied compression via block sorting, due to Burrows and Wheeler. Our main result consists in solving compressed string matching problem in optimal linear time when only a constant size of extra space is available. We also discuss an efficient implementation of a version our algorithm showing that the new concept may have also interesting real applications.