Compressed Pattern Matching for Sequitur

  • Authors:
  • Shuichi Mitarai;Masahiro Hirao;Tetsuya Matsumoto;Ayumi Shinohara;Setsuo Arikawa;Masayuki Takeda

  • Affiliations:
  • -;-;-;-;-;-

  • Venue:
  • DCC '01 Proceedings of the Data Compression Conference
  • Year:
  • 2001

Quantified Score

Hi-index 0.00

Visualization

Abstract

Abstract: Sequitur due to Nevill-Manning and Witten [19] is a powerful program to infer a phrase hierarchy from the input text, that also provides extremely effective compression of large quantities of semi-structured text [18]. In this paper, we address the problem of searching in Sequitur compressed text directly. We show a compressed pattern matching algorithm that finds a pattern in compressed text without explicit decompression. We show that our algorithm is approximately 1.27 times faster than a decompression followed by an ordinal search.