String Matching with Stopper Encoding and Code Splitting

  • Authors:
  • Jussi Rautio; Jani Tanninen; Jorma Tarhio

  • Venue:
  • CPM '02 Proceedings of the 13th Annual Symposium on Combinatorial Pattern Matching
  • Year:
  • 2002

Abstract

We consider exact string searching in compressed texts. We utilize a semi-static compression scheme, where characters of the text are encoded as variable-length sequences of base symbols, each of which is represented by a fixed number of bits. In addition, we split the symbols into two parallel files in order to allow faster access. Our searching algorithm is a modification of the Boyer-Moore-Horspool algorithm. Our approach is practical and enables faster searching of string patterns than earlier character-based compression models and the best Boyer-Moore variants in uncompressed texts.
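The idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes 2-bit base symbols, with symbols 0 to 2 acting as stoppers (the last symbol of a codeword) and symbol 3 as the only continuer; the paper's actual parameters may differ. All function names are hypothetical. The split into two parallel files is not modelled, and the encoded text is kept as a plain list of symbols rather than packed bits.

```python
from collections import Counter
from itertools import count, product

STOPPERS = (0, 1, 2)   # assumed: symbols that end a codeword
CONTINUERS = (3,)      # assumed: symbols that may precede the stopper


def codewords(stoppers, continuers):
    """Yield codewords (continuer* stopper) in order of increasing length."""
    for n in count(0):
        for prefix in product(continuers, repeat=n):
            for s in stoppers:
                yield prefix + (s,)


def build_codes(text, stoppers=STOPPERS, continuers=CONTINUERS):
    """Semi-static model: frequent characters get the shorter codewords."""
    ranked = [ch for ch, _ in Counter(text).most_common()]
    return dict(zip(ranked, codewords(stoppers, continuers)))


def encode(s, codes):
    """Concatenate the codewords of s into one list of base symbols.

    Raises KeyError for a character that did not occur in the model text.
    """
    return [sym for ch in s for sym in codes[ch]]


def horspool_encoded(text_syms, pat_syms, stoppers=STOPPERS):
    """Horspool search over base symbols; returns symbol offsets of matches.

    A candidate is accepted only if it starts on a codeword boundary,
    i.e. at offset 0 or directly after a stopper.
    """
    m, n = len(pat_syms), len(text_syms)
    if m == 0 or m > n:
        return []
    # Shift table: distance from the rightmost occurrence of each symbol
    # (excluding the last pattern position) to the end of the pattern.
    shift = {sym: m - 1 - i for i, sym in enumerate(pat_syms[:-1])}
    hits = []
    pos = 0
    while pos + m <= n:
        aligned = pos == 0 or text_syms[pos - 1] in stoppers
        if aligned and text_syms[pos:pos + m] == pat_syms:
            hits.append(pos)
        pos += shift.get(text_syms[pos + m - 1], m)
    return hits


text = "abracadabra abracadabra"
codes = build_codes(text)
encoded = encode(text, codes)
hits = horspool_encoded(encoded, encode("cad", codes))
print(hits)
```

The boundary test is what makes searching the compressed form safe: a short codeword can also occur as the tail of a longer one, and requiring a stopper immediately before the match rejects such misaligned candidates without decoding the text.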