Average complexity of backward q-gram string matching algorithms

  • Authors:
  • Leena Salmela

  • Affiliations:
  • Department of Computer Science and Helsinki Institute for Information Technology, P.O. Box 68, FI-00014 University of Helsinki, Finland

  • Venue:
  • Information Processing Letters
  • Year:
  • 2012

Quantified Score

Hi-index 0.89

Visualization

Abstract

Many efficient string matching algorithms make use of q-grams and process the text in windows which are read backward. In this paper we provide a framework for analyzing the average case complexity of these algorithms taking into account the statistical dependencies between overlapping q-grams. We apply this to the q-gram Boyer-Moore-Horspool algorithm adapted to various string matching problems and show that the algorithm is optimal on average.