Approximate string matching with reduced alphabet

  • Authors: Leena Salmela; Jorma Tarhio

  • Affiliations: Department of Computer Science, University of Helsinki; Department of Computer Science and Engineering, Aalto University

  • Venue: Algorithms and Applications
  • Year: 2010

Abstract

We present a method to speed up approximate string matching by mapping the original alphabet to a smaller alphabet. We apply the alphabet reduction scheme to a tuned version of the approximate Boyer–Moore algorithm utilizing the Four-Russians technique. Our experiments show that the alphabet reduction makes the algorithm faster. In particular, in the k-mismatch case the new variation is faster than earlier algorithms on English data for small values of k.
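
The general idea of alphabet reduction in the k-mismatch setting can be illustrated with a small sketch. It is not the authors' tuned Boyer–Moore algorithm and does not use the Four-Russians technique; it is a naive filter-and-verify scan that only shows why a reduced alphabet is safe to use as a filter. If two characters differ after mapping, they also differ before mapping, so the number of mismatches in the reduced alphabet never exceeds the number in the original alphabet. The mapping used here (character code modulo `sigma`) and all function names are assumptions chosen for illustration.

```python
def reduce_text(s, sigma=4):
    """Map each character to a reduced alphabet of size sigma.

    Assumption: the mapping is simply the character code modulo sigma;
    the paper's actual mapping is not specified in the abstract.
    """
    return [ord(c) % sigma for c in s]


def hamming(a, b):
    """Count positions where two equal-length sequences differ."""
    return sum(x != y for x, y in zip(a, b))


def k_mismatch_brute(text, pattern, k):
    """Reference answer: all start positions with at most k mismatches."""
    m = len(pattern)
    return [i for i in range(len(text) - m + 1)
            if hamming(text[i:i + m], pattern) <= k]


def k_mismatch_reduced(text, pattern, k, sigma=4):
    """Filter candidates in the reduced alphabet, then verify in the original.

    Mapping can only hide mismatches, never create them, so every true
    occurrence passes the filter; false positives are removed by the
    verification step.
    """
    m = len(pattern)
    reduced_text = reduce_text(text, sigma)
    reduced_pattern = reduce_text(pattern, sigma)
    hits = []
    for i in range(len(text) - m + 1):
        if hamming(reduced_text[i:i + m], reduced_pattern) <= k:
            if hamming(text[i:i + m], pattern) <= k:
                hits.append(i)
    return hits


if __name__ == "__main__":
    print(k_mismatch_reduced("the cat sat on the mat", "cat", 1))
```

In this sketch the reduced scan gives no speed-up by itself; the benefit reported in the abstract comes from combining the smaller alphabet with the approximate Boyer–Moore algorithm, whose details are beyond what the abstract states.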