Online dictionary matching with variable-length gaps

  • Authors:
  • Tuukka Haapasalo;Panu Silvasti;Seppo Sippu;Eljas Soisalon-Soininen

  • Affiliations:
  • Aalto University School of Science;Aalto University School of Science;University of Helsinki;Aalto University School of Science

  • Venue:
  • SEA'11 Proceedings of the 10th international conference on Experimental algorithms
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

The string-matching problem with wildcards is considered in the context of online matching of multiple patterns. Our patterns are strings of characters in the input alphabet and of variable-length gaps, where the width of a gap may vary between two integer bounds or from an integer lower bound to infinity. Our algorithm is based on locating "keywords" of the patterns in the input text, that is, maximal substrings of the patterns that contain only input characters. Matches of prefixes of patterns are collected from the keyword matches, and when a prefix constituting a complete pattern is found, a match is reported. In collecting these partial matches we avoid locating those keyword occurrences that cannot participate in any prefix of a pattern found thus far. Our experiments show that our algorithm scales up well, when the number of patterns increases.