High-error approximate dictionary search using estimate hash comparisons

  • Authors:
  • Johan Rönnblom

  • Affiliations:
  • Senapsgatan 12, Gothenburg, Sweden

  • Venue:
  • Software—Practice & Experience
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

A method for finding all matches in a pre-processed dictionaryfor a query string q and with at most k differencesis presented. A very fast constant-time estimate using hashes ispresented. A tree structure is used to minimize the number ofestimates made. Practical tests are performed, showing that theestimate can filter out 99% of the full comparisons for 40% errorrates and dictionaries of up to four million words. The tree isfound to be efficient up to a 50% error rate. Copyright © 2006John Wiley & Sons, Ltd.