Efficient string matching with k mismatches
Theoretical Computer Science
SIAM Journal on Computing
Introduction to algorithms
Faster Algorithms for String Matching Problems: Matching the Convolution Bound
FOCS '98 Proceedings of the 39th Annual Symposium on Foundations of Computer Science
Faster algorithms for string matching with k mismatches
Journal of Algorithms - Special issue: SODA 2000
Approximating general metric distances between a pattern and a text
Proceedings of the nineteenth annual ACM-SIAM symposium on Discrete algorithms
ESA'07 Proceedings of the 15th annual European conference on Algorithms
Hi-index | 0.00 |
We consider the well known problem of pattern matching under the Hamming distance. Previous approaches have shown how to count the number of mismatches efficiently, especially when a bound is known for the maximum Hamming distance. Our interest is different in that we wish collect a random sample of mismatches of fixed size at each position in the text. Given a pattern p of length m and a text t of length n , we show how to sample with high probability c mismatches where possible from every alignment of p and t in O ((c + logn )(n + m logm )logm ) time. Further, we guarantee that the mismatches are sampled uniformly and can therefore be seen as representative of the types of mismatches that occur.