The distribution of subword counts is usually normal
European Journal of Combinatorics
A unified approach to word occurrence probabilities
Discrete Applied Mathematics - Special volume on combinatorial molecular biology
Proceedings of the Seventh International Conference on Intelligent Systems for Molecular Biology
Estimating the Probability of Approximate Matches
CPM '97 Proceedings of the 8th Annual Symposium on Combinatorial Pattern Matching
ESA '99 Proceedings of the 7th Annual European Symposium on Algorithms
Hi-index | 0.00 |
Evaluation of the frequency of occurrences of a given set of patterns in a DNA sequence has numerous applications and has been extensively studied recently. We discuss the computational complexity for explicit formulae derived by several authors. We introduce a correlation automaton, that minimizes this complexity. This is crucial for practical applications. Notably, it allows to deal with the Markovian probability model. The case of patterns with some unspecified characters - approximate searching, regular expressions, ... - is addressed.