Suffix arrays: a new method for on-line string searches
SIAM Journal on Computing
A comparison of imperative and purely functional suffix tree constructions
ESOP '94 Selected papers of ESOP '94, the 5th European symposium on Programming
The power of amnesia: learning probabilistic automata with variable memory length
Machine Learning - Special issue on COLT '94
Reducing the space requirement of suffix trees
Software—Practice & Experience
Replacing suffix trees with enhanced suffix arrays
Journal of Discrete Algorithms - SPIRE 2002
Algorithms for variable length Markov chain modeling
Bioinformatics
Computing suffix links for suffix trees and arrays
Information Processing Letters
Compressed representations of sequences and full-text indexes
ACM Transactions on Algorithms (TALG)
Discriminative feature selection via multiclass variable memory Markov model
EURASIP Journal on Applied Signal Processing
International Journal of Bioinformatics Research and Applications
Computational Biology and Chemistry
LATA'10 Proceedings of the 4th international conference on Language and Automata Theory and Applications
Probabilistic Arithmetic Automata and Their Applications
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Hi-index | 0.00 |
Variable order Markov chains (VOMCs) are a flexible class of models that extend the well-known Markov chains. They have been applied to a variety of problems in computational biology, e.g. protein family classification. A linear time and space construction algorithm has been published in 2000 by Apostolico and Bejerano. However, neither a report of the actual running time nor an implementation of it have been published since. In this paper we use the lazy suffix tree and the enhanced suffix array to improve upon the algorithm of Apostolico and Bejerano. We introduce a new software which is orders of magnitude faster than current tools for building VOMCs, and is suitable for large scale sequence analysis.