Algorithms for variable length Markov chain modeling

  • Authors:
  • Gill Bejerano

  • Affiliations:
  • Center for Biomolecular Science and Engineering, School of Engineering, University of California, Santa Cruz, CA 95064, USA

  • Venue:
  • Bioinformatics
  • Year:
  • 2004

Quantified Score

Hi-index 3.84

Visualization

Abstract

Summary: We present a general purpose implementation of variable length Markov models. Contrary to fixed order Markov models, these models are not restricted to a predefined uniform depth. Rather, by examining the training data, a model is constructed that fits higher order Markov dependencies where such contexts exist, while using lower order Markov dependencies elsewhere. As both theoretical and experimental results show, these models are capable of capturing rich signals from a modest amount of training data, without the use of hidden states. Availability: The source code is freely available at http://www.soe.ucsc.edu/~jill/src/