Probabilistic models for biological sequences: selection and Maximum Likelihood estimation

  • Authors:
  • Svetlana Ekisheva;Mark Borodovsky

  • Affiliations:
  • School of Biology, Georgia Institute of Technology, Atlanta, GA 30332-0230, USA.;Department of Biomedical Engineering, School of Biology, Georgia Institute of Technology, Atlanta, GA 30332-0230, USA

  • Venue:
  • International Journal of Bioinformatics Research and Applications
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Probabilistic models for biological sequences (DNA and proteins) are frequently used in bioinformatics. We describe statistical tests designed to detect the order of dependency among elements of the sequence and to select the most appropriate probabilistic model for an experimental biological sequence. For a model of given type, the independence model, the first-order Markov chain and the hidden Markov model (HMM), we derive the uniform lower bound for the rate of decay for the errors of the maximum likelihood (ML) estimates of the model parameters and, subsequently, the uniform confidence intervals for the parameters.