Modelling electropherogram data for DNA sequencing using variable dimension MCMC

  • Authors:
  • N. M. Haan;S. J. Godsill

  • Affiliations:
  • Dept. of Eng., Cambridge Univ., UK;-

  • Venue:
  • ICASSP '00 Proceedings of the Acoustics, Speech, and Signal Processing, 2000. on IEEE International Conference - Volume 06
  • Year:
  • 2000

Quantified Score

Hi-index 0.00

Visualization

Abstract

DNA sequencing may be considered as a two stage process: the generation of noisy data indicative of DNA sequence by using advanced chemical techniques; and the interpretation of that data. We present an algorithm for interpretation, or "base calling", which accurately models the underlying process, and is able to incorporate most of the prior information we possess in a mathematically tractable and minimally ad-hoc manner. Our algorithm is framed within a fully Bayesian probabilistic framework, thereby allowing representation of the random nature of the generative process, using a Reversible Jump Metropolis Hastings algorithm (1970) and the Gibbs sampler to traverse the variable dimension parameter space. The techniques used to construct our algorithm are feasible for use in such applications, due to their inherent computational requirements.