Accurate computation of likelihoods in the coalescent with recombination via parsimony

Authors:
Rune B. Lyngsø;Yun S. Song;Jotun Hein
Affiliations:
Department of Statistics, Oxford University, Oxford, UK;Computer Science Division and Department of Statistics, University of California, Berkeley, CA;Department of Statistics, Oxford University, Oxford, UK
Venue:
RECOMB'08 Proceedings of the 12th annual international conference on Research in computational molecular biology
Year:
2008

Citing 8
Cited 2

The Number of Recombination Events in a Sample History: Conflict Graph and Lower Bounds

IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Optimal, efficient reconstruction of root-unknown phylogenetic networks with constrained and structured recombination

Journal of Computer and System Sciences - Special issue on bioinformatics II
Efficient computation of close lower and upper bounds on the minimum number of recombinations in biological sequence evolution

Bioinformatics
Counting All Possible Ancestral Configurations of Sample Sequences in Population Genetics

IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Computing the minimum number of hybridization events for a consistent evolutionary history

Discrete Applied Mathematics
The Fine Structure of Galls in Phylogenetic Networks

INFORMS Journal on Computing
Minimum recombination histories by branch and bound

WABI'05 Proceedings of the 5th International conference on Algorithms in Bioinformatics
Improved recombination lower bounds for haplotype data

RECOMB'05 Proceedings of the 9th Annual international conference on Research in Computational Molecular Biology

Exact Computation of Coalescent Likelihood under the Infinite Sites Model

ISBRA '09 Proceedings of the 5th International Symposium on Bioinformatics Research and Applications
Exact Computation of Coalescent Likelihood for Panmictic and Subdivided Populations under the Infinite Sites Model

IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)

Quantified Score

Hi-index	0.00

Visualization

Abstract

Understanding the variation of recombination rates across a given genome is crucial for disease gene mapping and for detecting signatures of selection, to name just a couple of applications. A widely-used method of estimating recombination rates is the maximum likelihood approach, and the problem of accurately computing likelihoods in the coalescent with recombination has received much attention in the past. A variety of sampling and approximation methods have been proposed, but no single method seems to perform consistently better than the rest, and there still is great value in developing better statistical methods for accurately computing likelihoods. So far, with the exception of some twolocus models, it has remained unknown how the true likelihood exactly behaves as a function of model parameters, or how close estimated likelihoods are to the true likelihood. In this paper, we develop a deterministic, parsimony-based method of accurately computing the likelihood for multi-locus input data of moderate size. We first find the set of all ancestral configurations (ACs) that occur in evolutionary histories with at most k crossover recombinations. Then, we compute the likelihood by summing over all evolutionary histories that can be constructed only using the ACs in that set. We allow for an arbitrary number of crossing over, coalescent and mutation events in a history, as long as the transitions stay within that restricted set of ACs. For given parameter values, by gradually increasing the bound k until the likelihood stabilizes, we can obtain an accurate estimate of the likelihood. At least for moderate crossover rates, the algorithm-based method described here opens up a new window of opportunities for testing and fine-tuning statistical methods for computing likelihoods.