A linear time approximation scheme for maximum quartet consistency on sparse sampled inputs

Authors:
Sagi Snir;Raphael Yuster
Affiliations:
Dept. of Evolutionary Biology, University of Haifa, Haifa, Israel;Dept. of Mathematics, University of Haifa, Haifa, Israel
Venue:
APPROX'11/RANDOM'11 Proceedings of the 14th international workshop and 15th international conference on Approximation, randomization, and combinatorial optimization: algorithms and techniques
Year:
2011

Citing 4
Cited 0

Constructing a tree from homeomorphic subtrees, with applications to computational evolutionary biology

Proceedings of the seventh annual ACM-SIAM symposium on Discrete algorithms
Approximation algorithms

Approximation algorithms
A Polynomial Time Approximation Scheme for Inferring Evolutionary Trees from Quartet Topologies and Its Application

SIAM Journal on Computing
Reconstructing approximate phylogenetic trees from quartet samples

SODA '10 Proceedings of the twenty-first annual ACM-SIAM symposium on Discrete Algorithms

Quantified Score

Hi-index	0.00

Visualization

Abstract

Phylogenetic tree reconstruction is a fundamental biological problem. Quartet amalgamation - combining a set of trees over four taxa into a tree over the full set - stands at the heart of many phylogenetic reconstruction methods. However, even reconstruction from a consistent set of quartet trees, i.e. all quartets agree with some tree, is NP-hard, and the best approximation ratio known is 1/3. For a dense input of Θ(n4) quartets (not necessarily consistent), the problem has a polynomial time approximation scheme. When the number of taxa grows, considering such dense inputs is impractical and some sampling approach is imperative. In this paper we show that if the number of quartets sampled is at least Θ(n2) log n), there is a randomized approximation scheme, that runs in linear time in the number of quartets. The previously known polynomial approximation scheme for that problem required a very dense sample of size Θ(n4). We note that samples of size Θ(n2) log n) are sparse in the full quartet set.