A linear time approximation scheme for maximum quartet consistency on sparse sampled inputs

  • Authors:
  • Sagi Snir;Raphael Yuster

  • Affiliations:
  • Dept. of Evolutionary Biology, University of Haifa, Haifa, Israel;Dept. of Mathematics, University of Haifa, Haifa, Israel

  • Venue:
  • APPROX'11/RANDOM'11 Proceedings of the 14th international workshop and 15th international conference on Approximation, randomization, and combinatorial optimization: algorithms and techniques
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Phylogenetic tree reconstruction is a fundamental biological problem. Quartet amalgamation - combining a set of trees over four taxa into a tree over the full set - stands at the heart of many phylogenetic reconstruction methods. However, even reconstruction from a consistent set of quartet trees, i.e. all quartets agree with some tree, is NP-hard, and the best approximation ratio known is 1/3. For a dense input of Θ(n4) quartets (not necessarily consistent), the problem has a polynomial time approximation scheme. When the number of taxa grows, considering such dense inputs is impractical and some sampling approach is imperative. In this paper we show that if the number of quartets sampled is at least Θ(n2) log n), there is a randomized approximation scheme, that runs in linear time in the number of quartets. The previously known polynomial approximation scheme for that problem required a very dense sample of size Θ(n4). We note that samples of size Θ(n2) log n) are sparse in the full quartet set.