Worst-case optimal approximation algorithms for maximizing triplet consistency within phylogenetic networks

  • Authors:
  • Jaroslaw Byrka;Pawel Gawrychowski;Katharina T. Huber;Steven Kelk

  • Affiliations:
  • Centrum voor Wiskunde en Informatica, Kruislaan 413, NL-1098 SJ Amsterdam, The Netherlands and Eindhoven University of Technology, P.O. Box 513, 5600 MB Eindhoven, The Netherlands;Institute of Computer Science, University of Wroclaw, ul. Joliot-Curie 15, 50-383 Wroclaw, Poland;School of Computing Sciences, University of East Anglia, Norwich, NR4 7TJ, United Kingdom;Centrum voor Wiskunde en Informatica, Kruislaan 413, NL-1098 SJ Amsterdam, The Netherlands

  • Venue:
  • Journal of Discrete Algorithms
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

The study of phylogenetic networks is of great interest to computational evolutionary biology and numerous different types of such structures are known. This article addresses the following question concerning rooted versions of phylogenetic networks. What is the maximum value of p@?[0,1] such that for every input set T of rooted triplets, there exists some network N such that at least p|T| of the triplets are consistent with N? We call an algorithm that computes such a network (where p is maximum) worst-case optimal. Here we prove that the set containing all triplets (the full triplet set) in some sense defines p. Moreover, given a network N that obtains a fraction p^' for the full triplet set (for any p^'), we show how to efficiently modify N to obtain a fraction =p^' for any given triplet set T. We demonstrate the power of this insight by presenting a worst-case optimal result for level-1 phylogenetic networks improving considerably upon the 5/12 fraction obtained recently by Jansson, Nguyen and Sung. For level-2 phylogenetic networks we show that p=0.61. We emphasize that, because we are taking |T| as a (trivial) upper bound on the size of an optimal solution for each specific input T, the results in this article do not exclude the existence of approximation algorithms that achieve approximation ratio better than p. Finally, we note that all the results in this article also apply to weighted triplet sets.