A practical approximation algorithm for solving massive instances of hybridization number

  • Authors:
  • Leo van Iersel;Steven Kelk;Nela Lekić;Celine Scornavacca

  • Affiliations:
  • Centrum Wiskunde & Informatica (CWI), Amsterdam, The Netherlands;Department of Knowledge Engineering (DKE), Maastricht University, Maastricht, The Netherlands;Department of Knowledge Engineering (DKE), Maastricht University, Maastricht, The Netherlands;Institut des Sciences de l'Evolution (ISEM, UMR 5554 CNRS), Université Montpellier II, Montpellier Cedex 5, France

  • Venue:
  • WABI'12 Proceedings of the 12th international conference on Algorithms in Bioinformatics
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Reticulate events play an important role in determining evolutionary relationships. The problem of computing the minimum number of such events to explain discordance between two phylogenetic trees is a hard computational problem. In practice, exact solvers struggle to solve instances with reticulation number larger than 40. For such instances, one has to resort to heuristics and approximation algorithms. Here we present the algorithm CycleKiller which is the first approximation algorithm that can produce solutions verifiably close to optimality for instances with hundreds or even thousands of reticulations. Theoretically, the algorithm is an exponential-time 2-approximation (or 4-approximation in its fastest mode). However, using simulations we demonstrate that in practice the algorithm runs quickly for large and difficult instances, producing solutions within one percent of optimality. An implementation of this algorithm, which extends the theoretical work of [14], has been made publicly available.