An algorithm for constructing parsimonious hybridization networks with multiple phylogenetic trees

  • Authors:
  • Yufeng Wu

  • Affiliations:
  • Department of Computer Science and Engineering, University of Connecticut, Storrs, CT

  • Venue:
  • RECOMB'13 Proceedings of the 17th international conference on Research in Computational Molecular Biology
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

Phylogenetic network is a model for reticulate evolution. Hybridization network is one type of phylogenetic network for a set of discordant gene trees, and "displays" each gene tree. A central computational problem on hybridization networks is: given a set of gene trees, reconstruct the minimum (i.e. most parsimonious) hybridization network that displays each given gene tree. This problem is known to be NP-hard, and existing approaches for this problem are either heuristics or make simplifying assumptions (e.g. work with only two input trees or assume some topological properties). In this paper, we develop an exact algorithm (called PIRNC) for inferring the minimum hybridization networks from multiple gene trees. The PIRNC algorithm does not rely on structural assumptions. To the best of our knowledge, PIRNC is the first exact algorithm for this formulation. When the number of reticulation events is relatively small (say four or fewer), PIRNC runs reasonably efficient even for moderately large datasets. For building more complex networks, we also develop a heuristic version of PIRNC called PIRNCH. Simulation shows that PIRNCH usually produces networks with fewer reticulation events than those by an existing method.