Identifying rogue taxa through reduced consensus: NP-Hardness and exact algorithms

  • Authors:
  • Akshay Deepak;Jianrong Dong;David Fernández-Baca

  • Affiliations:
  • Department of Computer Science, Iowa State University, Ames, Iowa;Department of Computer Science, Iowa State University, Ames, Iowa;Department of Computer Science, Iowa State University, Ames, Iowa

  • Venue:
  • ISBRA'12 Proceedings of the 8th international conference on Bioinformatics Research and Applications
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

A rogue taxon in a collection of phylogenetic trees is one whose position varies drastically from tree to tree. The presence of such taxa can greatly reduce the resolution of the consensus tree (e.g., the majority-rule or strict consensus) for a collection. The reduced consensus approach aims to identify and eliminate rogue taxa to produce more informative consensus trees. Given a collection of phylogenetic trees over the same leaf set, the goal is to find a set of taxa whose removal maximizes the number of internal edges in the consensus tree of the collection. We show that this problem is NP-hard for strict and majority-rule consensus. We give a polynomial-time algorithm for reduced strict consensus when the maximum degree of the strict consensus of the original trees is bounded. We describe exact integer linear programming formulations for computing reduced strict, majority and loose consensus trees. In experimental tests, our exact solutions improved over heuristic methods on several problem instances.