Linear time 3-approximation for the MAST problem

  • Authors:
  • Vincent Berry;Christophe Paul;Sylvain Guillemot;François Nicolas

  • Affiliations:
  • Université Montpellier II, France;Université Montpellier II, France;Université Montpellier II, France;Université Montpellier II, France

  • Venue:
  • ACM Transactions on Algorithms (TALG)
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Given a set of leaf-labeled trees with identical leaf sets, the well-known Maximum Agreement SubTree (MAST) problem consists in finding a subtree homeomorphically included in all input trees and with the largest number of leaves. MAST and its variant called Maximum Compatible Tree (MCT) are of particular interest in computational biology. This article presents a linear-time approximation algorithm to solve the complement version of MAST, namely identifying the smallest set of leaves to remove from input trees to obtain isomorphic trees. We also present an O(n2 + kn) algorithm to solve the complement version of MCT. For both problems, we thus achieve significantly lower running times than previously known algorithms. Fast running times are especially important in phylogenetics where large collections of trees are routinely produced by resampling procedures, such as the nonparametric bootstrap or Bayesian MCMC methods.