Provably Fast and Accurate Recovery of Evolutionary Trees through Harmonic Greedy Triplets

  • Authors:
  • Miklós Csurös;Ming-Yang Kao

  • Affiliations:
  • -;-

  • Venue:
  • SIAM Journal on Computing
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

We give a greedy learning algorithm for reconstructing an evolutionary tree based on a certain harmonic average on triplets of terminal taxa. After the pairwise distances between terminal taxa are estimated from sequence data, the algorithm runs in $\smallbigO{\numtaxa^2}$ time using $\smallbigO{\numtaxa}$ work space, where $\numtaxa$ is the number of terminal taxa. These time and space complexities are optimal in the sense that the size of an input distance matrix is $\numtaxa^2$ and the size of an output tree is $\numtaxa$. Moreover, in the Jukes--Cantor model of evolution, the algorithm recovers the correct tree topology with high probability using sample sequences of length polynomial in (1) $\numtaxa$, (2) the logarithm of the error probability, and (3) the inverses of two small parameters.