Faster computation of the Robinson-Foulds distance between phylogenetic networks

  • Authors:
  • Tetsuo Asano;Jesper Jansson;Kunihiko Sadakane;Ryuhei Uehara;Gabriel Valiente

  • Affiliations:
  • School of Information Science, Japan Advanced Institute of Science and Technology, Ishikawa, Japan;Ochanomizu University, Tokyo, Japan;National Institute of Informatics, Tokyo, Japan;School of Information Science, Japan Advanced Institute of Science and Technology, Ishikawa, Japan;Algorithms, Bioinformatics, Complexity and Formal Methods Research Group, Technical University of Catalonia, Barcelona, Spain

  • Venue:
  • CPM'10 Proceedings of the 21st annual conference on Combinatorial pattern matching
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

The Robinson-Foulds distance, which is the most widely used metric for comparing phylogenetic trees, has recently been generalized to phylogenetic networks. Given two networks N1,N2 with n leaves, m nodes, and e edges, the Robinson-Foulds distance measures the number of clusters of descendant leaves that are not shared by N1 and N2. The fastest known algorithm for computing the Robinson-Foulds distance between those networks runs in O(m(m + e)) time. In this paper, we improve the time complexity to O(n(m+ e)/ log n) for general networks and O(nm/log n) for general networks with bounded degree, and to optimal O(m + e) time for planar phylogenetic networks and bounded-level phylogenetic networks. We also introduce the natural concept of the minimum spread of a phylogenetic network and show how the running time of our new algorithm depends on this parameter. As an example, we prove that the minimum spread of a level-k phylogenetic network is at most k + 1, which implies that for two level-k phylogenetic networks, our algorithm runs in O((k + 1)(m + e)) time.