The maximum agreement forest problem: Approximation algorithms and computational experiments

  • Authors:
  • Estela M. Rodrigues;Marie-France Sagot;Yoshiko Wakabayashi

  • Affiliations:
  • Universidade de São Paulo, Departamento de Ciência da Computação, Rua do Matão, 1010, 05508-090São Paulo, Brazil;INRIA Rhône-Alpes, Université Claude Bernard, Lyon I, 43, Bd du 11 novembre 1918, 69622 Villeurbanne cedex, France;Universidade de São Paulo, Departamento de Ciência da Computação, Rua do Matão, 1010, 05508-090São Paulo, Brazil

  • Venue:
  • Theoretical Computer Science
  • Year:
  • 2007

Quantified Score

Hi-index 5.23

Visualization

Abstract

There are various techniques for reconstructing phylogenetic trees from data, and in this context the problem of determining how distant two such trees are from each other arises naturally. Various metrics for measuring the distance between two phylogenies have been defined. Another way of comparing two trees T and U is to compute the so called maximum agreement forest of these trees. Informally, the number of components of an agreement forest tells how many edges from each of T and U need to be cut so that the resulting forests agree, after performing all forced edge contractions. This problem is NP-hard even when the input trees have maximum degree 2. Hein et al. [J. Hein, T. Jiang, L. Wang, K. Zhang, On the complexity of comparing evolutionary trees, Discrete Applied Mathematics 71 (1996) 153-169] presented an approximation algorithm for it, claimed to have performance ratio 3. We show that the performance ratio of the algorithm proposed by Hein et al. is 4, and we also present two new 3-approximation algorithms for this problem. We show how to modify one of the algorithms into a (d+1)-approximation algorithm for trees with bounded degree d, d=2. Finally, we report on some computational experiments comparing the performance of the algorithms presented in this paper.