Heuristics for the gene-duplication problem: a Θ(n) speed-up for the local search

Authors:
Mukul S. Bansal;J. Gordon Burleigh;Oliver Eulenstein;André Wehe
Affiliations:
Department of Computer Science, Iowa State University, Ames, IA;National Evolutionary Synthesis Center, Durham, NC;Department of Computer Science, Iowa State University, Ames, IA;Department of Electrical and Computer Engineering, Iowa State University, Ames, IA
Venue:
RECOMB'07 Proceedings of the 11th annual international conference on Research in computational molecular biology
Year:
2007

Citing 5
Cited 16

Fast algorithms for finding nearest common ancestors

SIAM Journal on Computing
On reconstructing species trees from gene trees in term of duplications and losses

RECOMB '98 Proceedings of the second annual international conference on Computational molecular biology
New algorithms for the duplication-loss model

RECOMB '00 Proceedings of the fourth annual international conference on Computational molecular biology
The LCA Problem Revisited

LATIN '00 Proceedings of the 4th Latin American Symposium on Theoretical Informatics
Reconciling gene trees to a species tree

CIAC'03 Proceedings of the 5th Italian conference on Algorithms and complexity

An Ω(n^2/ log n) Speed-Up of TBR Heuristics for the Gene-Duplication Problem

IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
The Gene-Duplication Problem: Near-Linear Time Algorithms for NNI-Based Local Searches

IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
New Perspectives on Gene Family Evolution: Losses in Reconciliation and a Link with Supertrees

RECOMB 2'09 Proceedings of the 13th Annual International Conference on Research in Computational Molecular Biology
A scalable parallelization of the gene duplication problem

Journal of Parallel and Distributed Computing
A heuristic algorithm for reconstructing ancestral gene orders with duplications

RECOMB-CG'07 Proceedings of the 2007 international conference on Comparative genomics
The gene-duplication problem: near-linear time algorithms for NNI based local searches

ISBRA'08 Proceedings of the 4th international conference on Bioinformatics research and applications
Inferring species trees from gene duplication episodes

Proceedings of the First ACM International Conference on Bioinformatics and Computational Biology
Linear-Time Algorithms for the Multiple Gene Duplication Problems

IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
H-trees: a Model of Evolutionary Scenarios with Horizontal Gene Transfer

Fundamenta Informaticae - From Mathematical Beauty to the Truth of Nature: to Jerzy Tiuryn on his 60th Birthday
An approximation algorithm for computing a parsimonious first speciation in the gene duplication model

RECOMB-CG'10 Proceedings of the 2010 international conference on Comparative genomics
A linear time algorithm for error-corrected reconciliation of unrooted gene trees

ISBRA'11 Proceedings of the 7th international conference on Bioinformatics research and applications
Complexity insights of the minimum duplication problem

SOFSEM'12 Proceedings of the 38th international conference on Current Trends in Theory and Practice of Computer Science
Inferring evolutionary scenarios in the duplication, loss and horizontal gene transfer model

Logic and Program Semantics
GTP supertrees from unrooted gene trees: linear time algorithms for NNI based local searches

ISBRA'12 Proceedings of the 8th international conference on Bioinformatics Research and Applications
An Ω(n2/ log n) speed-up of TBR heuristics for the gene-duplication problem

WABI'07 Proceedings of the 7th international conference on Algorithms in Bioinformatics
Efficient Algorithms for Knowledge-Enhanced Supertree and Supermatrix Phylogenetic Problems

IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)

Quantified Score

Hi-index	0.00

Visualization

Abstract

The gene-duplication problem is to infer a species supertree from a collection of gene trees that are confounded by complex histories of gene duplications. This problem is NP-hard and thus requires efficient and effective heuristics. Existing heuristics perform a stepwise search of the tree space, where each step is guided by an exact solution to an instance of a local search problem. We show how this local search problem can be solved efficiently by reusing previously computed information. This improves the running time of the current solution by a factor of n, where n is the number of species in the resulting supertree solution, and makes the gene-duplication problem more tractable for large-scale phylogenetic analyses. We verify the exceptional performance of our solution in a comparison study using sets of large randomly generated gene trees. Furthermore, we demonstrate the utility of our solution by incorporating large genomic data sets from GenBank into a supertree analysis of plants.