The role of diverse populations in phylogenetic analysis

Authors:
Tiffani L. Williams;Marc L. Smith
Affiliations:
Texas A&M University, College Station, TX;Colby College, Waterville, ME
Venue:
Proceedings of the 8th annual conference on Genetic and evolutionary computation
Year:
2006

Citing 5
Cited 0

Generative communication in Linda

ACM Transactions on Programming Languages and Systems (TOPLAS)
Solving Large Scale Phylogenetic Problems using DCM2

Proceedings of the Seventh International Conference on Intelligent Systems for Molecular Biology
Gaphyl: An Evolutionary Algorithms Approach For The Study Of Natural Evolution

GECCO '02 Proceedings of the Genetic and Evolutionary Computation Conference
Rec-I-DCM3: A Fast Algorithmic Technique for Reconstructing Large Phylogenetic Trees

CSB '04 Proceedings of the 2004 IEEE Computational Systems Bioinformatics Conference
A GA for maximum likelihood phylogenetic inference using neighbour-joining as a genotype to phenotype mapping

GECCO '05 Proceedings of the 7th annual conference on Genetic and evolutionary computation

Quantified Score

Hi-index	0.01

Visualization

Abstract

The most popular approaches for reconstructing phylogenetic trees attempt to solve NP-hard optimization criteria such as maximum parsimony (MP). Currently, the best-performing heuristic for reconstructing MP trees is Recursive-Iterative DCM3 (Rec-I-DCM3), which uses a single tree (or solution) to guide its way through an exponentially-sized tree space. To improve performance further, we designed Cooperative Rec-I-DCM3, a population-based approach for utilizing a population of Rec-I-DCM3 trees.We compare the performance of Cooperative Rec-I-DCM3 to Rec-I-DCM3 on four large biological datasets. Of particular interest is Cooperative Rec-I-DCM3's selection criteria for maintaining a population of solutions. Our experiments reveal that diverse populations outperform Rec-I-DCM3 in terms of average rates of convergence to best-known MP scores. To achieve greater performance, we designed an elitist population strategy, in which each solution's tree score matches that of the best score found in each generation. The elitist strategy was by far the worst overall performer in our experiments. Hence, being greedy is not always the best approach. Instead, a population of diverse solutions allows our cooperative algorithm to achieve the greatest performance improvements.