OMG! orthologs in multiple genomes: competing graph-theoretical formulations

  • Authors:
  • Chunfang Zheng;Krister Swenson;Eric Lyons;David Sankoff

  • Affiliations:
  • Department of Mathematics and Statistics, University of Ottawa and Département d'Informatique et de Recherche Opérationnelle, Université de Montréal;Department of Mathematics and Statistics, University of Ottawa and Département d'Informatique et de Recherche Opérationnelle, Université de Montréal;iPlant, Department of Plant Sciences, University of Arizona;Department of Mathematics and Statistics, University of Ottawa

  • Venue:
  • WABI'11 Proceedings of the 11th international conference on Algorithms in bioinformatics
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

From the set of all pairwise homologies, weighted by sequence similarities, among a set of genomes, we seek disjoint orthology sets of genes, in which each element is orthogonal to all other genes (on a different genome) in the same set. In a graph-theoretical formulation, where genes are vertices and weighted edges represent homologies, we suggest three criteria, with three different biological motivations, for evaluating the partition of genes produced by deletion of a subset of edges: i) minimum weight edge removal, ii) minimum degree-zero vertex creation, and iii) maximum number of edges in the transitive closure of the graph after edge deletion. For each of the problems, all either proved or conjectured to be NP-hard, we suggest approximate and heuristic algorithms of finding orthology sets satisfying the criteria, and show how to incorporate genomes that have a whole genome duplication event in their immediate lineage. We apply this to ten flowering plant genomes, involving 160,000 different genes in given pairwise homologies. We evaluate the results in a number of ways and recommend criterion iii) as best suited to applications to multiple gene order alignment.