A framework for orthology assignment from gene rearrangement data

  • Authors:
  • Krister M. Swenson;Nicholas D. Pattengale;B. M. E. Moret

  • Affiliations:
  • Department of Computer Science, University of New Mexico, Albuquerque, NM;Department of Computer Science, University of New Mexico, Albuquerque, NM;Department of Computer Science, University of New Mexico, Albuquerque, NM

  • Venue:
  • RCG'05 Proceedings of the 2005 international conference on Comparative Genomics
  • Year:
  • 2005

Quantified Score

Hi-index 0.01

Visualization

Abstract

Gene rearrangements have been used successfully in phylogenetic reconstruction and comparative genomics, but usually under the assumption that all genomes have the same gene content and that no gene is duplicated. While these assumptions allow one to work with organellar genomes, they are too restrictive for nuclear genomes. The main challenge in handling more realistic data is how to deal with gene families, specifically, how to identify orthologs. While searching for orthologies is a common task in computational biology, it is usually done using sequence data. Sankoff first addressed the problem in 1999, introducing the notion of exemplar, but his approach uses an NP-hard optimization step to discard all but one member (the exemplar) of each gene family, losing much valuable information in the process. We approach the problem using all available data in the gene orders and gene families, provide an optimization framework in which to phrase the problem, and present some preliminary theoretical results.