Hunting for Functionally Analogous Genes

  • Authors:
  • Michael T. Hallett;Jens Lagergren

  • Affiliations:
  • -;-

  • Venue:
  • FST TCS 2000 Proceedings of the 20th Conference on Foundations of Software Technology and Theoretical Computer Science
  • Year:
  • 2000

Quantified Score

Hi-index 0.00

Visualization

Abstract

Evidence indicates that members of many gene families in the genome of an organism tend to have homologues both within their own genome and in the genomes of other organisms. Amongst these homologues, typically only one or a few per genome perform an analogous function in their genome. Finding subsets of these genes which show evidence of performing a common function is an important first step towards, for instance, the creation of phylogenetic trees, multiple sequence alignments and secondary structure predictions. Given a collection of taxa P = {P1, P2, ..., Pk} where Pi contains genes {pi, 1, pi, 2, ..., pi, ni}, we ask to choose one gene from each of the taxa Pi such that these chosen vertices most agree. We define most agreeing in three distinct ways: most tree-like, pairwise closest, and pairwise most similar. We show these problems to be computationally hard from almost every angle via classical, parameterized and approximation complexity theory. However, on the positive side, we give randomized approximation algorithms following ideas from [GGR98] for the pairwise closest and pairwise most similar variants.