Inferring positional homologs with common intervals of sequences

  • Authors:
  • Guillaume Blin;Annie Chateau;Cedric Chauve;Yannick Gingras

  • Affiliations:
  • IGM-LabInfo – UMR CNRS 8049, Université Marne-la-Vallée, Marne-la-Vallée, France;LaCIM, Université du Québec à Montréal, Montréal, (QC), Canada;LaCIM, Université du Québec à Montréal, Montréal, (QC), Canada;CGL, Université du Québec à Montréal

  • Venue:
  • RCG'06 Proceedings of the RECOMB 2006 international conference on Comparative Genomics
  • Year:
  • 2006

Abstract

Inferring orthologous and paralogous genes is an important problem in whole-genome comparisons, for both functional and evolutionary studies. In this paper, we introduce a new approach for inferring candidate pairs of orthologous genes between genomes, also called positional homologs, based on the conservation of the genomic context. We consider genomes represented by their gene order – i.e. sequences of signed integers – and use common intervals of these sequences as the anchors of the final gene matching. We show that the natural combinatorial problem of computing a maximal cover of the two genomes using the minimum number of common intervals is NP-complete, and we give a simple heuristic for this problem. We illustrate the effectiveness of this first approach using common intervals of sequences on two datasets: 8 γ-proteobacterial genomes, and the human and mouse whole genomes.
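
To make the notion of a common interval of two sequences concrete, here is a minimal brute-force sketch in Python. It is not the authors' algorithm or heuristic: the function names `content_index` and `common_intervals` are hypothetical, gene signs are simply ignored, intervals are not required to be maximal, and every substring is enumerated, so it is only suitable for toy inputs. It treats a gene set as a common interval when each genome contains a contiguous segment whose gene-family content is exactly that set.

```python
from collections import defaultdict


def content_index(genome, min_size=2):
    """Map each gene-family set to the (start, end) segments with exactly that content.

    `genome` is a list of signed integers; the sign (orientation) is ignored here,
    which is a simplifying assumption of this sketch.
    """
    index = defaultdict(list)
    for i in range(len(genome)):
        seen = set()
        for j in range(i, len(genome)):
            seen.add(abs(genome[j]))
            if len(seen) >= min_size:
                index[frozenset(seen)].append((i, j))
    return index


def common_intervals(g1, g2, min_size=2):
    """Return {gene set: (segments in g1, segments in g2)} for sets found in both genomes."""
    idx1 = content_index(g1, min_size)
    idx2 = content_index(g2, min_size)
    return {s: (idx1[s], idx2[s]) for s in idx1.keys() & idx2.keys()}


# Toy genomes with a duplicated gene family (2) in the first one.
g1 = [1, 2, 3, -4, 2]
g2 = [-3, 2, 1, 5, 4]
for genes, (loc1, loc2) in sorted(common_intervals(g1, g2).items(), key=lambda kv: sorted(kv[0])):
    print(sorted(genes), loc1, loc2)
```

In this toy input, the gene sets {1, 2}, {2, 3} and {1, 2, 3} are common intervals, while {3, 4} is not, because genes 3 and 4 are never adjacent in the second genome. A matching step would then pick, among such intervals, a small collection covering as many genes as possible, which is the cover problem the abstract shows to be NP-complete.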