Efficient computation of approximate gene clusters based on reference occurrences

  • Authors: Katharina Jahn

  • Affiliations: Institut für Informatik, Technische Fakultät, Universität Bielefeld, Germany

  • Venue: RECOMB-CG'10 Proceedings of the 2010 international conference on Comparative genomics
  • Year: 2010

Abstract

Whole-genome comparison based on the analysis of gene cluster conservation has become a popular approach in comparative genomics. While gene order and gene content as a whole randomize over time, certain groups of genes, which are often functionally related, are observed to remain co-located across species. However, this conservation is usually not perfect, which makes identifying these structures, often referred to as approximate gene clusters, a challenging task. In this paper, we present a polynomial-time algorithm that computes approximate gene clusters based on reference occurrences. We show that our approach yields results highly comparable to those of a more general approach and allows approximate gene cluster detection in parameter ranges that are currently not feasible for non-reference-based approaches.
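
To make the notion of a reference occurrence concrete, here is a minimal brute-force sketch in Python. It is not the polynomial-time algorithm presented in the paper, and the abstract does not define the distance measure. The sketch assumes that a reference occurrence is an interval in one genome and that an approximate occurrence is any interval in another genome whose gene set differs from the reference gene set by at most a given number of missing or extra genes (symmetric set difference). The names `approximate_occurrences` and `max_distance` are hypothetical and chosen only for illustration.

```python
from typing import List, Set, Tuple


def approximate_occurrences(
    reference: List[str],
    start: int,
    end: int,
    other: List[str],
    max_distance: int,
) -> List[Tuple[int, int]]:
    """Return all intervals (i, j), inclusive, in `other` whose gene set
    is within `max_distance` of the reference interval's gene set.

    Assumption: distance is the size of the symmetric set difference,
    i.e. the number of missing genes plus the number of extra genes.
    """
    # Gene set of the reference occurrence reference[start..end].
    ref_set: Set[str] = set(reference[start:end + 1])
    hits: List[Tuple[int, int]] = []
    for i in range(len(other)):
        window: Set[str] = set()
        for j in range(i, len(other)):
            # Extend the window one gene at a time.
            window.add(other[j])
            if len(ref_set ^ window) <= max_distance:
                hits.append((i, j))
    return hits
```

With `max_distance` set to 0, this enumeration reports only intervals whose gene set equals that of the reference interval; larger values admit imperfectly conserved occurrences. The sketch checks every interval of the second genome, so it is meant only to illustrate the problem statement under the stated assumptions, not to reflect the efficiency of the paper's method.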