Non-breaking similarity of genomes with gene repetitions

  • Authors:
  • Zhixiang Chen;Bin Fu;Jinhui Xu;Boting Yang;Zhiyu Zhao;Binhai Zhu

  • Affiliations:
  • Department of Computer Science, University of Texas-American, Edinburg, TX;Department of Computer Science, University of Texas-American, Edinburg, TX;Department of Computer Science, SUNY-Buffalo, Buffalo, NY;Department of Computer Science, University of Regina, Regina, Saskatchewan, Canada;Department of Computer Science, University of New Orleans, New Orleans, LA;Department of Computer Science, Montana State University, Bozeman, MT

  • Venue:
  • CPM'07 Proceedings of the 18th annual conference on Combinatorial Pattern Matching
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper we define a new similarity measure, the nonbreaking similarity, which is the complement of the famous breakpoint distance between genomes (in general, between any two sequences drawn from the same alphabet). When the two input genomes G and H, drawn from the same set of n gene families, contain gene repetitions, we consider the corresponding Exemplar Non-breaking Similarity problem (ENbS) in which we need to delete repeated genes in G and H such that the resulting genomes G and H have the maximum non-breaking similarity. We have the following results. - For the Exemplar Non-breaking Similarity problem, we prove that the Independent Set problem can be linearly reduced to this problem. Hence, ENbS does not admit any factor-n1-ε polynomial-time approximation unless P=NP. (Also, ENbS is W[1]-complete.) - We show that for several practically interesting cases of the Exemplar Non-breaking Similarity problem, there are polynomial time algorithms.