On the complexity of SNP block partitioning under the perfect phylogeny model

Authors:
Jens Gramm;Tzvika Hartman;Till Nierhoff;Roded Sharan;Till Tantau
Affiliations:
Wilhelm-Schickard-Institut für Informatik, Universität Tübingen, Germany;Dept. of Computer Science, Bar-Ilan University, Ramat-Gan, Israel;International Computer Science Institute, Berkeley;School of Computer Science, Tel-Aviv University, Tel-Aviv, Israel;Institut für Theoretische Informatik, Universität zu Lübeck, Germany
Venue:
WABI'06 Proceedings of the 6th international conference on Algorithms in Bioinformatics
Year:
2006

Citing 6
Cited 1

On the hardness of approximating minimization problems

Journal of the ACM (JACM)
Haplotyping as perfect phylogeny: conceptual framework and efficient solutions

Proceedings of the sixth annual international conference on Computational biology
The Hardness of 3 - Uniform Hypergraph Coloring

FOCS '02 Proceedings of the 43rd Symposium on Foundations of Computer Science
Perfect phylogeny and haplotype assignment

RECOMB '04 Proceedings of the eighth annual international conference on Resaerch in computational molecular biology
Haplotyping with missing data via perfect path phylogenies

Discrete Applied Mathematics
Empirical exploration of perfect phylogeny haplotyping and haplotypers

COCOON'03 Proceedings of the 9th annual international conference on Computing and combinatorics

Genome-wide compatible SNP intervals and their properties

Proceedings of the First ACM International Conference on Bioinformatics and Computational Biology

Quantified Score

Hi-index	0.00

Visualization

Abstract

Recent technologies for typing single nucleotide polymorphisms (SNPs) across a population are producing genome-wide genotype data for tens of thousands of SNP sites. The emergence of such large data sets underscores the importance of algorithms for large-scale haplotyping. Common haplotyping approaches first partition the SNPs into blocks of high linkage-disequilibrium, and then infer haplotypes for each block separately. We investigate an integrated haplotyping approach where a partition of the SNPs into a minimum number of non-contiguous subsets is sought, such that each subset can be haplotyped under the perfect phylogeny model. We show that finding an optimum partition is NP-hard even if we are guaranteed that two subsets suffice. On the positive side, we show that a variant of the problem, in which each subset is required to admit a perfect path phylogeny haplotyping, is solvable in polynomial time.