On the hardness of approximating minimization problems
Journal of the ACM (JACM)
Haplotyping as perfect phylogeny: conceptual framework and efficient solutions
Proceedings of the sixth annual international conference on Computational biology
The Hardness of 3 - Uniform Hypergraph Coloring
FOCS '02 Proceedings of the 43rd Symposium on Foundations of Computer Science
Perfect phylogeny and haplotype assignment
RECOMB '04 Proceedings of the eighth annual international conference on Resaerch in computational molecular biology
Haplotyping with missing data via perfect path phylogenies
Discrete Applied Mathematics
Empirical exploration of perfect phylogeny haplotyping and haplotypers
COCOON'03 Proceedings of the 9th annual international conference on Computing and combinatorics
Genome-wide compatible SNP intervals and their properties
Proceedings of the First ACM International Conference on Bioinformatics and Computational Biology
Hi-index | 0.00 |
Recent technologies for typing single nucleotide polymorphisms (SNPs) across a population are producing genome-wide genotype data for tens of thousands of SNP sites. The emergence of such large data sets underscores the importance of algorithms for large-scale haplotyping. Common haplotyping approaches first partition the SNPs into blocks of high linkage-disequilibrium, and then infer haplotypes for each block separately. We investigate an integrated haplotyping approach where a partition of the SNPs into a minimum number of non-contiguous subsets is sought, such that each subset can be haplotyped under the perfect phylogeny model. We show that finding an optimum partition is NP-hard even if we are guaranteed that two subsets suffice. On the positive side, we show that a variant of the problem, in which each subset is required to admit a perfect path phylogeny haplotyping, is solvable in polynomial time.