Genome-wide compatible SNP intervals and their properties

Authors:
Jeremy Wang;Kyle J. Moore;Qi Zhang;Fernando Pardo-Manual de Villena;Wei Wang;Leonard McMillan
Affiliations:
University of North Carolina, Chapel Hill, NC;University of North Carolina, Chapel Hill, NC;University of North Carolina, Chapel Hill, NC;University of North Carolina, Chapel Hill, NC;University of North Carolina, Chapel Hill, NC;University of North Carolina, Chapel Hill, NC
Venue:
Proceedings of the First ACM International Conference on Bioinformatics and Computational Biology
Year:
2010

Citing 14
Cited 1

Reconstructing a history of recombinations from a set of sequences

SODA '94 Proceedings of the fifth annual ACM-SIAM symposium on Discrete algorithms
Introduction to algorithms

Introduction to algorithms
Haplotyping as perfect phylogeny: conceptual framework and efficient solutions

Proceedings of the sixth annual international conference on Computational biology
Large scale reconstruction of haplotypes from genotype data

RECOMB '03 Proceedings of the seventh annual international conference on Research in computational molecular biology
Perfect phylogeny and haplotype assignment

RECOMB '04 Proceedings of the eighth annual international conference on Resaerch in computational molecular biology
Efficient computation of close lower and upper bounds on the minimum number of recombinations in biological sequence evolution

Bioinformatics
Recombination-filtered genomic datasets by information maximization

Bioinformatics
Genotype Sequence Segmentation: Handling Constraints and Noise

WABI '08 Proceedings of the 8th international workshop on Algorithms in Bioinformatics
Efficient whole-genome association mapping using local phylogenies for unphased genotype data

Bioinformatics
The Multi-State Perfect Phylogeny Problem with Missing and Removable Data: Solutions via Integer-Programming and Chordal Graph Theory

RECOMB 2'09 Proceedings of the 13th Annual International Conference on Research in Computational Molecular Biology
Efficiently finding the most parsimonious phylogenetic tree via linear programming

ISBRA'07 Proceedings of the 3rd international conference on Bioinformatics research and applications
On the complexity of SNP block partitioning under the perfect phylogeny model

WABI'06 Proceedings of the 6th international conference on Algorithms in Bioinformatics
A linear-time algorithm for the perfect phylogeny haplotyping (PPH) problem

RECOMB'05 Proceedings of the 9th Annual international conference on Research in Computational Molecular Biology
Improved algorithms for inferring the minimum mosaic of a set of recombinants

CPM'07 Proceedings of the 18th annual conference on Combinatorial Pattern Matching

Dynamic visualization and comparative analysis of multiple collinear genomic data

Proceedings of the 2nd ACM Conference on Bioinformatics, Computational Biology and Biomedicine

Quantified Score

Hi-index	0.00

Visualization

Abstract

Intraspecific genomes can be subdivided into blocks with limited diversity. Understanding the distribution and structure of these blocks will help to unravel many biological problems including the identification of genes associated with complex diseases, finding the ancestral origins of a given population, and localizing regions of historical recombination, gene conversion, and homoplasy. We present methods for partitioning a genome into blocks for which there are no apparent recombinations, thus providing parsimonious sets of compatible genome intervals based on the four-gamete test. Our contribution is a thorough analysis of the problem of dividing a genome into compatible intervals, in terms of its computational complexity, and by providing an achievable lower-bound on the minimal number of intervals required to cover an entire data set. In general, such minimal interval partitions are not unique. However, we identify properties that are common to every possible solution. We also define the notion of an interval set that achieves the interval lower-bound, yet maximizes interval overlap. We demonstrate algorithms for partitioning both haplotype data from inbred mice as well as outbred heterozygous genotype data using extensions of the standard four-gamete test. These methods allow our algorithms to be applied to a wide range of genomic data sets.