Efficient Algorithms for SNP Haplotype Block Selection Problems

Authors: Yaw-Ling Lin
Affiliations: Dept. of Computer Science and Information Engineering, Providence University, Taiwan
Venue: COCOON '08: Proceedings of the 14th Annual International Conference on Computing and Combinatorics
Year: 2008

Abstract

Global patterns of human DNA sequence variation (haplotypes) defined by common single nucleotide polymorphisms (SNPs) have important implications for identifying disease associations and human traits. Recent genetics research reveals that SNPs within certain haplotype blocks induce only a few distinct common haplotypes in the majority of the population. The existence of haplotype block structure has serious implications for association-based methods for mapping disease genes. Our ultimate goal is to select haplotype block designations that best capture the structure within the data. In this paper we propose several efficient combinatorial algorithms for selecting interesting haplotype blocks under different diversity functions; these algorithms generalize many previous results in the literature. In particular, given an m × n haplotype matrix A, we give linear-time algorithms for finding all interval diversities, farthest sites, and the longest block within A. For selecting multiple long blocks under a diversity constraint, we show that k blocks with the longest total length can be found in O(nk) time. We also propose linear-time algorithms for calculating all intra-longest-blocks and all intra-k-longest-blocks.
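
To make the block-selection setting concrete, the following is a minimal Python sketch and not the paper's algorithms. It assumes one particular diversity function (the number of distinct haplotypes, i.e. distinct row substrings, within a column interval), reads "farthest site" as the leftmost column from which a block ending at a given column still meets the diversity bound, and allows at most k blocks. The helper names `distinct_haplotypes`, `farthest_sites`, and `max_total_length` are hypothetical. The diversity computation here is naive and not linear time; only the final dynamic program runs in O(nk), which matches the bound stated in the abstract.

```python
from typing import List, Sequence


def distinct_haplotypes(A: Sequence[str], i: int, j: int) -> int:
    """Assumed diversity function: number of distinct rows restricted to columns i..j."""
    return len({row[i:j + 1] for row in A})


def farthest_sites(A: Sequence[str], max_div: int) -> List[int]:
    """For each right end j, the smallest i with diversity(i, j) <= max_div.

    The entry is j + 1 when even the single column j exceeds the bound.
    The two-pointer scan relies on monotonicity: a sub-interval never has
    more distinct rows than the interval containing it.
    """
    n = len(A[0]) if A else 0
    left: List[int] = []
    i = 0
    for j in range(n):
        # Naive check: each call rescans the interval (not linear time overall).
        while i <= j and distinct_haplotypes(A, i, j) > max_div:
            i += 1
        left.append(i)
    return left


def max_total_length(left: List[int], k: int) -> int:
    """Maximum total length of at most k disjoint feasible blocks, in O(nk) time."""
    n = len(left)
    # best[t][j]: best total length using at most t blocks within columns 0..j-1.
    best = [[0] * (n + 1) for _ in range(k + 1)]
    for t in range(1, k + 1):
        for j in range(1, n + 1):
            i = left[j - 1]      # start of the longest feasible block ending at column j-1
            length = j - i       # 0 when no feasible block ends at column j-1
            best[t][j] = max(best[t][j - 1], best[t - 1][i] + length)
    return best[k][n]
```

Under these assumptions, the longest single block is obtained as `max(j - left[j] + 1 for j in range(n))`. The dynamic program only ever tries the longest feasible block ending at each column, which suffices because shortening a feasible block keeps it feasible under a monotone diversity function.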