A practical parameterised algorithm for the individual haplotyping problem mlf†

  • Authors:
  • Minzhu Xie;Jianxin Wang;Jianer Chen

  • Affiliations:
  • School of information science and engineering, central south university, changsha 410083, p.r.china and college of physics and information science, hunan normal university, changsha 410081, p.r.ch ...;School of information science and engineering, central south university, changsha 410083, p.r.china email: jxwang@mail.csu.edu.cn;School of information science and engineering, central south university, changsha 410083, p.r.china and department of computer science, texas a&m university, college station, tx 77843, u.s.a. ...

  • Venue:
  • Mathematical Structures in Computer Science
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

Haplotypes are more useful in complex disease gene mapping than single-nucleotide polymorphisms (SNPs). However, haplotypes are difficult to obtain directly using biological experiments, which has prompted research into efficient computational methods for determining haplotypes. The individual haplotyping problem called Minimum Letter Flip (MLF) is a computational problem that, given a set of aligned DNA sequence fragment data of an individual, induces the corresponding haplotypes by flipping minimum SNPs. There has been no practical exact algorithm for solving the problem. Due to technical limits in DNA sequencing experiments, the maximum length of a fragment sequenced directly is about 1kb. In consequence, with a genome-average SNP density of 1.84 SNPs per 1 kb of DNA sequence, the maximum number k1 of SNP sites that a fragment covers is usually small. Moreover, in order to save time and money, the maximum number k2 of fragments that cover an SNP site is usually no more than 19. Building on these fragment data properties, the current paper introduces a new parameterised algorithm with running time O(nk22k2 + mlogm + mk1), where m is the number of fragments and n is the number of SNP sites. In practical biological applications, the algorithm solves the MLF problem efficiently even if m and n are large.