FNphasing: A Novel Fast Heuristic Algorithm for Haplotype Phasing Based on Flow Network Model

  • Authors:
  • Jiaoyun Yang;Yun Xu;Xiaohui Yao;Guoliang Chen

  • Affiliations:
  • University of Science and Technology of China, Hefei;University of Science and Technology of China, Hefei;University of Science and Technology of China, Hefei;University of Science and Technology of China, Hefei

  • Venue:
  • IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

An enormous amount of sequence data has been generated with the development of new DNA sequencing technologies, which presents great challenges for computational biology problems such as haplotype phasing. Although arduous efforts have been made to address this problem, the current methods still cannot efficiently deal with the incoming flood of large-scale data. In this paper, we propose a flow network model to tackle haplotype phasing problem, and explain some classical haplotype phasing rules based on this model. By incorporating the heuristic knowledge obtained from these classical rules, we design an algorithm FNphasing based on the flow network model. Theoretically, the time complexity of our algorithm is $(O(n^2m+m^2))$, which is better than that of 2SNP, one of the most efficient algorithms currently. After testing the performance of FNphasing with several simulated data sets, the experimental results show that when applied on large-scale data sets, our algorithm is significantly faster than the state-of-the-art Beagle algorithm. FNphasing also achieves an equal or superior accuracy compared with other approaches.