Haplotype Inference Constrained by Plausible Haplotype Data

  • Authors:
  • Michael R. Fellows;Tzvika Hartman;Danny Hermelin;Gad M. Landau;Frances Rosamond;Liat Rozenberg

  • Affiliations:
  • The University of Newcastle, Newcastle;Google, Tel Aviv;Max Planck Institute for Informatics, Saarbrucken;University of Haifa, Haifa;The University of Newcastle, Newcastle;University of Haifa, Haifa

  • Venue:
  • IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

The haplotype inference problem (HIP) asks to find a set of haplotypes which resolve a given set of genotypes. This problem is important in practical fields such as the investigation of diseases or other types of genetic mutations. In order to find the haplotypes which are as close as possible to the real set of haplotypes that comprise the genotypes, two models have been suggested which are by now well-studied: The perfect phylogeny model and the pure parsimony model. All known algorithms up till now for haplotype inference may find haplotypes that are not necessarily plausible, i.e., very rare haplotypes or haplotypes that were never observed in the population. In order to overcome this disadvantage, we study in this paper, a new constrained version of HIP under the above-mentioned models. In this new version, a pool of plausible haplotypes \widetilde{H} is given together with the set of genotypes G, and the goal is to find a subset H \subseteq \widetilde{H} that resolves G. For constrained perfect phylogeny haplotyping (CPPH), we provide initial insights and polynomial-time algorithms for some restricted cases of the problem. For constrained parsimony haplotyping (CPH), we show that the problem is fixed parameter tractable when parameterized by the size of the solution set of haplotypes.