Computational Complexity of Perfect-Phylogeny-Related Haplotyping Problems

  • Authors:
  • Michael Elberfeld;Till Tantau

  • Affiliations:
  • Institut für Theoretische Informatik, Universität zu Lübeck, Lübeck, Germany 23538;Institut für Theoretische Informatik, Universität zu Lübeck, Lübeck, Germany 23538

  • Venue:
  • MFCS '08 Proceedings of the 33rd international symposium on Mathematical Foundations of Computer Science
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Haplotyping, also known as haplotype phase prediction, is the problem of predicting likely haplotypes based on genotype data. This problem, which has strong practical applications, can be approached using both statistical as well as combinatorial methods. While the most direct combinatorial approach, maximum parsimony, leads to NP-complete problems, the perfect phylogeny model proposed by Gusfield yields a problem, called pph, that can be solved in polynomial (even linear) time. Even this may not be fast enough when the whole genome is studied, leading to the question of whether parallel algorithms can be used to solve the pphproblem. In the present paper we answer this question affirmatively, but we also give lower complexity bounds on its complexity. In detail, we show that the problem lies in Mod2L, a subclass of the circuit complexity class NC2, and is hard for logarithmic space and thus presumably not in NC1. We also investigate variants of the pphproblem that have been studied in the literature, like the perfect path phylogeny haplotyping problem and the combined problem where a perfect phylogeny of maximal parsimony is sought, and show that some of these variants are TC0-complete or lie in AC0.