Haplotype-based prediction of gene alleles using pedigrees and SNP genotypes

  • Authors:
  • Yuri Pirola;Gianluca Della Vedova;Paola Bonizzoni;Alessandra Stella;Filippo Biscarini

  • Affiliations:
  • DISCo, Univ. degli Studi di Milano--Bicocca, Viale Sarca 336, Milan, Italy;DISCo, Univ. degli Studi di Milano--Bicocca, Viale Sarca 336, Milan, Italy;DISCo, Univ. degli Studi di Milano--Bicocca, Viale Sarca 336, Milan, Italy;CeRSA, Parco Tecnologico Padano, Loc. Cascina Codazza, Lodi, Italy;CeRSA, Parco Tecnologico Padano, Loc. Cascina Codazza, Lodi, Italy

  • Venue:
  • Proceedings of the International Conference on Bioinformatics, Computational Biology and Biomedical Informatics
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

Computational methods for gene allele prediction have been proposed to substitute dedicated and expensive assays with cheaper in-silico analyses that operate on routinely collected data, such as SNP genotypes. Most of these methods are tailored to the needs and characteristics of human genetic studies where they achieve good prediction accuracy. However, genomic analyses are becoming increasingly important in livestock species too. For livestock species generally the underlying---usually quite large and complex---pedigree is known and available; this information is not fully exploited by current allele prediction methods. In this paper, we propose a new gene allele prediction method based on a simple, but robust, combinatorial formulation for the problem of discovering haplotype-allele associations. The inherent uncertainty of the haplotype inference process is reduced by taking into account the inheritance of gene alleles across the population pedigree while genotypes are phased. The accuracy of the method has been extensively evaluated on a representative real-world livestock dataset under several scenarios and choices of parameters. The median error rate ranged from 0.0537 to 0.0896, with an average of 0.0678; this is 21% better than another state-of-the-art prediction algorithm that does not use the pedigree information. The experimental results support the validity of the proposed approach and, in particular, of the use of pedigree information in gene allele predictions.