Evaluation of Label Dependency for the Prediction of HLA Genes

  • Authors:
  • Vanja Paunić;Michael Steinbach;Abeer Madbouly;Vipin Kumar

  • Affiliations:
  • Dept. of Computer Science, University of Minnesota, Minneapolis, MN, 55455;Dept. of Computer Science, University of Minnesota, Minneapolis, MN, 55455;Bioinformatics Research, National Marrow Donor Program, Minneapolis, MN, 55413;Dept. of Computer Science, University of Minnesota, Minneapolis, MN, 55455

  • Venue:
  • Proceedings of the International Conference on Bioinformatics, Computational Biology and Biomedical Informatics
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

The Human Leukocyte Antigen (HLA) gene system plays a crucial role in hematopoietic stem cell transplantation, where patients and donors are matched with respect to their HLA genes in order to maximize the chances of a successful transplant. It is the most polymorphic region of the human genome with some of the strongest associations with autoimmune, infectious, and inflammatory diseases. The availability of HLA data is, therefore, of high importance to clinicians and researchers. However, due to its high polymorphism, obtaining it is time- and cost-prohibitive. We previously described a method for the prediction of HLA genes from widely available Single Nucleotide Polymorphism (SNP) data. In this paper we show that using HLA gene dependency information improves prediction performance on multiple real-world data sets. More specifically, we propose and evaluate different approaches for integrating HLA gene dependency into the prediction process. The results from experiments on two real data sets show that adding dependency information is a valuable asset for HLA gene prediction, particularly for smaller data sets.