Learning MHC I—peptide binding
Bioinformatics
Multiple Instance Learning Allows MHC Class II Epitope Predictions Across Alleles
WABI '08 Proceedings of the 8th international workshop on Algorithms in Bioinformatics
Using Gaussian Process with Test Rejection to Detect T-Cell Epitopes in Pathogen Genomes
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Hi-index | 0.00 |
Specificity of MHC binding to short peptide fragments from cellular as well as pathogens' proteins has been found to correlate with disease outcome and pathogen or cancer evolution. The large variation in MHC class II epitope length has complicated training of predictors for binding affinities compared to MHC class I. In this paper, we treat the relative position of the peptide inside the MHC protein as a hidden variable, and model the ensemble of different binding configurations. The training procedure iterates the predictions with re estimation of the parameters of a binding groove model. We show that the model generalizes to new MHC class II alleles, which were not a part of the training set. To the best of our knowledge, our technique outperforms all previous approaches to MHC II epitope prediction. We demonstrate how our model can be used to explain previously documented associations between MHC II alleles and disease.