Large-Scale Structure-Based Prediction of Stable Peptide Binding to Class I HLAs Using Random Forests

Autor:	Jayvee R. Abella, Dinler A. Antunes, Cecilia Clementi, Lydia E. Kavraki
Jazyk:	angličtina
Rok vydání:	2020
Předmět:	structural modeling random forests machine learning HLA-I peptide binding docking Immunologic diseases. Allergy RC581-607
Zdroj:	Frontiers in Immunology, Vol 11 (2020)
Druh dokumentu:	article
ISSN:	1664-3224
DOI:	10.3389/fimmu.2020.01583
Popis:	Prediction of stable peptide binding to Class I HLAs is an important component for designing immunotherapies. While the best performing predictors are based on machine learning algorithms trained on peptide-HLA (pHLA) sequences, the use of structure for training predictors deserves further exploration. Given enough pHLA structures, a predictor based on the residue-residue interactions found in these structures has the potential to generalize for alleles with little or no experimental data. We have previously developed APE-Gen, a modeling approach able to produce pHLA structures in a scalable manner. In this work we use APE-Gen to model over 150,000 pHLA structures, the largest dataset of its kind, which were used to train a structure-based pan-allele model. We extract simple, homogenous features based on residue-residue distances between peptide and HLA, and build a random forest model for predicting stable pHLA binding. Our model achieves competitive AUROC values on leave-one-allele-out validation tests using significantly less data when compared to popular sequence-based methods. Additionally, our model offers an interpretation analysis that can reveal how the model composes the features to arrive at any given prediction. This interpretation analysis can be used to check if the model is in line with chemical intuition, and we showcase particular examples. Our work is a significant step toward using structure to achieve generalizable and more interpretable prediction for stable pHLA binding.
Databáze:	Directory of Open Access Journals
Externí odkaz:	https://doaj.org/article/2aa4c6caeb2b4ad38938b142fb83a03e Zobrazit plný text záznamu View record in DOAJ