AI-Bind: Improving Binding Predictions for Novel Protein Targets and Ligands

Autor:	Ayan Chatterjee, Robin Walters, Zohair Shafi, Omair Shafi Ahmed, Michael Sebek, Deisy Gysi, Rose Yu, Tina Eliassi-Rad, Albert-László Barabási, Giulia Menichetti
Přispěvatelé:	Ayan Chatterjee, Michael Sebek
Jazyk:	angličtina
Rok vydání:	2021
Předmět:	FOS: Computer and information sciences Computer Science - Machine Learning FOS: Biological sciences Quantitative Biology - Quantitative Methods Drug discovery Machine learning Protein binding XAI Interpretability Quantitative Methods (q-bio.QM) Machine Learning (cs.LG)
Popis:	Identifying novel drug-target interactions (DTI) is a critical and rate limiting step in drug discovery. While deep learning models have been proposed to accelerate the identification process, we show that state-of-the-art models fail to generalize to novel (i.e., never-before-seen) structures. We first unveil the mechanisms responsible for this shortcoming, demonstrating how models rely on shortcuts that leverage the topology of the protein-ligand bipartite network, rather than learning the node features. Then, we introduce AI-Bind, a pipeline that combines network-based sampling strategies with unsupervised pre-training, allowing us to limit the annotation imbalance and improve binding predictions for novel proteins and ligands. We illustrate the value of AI-Bind by predicting drugs and natural compounds with binding affinity to SARS-CoV-2 viral proteins and the associated human proteins. We also validate these predictions via docking simulations and comparison with recent experimental evidence, and step up the process of interpreting machine learning prediction of protein-ligand binding by identifying potential active binding sites on the amino acid sequence. Overall, AI-Bind offers a powerful high-throughput approach to identify drug-target combinations, with the potential of becoming a powerful tool in drug discovery. 83 pages, 26 figures, all references moved to a single section, new results added on AI interpretability, added comparison with MolTrans, added validation using gold standard experimental data
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::85a8378f02f25571b01e64a6d371d41f http://arxiv.org/abs/2112.13168 Zobrazit plný text záznamu