A probabilistic knowledge graph for target identification.

Autor: Liu, Chang, Xiao, Kaimin, Yu, Cuinan, Lei, Yipin, Lyu, Kangbo, Tian, Tingzhong, Zhao, Dan, Zhou, Fengfeng, Tang, Haidong, Zeng, Jianyang
Předmět:
Zdroj: PLoS Computational Biology; 4/5/2024, Vol. 20 Issue 4, p1-23, 23p
Abstrakt: Early identification of safe and efficacious disease targets is crucial to alleviating the tremendous cost of drug discovery projects. However, existing experimental methods for identifying new targets are generally labor-intensive and failure-prone. On the other hand, computational approaches, especially the machine learning-based frameworks, have shown remarkable application potential in drug discovery. In this work, we propose Progeni, a novel machine learning-based framework for target identification. In addition to fully exploiting the known heterogeneous biological networks from various sources, Progeni integrates literature evidence about the relations between biological entities to construct a probabilistic knowledge graph. Graph neural networks are then employed in Progeni to learn the feature embeddings of biological entities to facilitate the identification of biologically relevant target candidates. A comprehensive evaluation of Progeni demonstrated its superior predictive power over the baseline methods on the target identification task. In addition, our extensive tests showed that Progeni exhibited high robustness to the negative effect of exposure bias, a common phenomenon in recommendation systems, and effectively identified new targets that can be strongly supported by the literature. Moreover, our wet lab experiments successfully validated the biological significance of the top target candidates predicted by Progeni for melanoma and colorectal cancer. All these results suggested that Progeni can identify biologically effective targets and thus provide a powerful and useful tool for advancing the drug discovery process. Author summary: The selection of a disease target, a biological entity with which potential drugs interact to treat diseases, is often the first step of a drug discovery project. Known to be extremely costly, drug discovery projects spend tremendous resources on failed drug instances, most of which result from inadequate choices of targets. To help address the costliness problem of drug discovery, we developed a machine learning-based framework that identifies biologically effective targets using a probabilistic knowledge graph built from both biological network data and literature evidence. Our method not only outperforms state-of-the-art baseline methods on the target prediction task, but can also identify targets with high biological relevance, as shown by the strong support of the literature for the predicted target candidates and wet lab experiments that validate the significance of the predicted target candidates for melanoma and colorectal cancer, These results suggest that our method can identify effective targets and therefore benefit drug discovery. [ABSTRACT FROM AUTHOR]
Databáze: Complementary Index
Nepřihlášeným uživatelům se plný text nezobrazuje