An ensemble learning approach for modeling the systems biology of drug-induced injury

Autor: Giulia Callegaro, Ferran Sanz, Narcis Fernandez-Fuentes, Janet Piñero, Terezinha Souza, Joaquim Aguirre-Plans, Steven J. Kunnen, Laura I. Furlong, Baldo Oliva, Emre Guney
Přispěvatelé: Toxicogenomics, RS: GROW - R1 - Prevention
Jazyk: angličtina
Rok vydání: 2021
Předmět:
Drug
EXPRESSION
GENES
Drug-induced liver injury
Drug-Related Side Effects and Adverse Reactions
PREDICTION
media_common.quotation_subject
Systems biology
Immunology
Drug structure
Computational biology
Biology
Models
Biological

General Biochemistry
Genetics and Molecular Biology

Machine Learning
03 medical and health sciences
0302 clinical medicine
Feature (machine learning)
Humans
Liver damage
Drug safety
lcsh:QH301-705.5
Ecology
Evolution
Behavior and Systematics

030304 developmental biology
media_common
RISK
0303 health sciences
INDUCED LIVER-INJURY
Applied Mathematics
Research
Systems Biology
Hepatotoxicity
Robustness (evolution)
PLATFORM
Gene signature
Ensemble learning
3. Good health
Cmap
lcsh:Biology (General)
Pharmaceutical Preparations
030220 oncology & carcinogenesis
Modeling and Simulation
CONNECTIVITY MAP
Research studies
Chemical and Drug Induced Liver Injury
General Agricultural and Biological Sciences
CAMDA
Zdroj: Biology Direct
Biology Direct, 16(1):5. BioMed Central
Biology Direct, Vol 16, Iss 1, Pp 1-14 (2021)
ISSN: 1745-6150
Popis: Background: Drug-induced liver injury (DILI) is an adverse reaction caused by the intake of drugs of common use that produces liver damage. The impact of DILI is estimated to affect around 20 in 100,000 inhabitants worldwide each year. Despite being one of the main causes of liver failure, the pathophysiology and mechanisms of DILI are poorly understood. In the present study, we developed an ensemble learning approach based on different features (CMap gene expression, chemical structures, drug targets) to predict drugs that might cause DILI and gain a better understanding of the mechanisms linked to the adverse reaction. Results: We searched for gene signatures in CMap gene expression data by using two approaches: phenotype-gene associations data from DisGeNET, and a non-parametric test comparing gene expression of DILI-Concern and No-DILI-Concern drugs (as per DILIrank definitions). The average accuracy of the classifiers in both approaches was 69%. We used chemical structures as features, obtaining an accuracy of 65%. The combination of both types of features produced an accuracy around 63%, but improved the independent hold-out test up to 67%. The use of drug-target associations as feature obtained the best accuracy (70%) in the independent hold-out test. Conclusions: When using CMap gene expression data, searching for a specific gene signature among the landmark genes improves the quality of the classifiers, but it is still limited by the intrinsic noise of the dataset. When using chemical structures as a feature, the structural diversity of the known DILI-causing drugs hampers the prediction, which is a similar problem as for the use of gene expression information. The combination of both features did not improve the quality of the classifiers but increased the robustness as shown on independent hold-out tests. The use of drug-target associations as feature improved the prediction, specially the specificity, and the results were comparable to previous research studies. The authors received funding from the Innovative Medicines Initiative 2 Joint Undertaking under grant agreements TransQST and eTRANSAFE (refs: 116030, 777365). This Joint Undertaking receives support from the European Union’s Horizon 2020 research and innovation programme and EFPIA companies in kind contribution. The authors also received support from Spanish Ministry of Economy (MINECO, refs: BIO2017–85329-R (FEDER, EU), RYC-2015-17519) as well as EU H2020 Programme 2014–2020 under grant agreement No. 676559 (Elixir-Excelerate) and from Agència de Gestió D’ajuts Universitaris i de Recerca Generalitat de Catalunya (AGAUR, ref.: 2017SGR01020). L.I.F. received support from ISCIII-FEDER (ref: CPII16/00026). The Research Programme on Biomedical Informatics (GRIB) is a member of the Spanish National Bioinformatics Institute (INB), PRB2-ISCIII and is supported by grant PT13/0001/0023, of the PE I + D + i 2013–2016, funded by ISCIII and FEDER. The DCEXS is a “Unidad de Excelencia María de Maeztu”, funded by the MINECO (ref: MDM-2014-0370). J.A.P. received support from the CAMDA Travel Fellowship.
Databáze: OpenAIRE