Diverse approaches to predicting drug-induced liver injury using gene-expression profiles

Autor:	Emi Ford, G. Rex Sumsion, Jeremy T. Beales, Daniel J. Garrett, Ifeanyichukwu O. Nwosu, Stephen R. Piccolo, Griffin R. G. Caryotakis, Michael Bradshaw, Emily D. LeBaron
Jazyk:	angličtina
Rok vydání:	2020
Předmět:	Drug Normalization (statistics) media_common.quotation_subject Immunology Drug development Computational biology Biology Models Biological Risk Assessment General Biochemistry Genetics and Molecular Biology 03 medical and health sciences Drug withdrawal 0302 clinical medicine Machine learning medicine Humans lcsh:QH301-705.5 Ecology Evolution Behavior and Systematics 030304 developmental biology media_common Hyperparameter 0303 health sciences Applied Mathematics Research Gene Expression Profiling Precision medicine medicine.disease Classification Statistical classification lcsh:Biology (General) 030220 oncology & carcinogenesis Modeling and Simulation Test set Cell lines Chemical and Drug Induced Liver Injury General Agricultural and Biological Sciences Transcriptome Algorithms
Zdroj:	Biology Direct Biology Direct, Vol 15, Iss 1, Pp 1-12 (2020)
ISSN:	1745-6150
Popis:	Background Drug-induced liver injury (DILI) is a serious concern during drug development and the treatment of human disease. The ability to accurately predict DILI risk could yield significant improvements in drug attrition rates during drug development, in drug withdrawal rates, and in treatment outcomes. In this paper, we outline our approach to predicting DILI risk using gene-expression data from Build 02 of the Connectivity Map (CMap) as part of the 2018 Critical Assessment of Massive Data Analysis CMap Drug Safety Challenge. Results First, we used seven classification algorithms independently to predict DILI based on gene-expression values for two cell lines. Similar to what other challenge participants observed, none of these algorithms predicted liver injury on a consistent basis with high accuracy. In an attempt to improve accuracy, we aggregated predictions for six of the algorithms (excluding one that had performed exceptionally poorly) using a soft-voting method. This approach also failed to generalize well to the test set. We investigated alternative approaches—including a multi-sample normalization method, dimensionality-reduction techniques, a class-weighting scheme, and expanding the number of hyperparameter combinations used as inputs to the soft-voting method. We met limited success with each of these solutions. Conclusions We conclude that alternative methods and/or datasets will be necessary to effectively predict DILI in patients based on RNA expression levels in cell lines. Reviewers This article was reviewed by Paweł P Labaj and Aleksandra Gruca (both nominated by David P Kreil).
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::1dffa6fb690cca6352f908ec7b05175d http://europepmc.org/articles/PMC6961287 Zobrazit plný text záznamu Full text from SpringerLink