A machine learning-based treatment prediction model using whole genome variants of hepatitis C virus
Autor: | Ayumi Koseki, Taketo Nishina, Tomohiro Katsumi, Kyoko Hoshikawa, Takafumi Saito, Hidenori Sato, Hiroaki Haga, Yoshiyuki Ueno, Kei Mizuno, Kazuo Okumoto |
---|---|
Rok vydání: | 2020 |
Předmět: |
Male
RNA viruses 0301 basic medicine Support Vector Machine Sustained Virologic Response Molecular biology Computer science Hepacivirus medicine.disease_cause computer.software_genre Genome Machine Learning Virological response Bayes' theorem Mathematical and Statistical Techniques Sequencing techniques 0302 clinical medicine DNA sequencing Pathology and laboratory medicine Multidisciplinary Artificial neural network Hepatitis C virus Applied Mathematics Simulation and Modeling Statistics virus diseases Genomics Medical microbiology Hepatitis C Physical Sciences Viruses Amino Acid Analysis RNA Viral Medicine Drug Therapy Combination Female 030211 gastroenterology & hepatology Pathogens Transcriptome Analysis Algorithms Research Article Next-Generation Sequencing Computer and Information Sciences Science Decision tree Genome Viral Research and Analysis Methods Machine learning Antiviral Agents Microbiology Machine Learning Algorithms 03 medical and health sciences Naive Bayes classifier Artificial Intelligence Support Vector Machines Genetics medicine Humans Statistical Methods Selection (genetic algorithm) Aged Medicine and health sciences Whole genome sequencing Molecular Biology Assays and Analysis Techniques Flaviviruses business.industry Organisms Viral pathogens Genetic Variation Biology and Life Sciences Computational Biology Cancer Bayes Theorem Genome Analysis Perceptron medicine.disease Hepatitis viruses digestive system diseases Microbial pathogens Support vector machine Data set Molecular biology techniques 030104 developmental biology Neural Networks Computer Artificial intelligence business computer Mathematics Forecasting |
Zdroj: | PLoS ONE, Vol 15, Iss 11, p e0242028 (2020) PLoS ONE |
ISSN: | 1932-6203 |
Popis: | In recent years, the development of diagnostics using artificial intelligence (AI) has been remarkable. AI algorithms can go beyond human reasoning and build diagnostic models from a number of complex combinations. Using next-generation sequencing technology, we identified hepatitis C virus (HCV) variants resistant to directing-acting antivirals (DAA) by whole genome sequencing of full-length HCV genomes, and applied these variants to various machine-learning algorithms to evaluate a preliminary predictive model. HCV genomic RNA was extracted from serum from 173 patients (109 with subsequent sustained virological response [SVR] and 64 without) before DAA treatment. HCV genomes from the 109 SVR and 64 non-SVR patients were randomly divided into a training data set (57 SVR and 29 non-SVR) and a validation-data set (52 SVR and 35 non-SVR). The training data set was subject to nine machine-learning algorithms selected to identify the optimized combination of functional variants in relation to SVR status following DAA therapy. Subsequently, the prediction model was tested by the validation-data set. The most accurate learning method was the support vector machine (SVM) algorithm (validation accuracy, 0.95; kappa statistic, 0.90; F-value, 0.94). The second-most accurate learning algorithm was Multi-layer perceptron. Unfortunately, Decision Tree, and Naive Bayes algorithms could not be fitted with our data set due to low accuracy (< 0.8). Conclusively, with an accuracy rate of 95.4% in the generalization performance evaluation, SVM was identified as the best algorithm. Analytical methods based on genomic analysis and the construction of a predictive model by machine-learning may be applicable to the selection of the optimal treatment for other viral infections and cancer. |
Databáze: | OpenAIRE |
Externí odkaz: |