DeepSGP: Deep Learning for Gene Selection and Survival Group Prediction in Glioblastoma
Autor: | B. Uma Shankar, Subhashis Banerjee, Raghunath Chatterjee, Sayantan Laha, Ritaban Kirtania, Sushmita Mitra |
---|---|
Jazyk: | angličtina |
Rok vydání: | 2021 |
Předmět: |
0301 basic medicine
differentially expressed genes TK7800-8360 Computer Networks and Communications Feature extraction Genomics Computational biology Biology Correlation RNA sequencing data 03 medical and health sciences 0302 clinical medicine Glioma medicine Electrical and Electronic Engineering survival prediction Selection (genetic algorithm) business.industry Deep learning glioblastoma deep learning medicine.disease Autoencoder 030104 developmental biology Hardware and Architecture Control and Systems Engineering Feature (computer vision) 030220 oncology & carcinogenesis Signal Processing Artificial intelligence Electronics business |
Zdroj: | Electronics, Vol 10, Iss 1463, p 1463 (2021) Electronics Volume 10 Issue 12 |
ISSN: | 2079-9292 |
Popis: | Glioblastoma Multiforme (GBM) is an aggressive form of glioma, exhibiting very poor survival. Genomic input, in the form of RNA sequencing data (RNA-seq), is expected to provide vital information about the characteristics of the genes that affect the Overall Survival (OS) of patients. This could have a significant impact on treatment planning. We present a new Autoencoder (AE)-based strategy for the prediction of survival (low or high) of GBM patients, using the RNA-seq data of 129 GBM samples from The Cancer Genome Atlas (TCGA). This is a novel interdisciplinary approach to integrating genomics with deep learning towards survival prediction. First, the Differentially Expressed Genes (DEGs) were selected using EdgeR. These were further reduced using correlation-based analysis. This was followed by the application of ranking with different feature subset selection and feature extraction algorithms, including the AE. In each case, fifty features were selected/extracted, for subsequent prediction with different classifiers. An exhaustive study for survival group prediction, using eight different classifiers with the accuracy and Area Under the Curve (AUC), established the superiority of the AE-based feature extraction method, called DeepSGP. It produced a very high accuracy (0.83) and AUC (0.90). Of the eight classifiers, using the extracted features by DeepSGP, the MLP was the best at Overall Survival (OS) prediction with an accuracy of 0.89 and an AUC of 0.97. The biological significance of the genes extracted by the AE were also analyzed to establish their importance. Finally, the statistical significance of the predicted output of the DeepSGP algorithm was established using the concordance index. |
Databáze: | OpenAIRE |
Externí odkaz: |