Radiomics and Deep Features: Robust Classification of Brain Hemorrhages and Reproducibility Analysis Using a 3D Autoencoder Neural Network.

Autor: Bijari S; Department of Radiology, Faculty of Paramedical, Kurdistan University of Medical Sciences, Sanandaj P.O. Box 66177-13446, Iran., Sayfollahi S; Department of Neurosurgery, School of Medicine, Iran University of Medical Sciences, Tehran P.O. Box 14496-14535, Iran., Mardokh-Rouhani S; Mechanical Engineering Group, Faculty of Engineering, University of Kurdistan, Sanandaj P.O. Box 66177-15175, Iran., Bijari S; Department of Aging and Health, School of Public Health, Shahid Sadoughi University of Medical Sciences, Yazd P.O. Box 89151-73160, Iran., Moradian S; Department of Radiology, Tehran University of Medical Sciences, Tehran P.O. Box 14197-33151, Iran., Zahiri Z; Department of Radiation Oncology, Golestan Hospital, Ahvaz Jundishapur University of Medical Sciences, Ahvaz P.O. Box 61357-15794, Iran., Rezaeijo SM; Department of Medical Physics, Faculty of Medicine, Ahvaz Jundishapur University of Medical Sciences, Ahvaz P.O. Box 61357-15794, Iran.; Cancer Research Center, Ahvaz Jundishapur University of Medical Sciences, Ahvaz P.O. Box 61357-15794, Iran.
Jazyk: angličtina
Zdroj: Bioengineering (Basel, Switzerland) [Bioengineering (Basel)] 2024 Jun 24; Vol. 11 (7). Date of Electronic Publication: 2024 Jun 24.
DOI: 10.3390/bioengineering11070643
Abstrakt: This study evaluates the reproducibility of machine learning models that integrate radiomics and deep features (features extracted from a 3D autoencoder neural network) to classify various brain hemorrhages effectively. Using a dataset of 720 patients, we extracted 215 radiomics features (RFs) and 15,680 deep features (DFs) from CT brain images. With rigorous screening based on Intraclass Correlation Coefficient thresholds (>0.75), we identified 135 RFs and 1054 DFs for analysis. Feature selection techniques such as Boruta, Recursive Feature Elimination (RFE), XGBoost, and ExtraTreesClassifier were utilized alongside 11 classifiers, including AdaBoost, CatBoost, Decision Trees, LightGBM, Logistic Regression, Naive Bayes, Neural Networks, Random Forest, Support Vector Machines (SVM), and k-Nearest Neighbors (k-NN). Evaluation metrics included Area Under the Curve (AUC), Accuracy (ACC), Sensitivity (SEN), and F1-score. The model evaluation involved hyperparameter optimization, a 70:30 train-test split, and bootstrapping, further validated with the Wilcoxon signed-rank test and q-values. Notably, DFs showed higher accuracy. In the case of RFs, the Boruta + SVM combination emerged as the optimal model for AUC, ACC, and SEN, while XGBoost + Random Forest excelled in F1-score. Specifically, RFs achieved AUC, ACC, SEN, and F1-scores of 0.89, 0.85, 0.82, and 0.80, respectively. Among DFs, the ExtraTreesClassifier + Naive Bayes combination demonstrated remarkable performance, attaining an AUC of 0.96, ACC of 0.93, SEN of 0.92, and an F1-score of 0.92. Distinguished models in the RF category included SVM with Boruta, Logistic Regression with XGBoost, SVM with ExtraTreesClassifier, CatBoost with XGBoost, and Random Forest with XGBoost, each yielding significant q-values of 42. In the DFs realm, ExtraTreesClassifier + Naive Bayes, ExtraTreesClassifier + Random Forest, and Boruta + k-NN exhibited robustness, with 43, 43, and 41 significant q-values, respectively. This investigation underscores the potential of synergizing DFs with machine learning models to serve as valuable screening tools, thereby enhancing the interpretation of head CT scans for patients with brain hemorrhages.
Databáze: MEDLINE
Nepřihlášeným uživatelům se plný text nezobrazuje