Machine learning automatically detects COVID-19 using chest CTs in a large multicenter cohort

Autor: Youngjin Yoo, Alexander W. Sauter, Pina C. Sanelli, Philippe Grenier, Sasa Grbic, Savvas Nicolaou, Eduardo J. Mortani Barbosa, Dorin Comaniciu, Nakul Gupta, Bogdan Georgescu, François Mellot, Thomas Re, Jordi Broncano Cabrero, Valentin Ziebandt, Thomas Flohr, Gorka Bastarrika Alemañ, Guillaume Chabin, Shikha Chaganti
Rok vydání: 2021
Předmět:
FOS: Computer and information sciences
Computer Science - Machine Learning
medicine.medical_specialty
Viral pneumonia
Computer Vision and Pattern Recognition (cs.CV)
Computer Science - Computer Vision and Pattern Recognition
Logistic regression
Machine learning
computer.software_genre
Machine Learning (cs.LG)
030218 nuclear medicine & medical imaging
Machine Learning
03 medical and health sciences
0302 clinical medicine
Text mining
Discriminative model
FOS: Electrical engineering
electronic engineering
information engineering

medicine
Humans
Radiology
Nuclear Medicine and imaging

Tomography
Retrospective Studies
Neuroradiology
SARS-CoV-2
business.industry
Image and Video Processing (eess.IV)
COVID-19
Deep learning
General Medicine
Thorax
Electrical Engineering and Systems Science - Image and Video Processing
Classification
medicine.disease
3. Good health
Random forest
Pneumonia
Imaging Informatics and Artificial Intelligence
Feature (computer vision)
030220 oncology & carcinogenesis
Artificial intelligence
Radiology
business
computer
Zdroj: European Radiology
ISSN: 1432-1084
0938-7994
Popis: Objectives To investigate machine learning classifiers and interpretable models using chest CT for detection of COVID-19 and differentiation from other pneumonias, interstitial lung disease (ILD) and normal CTs. Methods Our retrospective multi-institutional study obtained 2446 chest CTs from 16 institutions (including 1161 COVID-19 patients). Training/validation/testing cohorts included 1011/50/100 COVID-19, 388/16/33 ILD, 189/16/33 other pneumonias, and 559/17/34 normal (no pathologies) CTs. A metric-based approach for the classification of COVID-19 used interpretable features, relying on logistic regression and random forests. A deep learning–based classifier differentiated COVID-19 via 3D features extracted directly from CT attenuation and probability distribution of airspace opacities. Results Most discriminative features of COVID-19 are the percentage of airspace opacity and peripheral and basal predominant opacities, concordant with the typical characterization of COVID-19 in the literature. Unsupervised hierarchical clustering compares feature distribution across COVID-19 and control cohorts. The metrics-based classifier achieved AUC = 0.83, sensitivity = 0.74, and specificity = 0.79 versus respectively 0.93, 0.90, and 0.83 for the DL-based classifier. Most of ambiguity comes from non-COVID-19 pneumonia with manifestations that overlap with COVID-19, as well as mild COVID-19 cases. Non-COVID-19 classification performance is 91% for ILD, 64% for other pneumonias, and 94% for no pathologies, which demonstrates the robustness of our method against different compositions of control groups. Conclusions Our new method accurately discriminates COVID-19 from other types of pneumonia, ILD, and CTs with no pathologies, using quantitative imaging features derived from chest CT, while balancing interpretability of results and classification performance and, therefore, may be useful to facilitate diagnosis of COVID-19. Key Points • Unsupervised clustering reveals the key tomographic features including percent airspace opacity and peripheral and basal opacities most typical of COVID-19 relative to control groups. • COVID-19-positive CTs were compared with COVID-19-negative chest CTs (including a balanced distribution of non-COVID-19 pneumonia, ILD, and no pathologies). Classification accuracies for COVID-19, pneumonia, ILD, and CT scans with no pathologies are respectively 90%, 64%, 91%, and 94%. • Our deep learning (DL)–based classification method demonstrates an AUC of 0.93 (sensitivity 90%, specificity 83%). Machine learning methods applied to quantitative chest CT metrics can therefore improve diagnostic accuracy in suspected COVID-19, particularly in resource-constrained environments. Supplementary Information The online version contains supplementary material available at 10.1007/s00330-021-07937-3.
Databáze: OpenAIRE