Systematic investigation into generalization of COVID-19 CT deep learning models with Gabor ensemble for lung involvement scoring
Autor: | Michael J Horry, Subrata Chakraborty, Biswajeet Pradhan, maryam fallahpoor, Chegeni Hossein, Manoranjan Paul |
---|---|
Jazyk: | angličtina |
Rok vydání: | 2021 |
Předmět: |
FOS: Computer and information sciences
Computer Science - Machine Learning engrXiv|Engineering|Biomedical Engineering and Bioengineering bepress|Engineering engrXiv|Engineering|Computer Engineering engrXiv|Engineering|Computer Engineering|Other Computer Engineering Computer Vision and Pattern Recognition (cs.CV) Image and Video Processing (eess.IV) Computer Science - Computer Vision and Pattern Recognition bepress|Engineering|Biomedical Engineering and Bioengineering Electrical Engineering and Systems Science - Image and Video Processing Machine Learning (cs.LG) bepress|Engineering|Computer Engineering|Other Computer Engineering engrXiv|Engineering FOS: Electrical engineering electronic engineering information engineering bepress|Engineering|Computer Engineering |
Popis: | The COVID-19 pandemic has inspired unprecedented data collection and computer vision modelling efforts worldwide, focusing on diagnosis and stratification of COVID-19 from medical images. Despite this large-scale research effort, these models have found limited practical application due in part to unproven generalization of these models beyond their source study. This study investigates the generalizability of key published models using the publicly available COVID-19 Computed Tomography data through cross dataset validation. We then assess the predictive ability of these models for COVID-19 severity using an independent new dataset that is stratified for COVID-19 lung involvement. Each inter-dataset study is performed using histogram equalization, and contrast limited adaptive histogram equalization with and without a learning Gabor filter. The study shows high variability in the generalization of models trained on these datasets due to varied sample image provenances and acquisition processes amongst other factors. We show that under certain conditions, an internally consistent dataset can generalize well to an external dataset despite structural differences between these datasets with f1 scores up to 86%. Our best performing model shows high predictive accuracy for lung involvement score for an independent dataset for which expertly labelled lung involvement stratification is available. Creating an ensemble of our best model for disease positive prediction with our best model for disease negative prediction using a min-max function resulted in a superior model for lung involvement prediction with average predictive accuracy of 75% for zero lung involvement and 96% for 75-100% lung involvement with almost linear relationship between these stratifications. 39 Pages, 8 figures, 14 tables comparing the generalization of COVID-19 CT Deep Learning Models |
Databáze: | OpenAIRE |
Externí odkaz: |