Performance of AI to exclude normal chest radiographs to reduce radiologists' workload.

Autor: Schalekamp S; Department of Imaging, Radboudumc, Nijmegen, The Netherlands. steven.schalekamp@radboudumc.nl., van Leeuwen K; Department of Imaging, Radboudumc, Nijmegen, The Netherlands., Calli E; Department of Imaging, Radboudumc, Nijmegen, The Netherlands., Murphy K; Department of Imaging, Radboudumc, Nijmegen, The Netherlands., Rutten M; Department of Imaging, Radboudumc, Nijmegen, The Netherlands.; Department of Radiology, Jeroen Bosch Ziekenhuis, 's Hertogenbosch, The Netherlands., Geurts B; Department of Imaging, Radboudumc, Nijmegen, The Netherlands., Peters-Bax L; Department of Imaging, Radboudumc, Nijmegen, The Netherlands., van Ginneken B; Department of Imaging, Radboudumc, Nijmegen, The Netherlands., Prokop M; Department of Imaging, Radboudumc, Nijmegen, The Netherlands.
Jazyk: angličtina
Zdroj: European radiology [Eur Radiol] 2024 Nov; Vol. 34 (11), pp. 7255-7263. Date of Electronic Publication: 2024 May 17.
DOI: 10.1007/s00330-024-10794-5
Abstrakt: Introduction: This study investigates the performance of a commercially available artificial intelligence (AI) system to identify normal chest radiographs and its potential to reduce radiologist workload.
Methods: Retrospective analysis included consecutive chest radiographs from two medical centers between Oct 1, 2016 and Oct 14, 2016. Exclusions comprised follow-up exams within the inclusion period, bedside radiographs, incomplete images, imported radiographs, and pediatric radiographs. Three chest radiologists categorized findings into normal, clinically irrelevant, clinically relevant, urgent, and critical. A commercial AI system processed all radiographs, scoring 10 chest abnormalities on a 0-100 confidence scale. AI system performance was evaluated using the area under the ROC curve (AUC), assessing the detection of normal radiographs. Sensitivity was calculated for the default and a conservative operating point. the detection of negative predictive value (NPV) for urgent and critical findings, as well as the potential workload reduction, was calculated.
Results: A total of 2603 radiographs were acquired in 2141 unique patients. Post-exclusion, 1670 radiographs were analyzed. Categories included 479 normal, 332 clinically irrelevant, 339 clinically relevant, 501 urgent, and 19 critical findings. The AI system achieved an AUC of 0.92. Sensitivity for normal radiographs was 92% at default and 53% at the conservative operating point. At the conservative operating point, NPV was 98% for urgent and critical findings, and could result in a 15% workload reduction.
Conclusion: A commercially available AI system effectively identifies normal chest radiographs and holds the potential to lessen radiologists' workload by omitting half of the normal exams from reporting.
Clinical Relevance Statement: The AI system is able to detect half of all normal chest radiographs at a clinically acceptable operating point, thereby potentially reducing the workload for the radiologists by 15%.
Key Points: The AI system reached an AUC of 0.92 for the detection of normal chest radiographs. Fifty-three percent of normal chest radiographs were identified with a NPV of 98% for urgent findings. AI can reduce the workload of chest radiography reporting by 15%.
(© 2024. The Author(s).)
Databáze: MEDLINE