Algorithmic transparency and interpretability measures improve radiologists' performance in BI-RADS 4 classification.
Autor: | Jungmann F; Institute of Diagnostic and Interventional Radiology, School of Medicine, Technical University of Munich, 81675, Munich, Germany., Ziegelmayer S; Institute of Diagnostic and Interventional Radiology, School of Medicine, Technical University of Munich, 81675, Munich, Germany., Lohoefer FK; Institute of Diagnostic and Interventional Radiology, School of Medicine, Technical University of Munich, 81675, Munich, Germany., Metz S; Institute of Diagnostic and Interventional Radiology, School of Medicine, Technical University of Munich, 81675, Munich, Germany., Müller-Leisse C; Institute of Diagnostic and Interventional Radiology, School of Medicine, Technical University of Munich, 81675, Munich, Germany., Englmaier M; Institute of Diagnostic and Interventional Radiology, School of Medicine, Technical University of Munich, 81675, Munich, Germany., Makowski MR; Institute of Diagnostic and Interventional Radiology, School of Medicine, Technical University of Munich, 81675, Munich, Germany., Kaissis GA; Institute of Diagnostic and Interventional Radiology, School of Medicine, Technical University of Munich, 81675, Munich, Germany.; Department of Computing, Faculty of Engineering, Imperial College of Science, Technology and Medicine, London, SW7 2AZ, UK.; Institute for Artificial Intelligence in Medicine and Healthcare, School of Medicine and Faculty of Informatics, Technical University of Munich, 81675, Munich, Germany., Braren RF; Institute of Diagnostic and Interventional Radiology, School of Medicine, Technical University of Munich, 81675, Munich, Germany. rbraren@tum.de.; German Cancer Consortium (DKTK) Partner Site Munich, 69120, Heidelberg, Germany. rbraren@tum.de. |
---|---|
Jazyk: | angličtina |
Zdroj: | European radiology [Eur Radiol] 2023 Mar; Vol. 33 (3), pp. 1844-1851. Date of Electronic Publication: 2022 Oct 25. |
DOI: | 10.1007/s00330-022-09165-9 |
Abstrakt: | Objective: To evaluate the perception of different types of AI-based assistance and the interaction of radiologists with the algorithm's predictions and certainty measures. Methods: In this retrospective observer study, four radiologists were asked to classify Breast Imaging-Reporting and Data System 4 (BI-RADS4) lesions (n = 101 benign, n = 99 malignant). The effect of different types of AI-based assistance (occlusion-based interpretability map, classification, and certainty) on the radiologists' performance (sensitivity, specificity, questionnaire) were measured. The influence of the Big Five personality traits was analyzed using the Pearson correlation. Results: Diagnostic accuracy was significantly improved by AI-based assistance (an increase of 2.8% ± 2.3%, 95 %-CI 1.5 to 4.0 %, p = 0.045) and trust in the algorithm was generated primarily by the certainty of the prediction (100% of participants). Different human-AI interactions were observed ranging from nearly no interaction to humanization of the algorithm. High scores in neuroticism were correlated with higher persuasibility (Pearson's r = 0.98, p = 0.02), while higher consciousness and change of accuracy showed an inverse correlation (Pearson's r = -0.96, p = 0.04). Conclusion: Trust in the algorithm's performance was mostly dependent on the certainty of the predictions in combination with a plausible heatmap. Human-AI interaction varied widely and was influenced by personality traits. Key Points: • AI-based assistance significantly improved the diagnostic accuracy of radiologists in classifying BI-RADS 4 mammography lesions. • Trust in the algorithm's performance was mostly dependent on the certainty of the prediction in combination with a reasonable heatmap. • Personality traits seem to influence human-AI collaboration. Radiologists with specific personality traits were more likely to change their classification according to the algorithm's prediction than others. (© 2022. The Author(s).) |
Databáze: | MEDLINE |
Externí odkaz: |