Deep learning detection of diabetic retinopathy in Scotland's diabetic eye screening programme.

Autor: Fleming AD; The Institute of Genetics and Cancer, University of Edinburgh Western General Hospital, Edinburgh, UK., Mellor J; Usher Institute, The University of Edinburgh, Edinburgh, UK joe.mellor@ed.ac.uk., McGurnaghan SJ; The Institute of Genetics and Cancer, University of Edinburgh Western General Hospital, Edinburgh, UK., Blackbourn LAK; The Institute of Genetics and Cancer, University of Edinburgh Western General Hospital, Edinburgh, UK., Goatman KA; King's College, Aberdeen, UK., Styles C; Queen Margaret Hospital, NHS Fife, Dunfermline, Fife, UK., Storkey AJ; School of Informatics, The University of Edinburgh, Edinburgh, UK., McKeigue PM; Usher Institute, The University of Edinburgh, Edinburgh, UK., Colhoun HM; The Institute of Genetics and Cancer, University of Edinburgh Western General Hospital, Edinburgh, UK.
Jazyk: angličtina
Zdroj: The British journal of ophthalmology [Br J Ophthalmol] 2024 Jun 20; Vol. 108 (7), pp. 984-988. Date of Electronic Publication: 2024 Jun 20.
DOI: 10.1136/bjo-2023-323395
Abstrakt: Background/aims: Support vector machine-based automated grading (known as iGradingM) has been shown to be safe, cost-effective and robust in the diabetic retinopathy (DR) screening (DES) programme in Scotland. It triages screening episodes as gradable with no DR versus manual grading required. The study aim was to develop a deep learning-based autograder using images and gradings from DES and to compare its performance with that of iGradingM.
Methods: Retinal images, quality assurance (QA) data and routine DR grades were obtained from national datasets in 179 944 patients for years 2006-2016. QA grades were available for 744 images. We developed a deep learning-based algorithm to detect whether either eye contained ungradable images or any DR. The sensitivity and specificity were evaluated against consensus QA grades and routine grades.
Results: Images used in QA which were ungradable or with DR were detected by deep learning with better specificity compared with manual graders (p<0.001) and with iGradingM (p<0.001) at the same sensitivities. Any DR according to the DES final grade was detected with 89.19% (270 392/303 154) sensitivity and 77.41% (500 945/647 158) specificity. Observable disease and referable disease were detected with sensitivities of 96.58% (16 613/17 201) and 98.48% (22 600/22 948), respectively. Overall, 43.84% of screening episodes would require manual grading.
Conclusion: A deep learning-based system for DR grading was evaluated in QA data and images from 11 years in 50% of people attending a national DR screening programme. The system could reduce the manual grading workload at the same sensitivity compared with the current automated grading system.
Competing Interests: Competing interests: HMC is the principal investigator on the JDRF UK grant. The employment of ADF and JM was with this funding. HMC and PMM have declared stock options in Bayer AG and Roche Pharmaceuticals. HMC has received grants from AstraZeneca LP, Regeneron, Pfizer, Novo Nordisk and Eli Lilly and Company, and is on the advisory panels or boards of Novo Nordisk, Eli Lilly and Company, Regeneron, Novartis Pharmaceuticals, Bayer AG and Sanofi-Aventis. HMC has received payments for speakers bureaus and honoraria from Eli Lilly and Company, Regeneron and Novartis Pharmaceuticals.
(© Author(s) (or their employer(s)) 2024. No commercial re-use. See rights and permissions. Published by BMJ.)
Databáze: MEDLINE