Efficient and Reliable Data Extraction in Radiation Oncology using Python Programming Language: A Pilot Study.

Autor: Chauhan RS; Department of Physics, GLA University, Mathura, Uttar Pradesh, India.; Department of Radiation Oncology, Manipal Hospitals, Dwarka, New Delhi, India., Pradhan A; Centre for Cosmology, Astrophysics and Space Science, GLA University, Mathura, Uttar Pradesh, India., Munshi A; Department of Radiation Oncology, Manipal Hospitals, Dwarka, New Delhi, India., Mohanti BK; KIMS Cancer Centre, Kalinga Institute of Medical Sciences, KIIT University, Bhubaneswar, Odisha, India.
Jazyk: angličtina
Zdroj: Journal of medical physics [J Med Phys] 2023 Jan-Mar; Vol. 48 (1), pp. 13-18. Date of Electronic Publication: 2023 Apr 18.
DOI: 10.4103/jmp.jmp_12_23
Abstrakt: Background and Purpose: In recent years, data science approaches have entered health-care systems such as radiology, pathology, and radiation oncology. In our pilot study, we developed an automated data mining approach to extract data from a treatment planning system (TPS) with high speed, maximum accuracy, and little human interaction. We compared the amount of time required for manual data extraction versus the automated data mining technique.
Materials and Methods: A Python programming script was created to extract specified parameters and features pertaining to patients and treatment (a total of 25 features) from TPS. We successfully implemented automation in data mining, utilizing the application programming interface environment provided by the external beam radiation therapy equipment provider for the whole group of patients who were accepted for treatment.
Results: This in-house Python-based script extracted selected features for 427 patients in 0.28 ± 0.03 min with 100% accuracy at an astonishing rate of 0.04 s/plan. Comparatively, manual extraction of 25 parameters took an average of 4.5 ± 0.33 min/plan, along with associated transcriptional and transpositional errors and missing data information. This new approach turned out to be 6850 times faster than the conventional approach. Manual feature extraction time increased by a factor of nearly 2.5 if we doubled the number of features extracted, whereas for the Python script, it increased by a factor of just 1.15.
Conclusion: We conclude that our in-house developed Python script can extract plan data from TPS at a far higher speed (>6000 times) and with the best possible accuracy compared to manual data extraction.
Competing Interests: There are no conflicts of interest.
(Copyright: © 2023 Journal of Medical Physics.)
Databáze: MEDLINE