Evaluating ChatGPT's Capabilities on Orthopedic Training Examinations: An Analysis of New Image Processing Features.

Authors: Posner KM; Department of Orthopedic Surgery, Hackensack Meridian School of Medicine, Nutley, USA., Bakus C; Department of Orthopedic Surgery, Hackensack Meridian School of Medicine, Nutley, USA., Basralian G; Department of Orthopedic Surgery, Hackensack Meridian School of Medicine, Nutley, USA., Chester G; Department of Orthopedic Surgery, Hackensack Meridian School of Medicine, Nutley, USA., Zeiman M; Department of Orthopedic Surgery, Hackensack Meridian School of Medicine, Nutley, USA., O'Malley GR; Department of Orthopedic Surgery, Hackensack University Medical Center, Hackensack, USA., Klein GR; Department of Orthopedic Surgery, Hackensack University Medical Center, Hackensack, USA.
Language: English
Source: Cureus [Cureus] 2024 Mar 11; Vol. 16 (3), pp. e55945. Date of Electronic Publication: 2024 Mar 11 (Print Publication: 2024).
DOI: 10.7759/cureus.55945
Abstract: Introduction: The efficacy of integrating artificial intelligence (AI) models like ChatGPT into the medical field, specifically orthopedic surgery, has yet to be fully determined. The most recent adaptation of ChatGPT that has yet to be explored is its image analysis capabilities. This study assesses ChatGPT's performance in answering Orthopedic In-Training Examination (OITE) questions, including those that require image analysis. Methods: Questions from the 2014, 2015, 2021, and 2022 AAOS OITE were screened for inclusion. All questions without images were entered into ChatGPT 3.5 and 4.0 twice. Questions that necessitated the use of images were only entered into ChatGPT 4.0 twice, as this is the only version of the system that can analyze images. The responses were recorded and compared to AAOS's correct answers, evaluating the AI's accuracy and precision. Results: A total of 940 questions were included in the final analysis (457 questions with images and 483 questions without images). ChatGPT 4.0 performed significantly better on questions that did not require image analysis (67.81% vs 47.59%, p<0.001). Discussion: While the use of AI in orthopedics is an intriguing possibility, this evaluation demonstrates how, even with the addition of image processing capabilities, ChatGPT still falls short in terms of its accuracy. As AI technology evolves, ongoing research is vital to harness AI's potential effectively, ensuring it complements rather than attempts to replace the nuanced skills of orthopedic surgeons.
Competing Interests: The authors have declared that no competing interests exist.
(Copyright © 2024, Posner et al.)
Database: MEDLINE