Improved Performance of ChatGPT-4 on the OKAP Examination: A Comparative Study with ChatGPT-3.5.

Authors: Teebagy S; Department of Ophthalmology and Visual Sciences, UMass Chan Medical School, Worcester, Massachusetts., Colwell L; Department of Ophthalmology and Visual Sciences, UMass Chan Medical School, Worcester, Massachusetts., Wood E; Department of Ophthalmology and Visual Sciences, UMass Chan Medical School, Worcester, Massachusetts., Yaghy A; Department of Ophthalmology and Visual Sciences, UMass Chan Medical School, Worcester, Massachusetts., Faustina M; Department of Ophthalmology and Visual Sciences, UMass Chan Medical School, Worcester, Massachusetts.
Language: English
Source: Journal of academic ophthalmology (2017) [J Acad Ophthalmol (2017)] 2023 Sep 11; Vol. 15 (2), pp. e184-e187. Date of Electronic Publication: 2023 Sep 11 (Print Publication: 2023).
DOI: 10.1055/s-0043-1774399
Abstract: Introduction: This study aims to evaluate the performance of ChatGPT-4, an advanced artificial intelligence (AI) language model, on the Ophthalmology Knowledge Assessment Program (OKAP) examination compared to its predecessor, ChatGPT-3.5. Methods: Both models were tested on 180 OKAP practice questions covering various ophthalmology subject categories. Results: ChatGPT-4 significantly outperformed ChatGPT-3.5 (81% vs. 57%; p < 0.001), indicating improvements in medical knowledge assessment. Discussion: The superior performance of ChatGPT-4 suggests potential applicability in ophthalmologic education and clinical decision support systems. Future research should focus on refining AI models, ensuring a balanced representation of fundamental and specialized knowledge, and determining the optimal method of integrating AI into medical education and practice.
Competing Interests: None declared.
(The Author(s). This is an open access article published by Thieme under the terms of the Creative Commons Attribution-NonDerivative-NonCommercial License, permitting copying and reproduction so long as the original work is given appropriate credit. Contents may not be used for commercial purposes, or adapted, remixed, transformed or built upon. (https://creativecommons.org/licenses/by-nc-nd/4.0/).)
Database: MEDLINE